jrahn/gpt2_350M_edu_hermes

Text Generation
Transformers
Safetensors
English
gpt2
llm.c
text-generation-inference
  • 1 contributor
History: 13 commits
Latest commit: 37d7cbd (verified) by jrahn, "Update README.md", 10 months ago
  • .gitattributes (1.52 kB): initial commit, 10 months ago
  • README.md (4.06 kB): Update README.md, 10 months ago
  • config.json (770 Bytes): Upload model, 10 months ago
  • edu_fineweb_hermes.py (7.5 kB): Upload edu_fineweb_hermes.py with huggingface_hub, 10 months ago
  • generation_config.json (119 Bytes): Upload model, 10 months ago
  • loss_curve.png (92.1 kB): Rename IMG_0051.png to loss_curve.png, 10 months ago
  • main.log (796 kB): Upload main.log with huggingface_hub, 10 months ago
  • merges.txt (456 kB): Upload tokenizer, 10 months ago
  • model.safetensors (710 MB, LFS): Upload model, 10 months ago
  • run_gpt2_350M_edu_hermes.sh (1.41 kB): Upload run_gpt2_350M_edu_hermes.sh with huggingface_hub, 10 months ago
  • special_tokens_map.json (438 Bytes): Upload tokenizer, 10 months ago
  • tokenizer_config.json (514 Bytes): Upload tokenizer, 10 months ago
  • vocab.json (999 kB): Upload tokenizer, 10 months ago