amang1802's Collections

Small model pretraining experiments

updated Feb 9

  • amang1802/llama_162M_fineweb100BT

    Text Generation • Updated Dec 24, 2024 • 5

  • amang1802/llama_162M_fineweb10BT

    Text Generation • Updated Dec 22, 2024 • 5