Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
euclaise 's Collections
MQA
SuperMC
Small-ish SoTA (<5B), (quasi-)base
Interesting smol pretraining expirements

Small-ish SoTA (<5B), (quasi-)base

updated Aug 10, 2024
Upvote
1

  • nvidia/Minitron-4B-Base

    Text Generation • Updated Feb 14 • 2.5k • 134

  • h2oai/h2o-danube3-4b-base

    Text Generation • 4B • Updated Jul 15, 2024 • 1.83k • 21

  • stabilityai/stablelm-3b-4e1t

    Text Generation • 3B • Updated Mar 7, 2024 • 13.4k • 312

  • Qwen/Qwen2-1.5B

    Text Generation • 2B • Updated Jun 6, 2024 • 91.3k • • 94

  • internlm/internlm2_5-1_8b-chat

    Text Generation • 2B • Updated Mar 13 • 2.98k • 25

  • Qwen/Qwen1.5-4B

    Text Generation • 4B • Updated Apr 5, 2024 • 15.5k • 35

  • tensoropera/Fox-1-1.6B

    Text Generation • 2B • Updated Nov 21, 2024 • 1.74k • 33

  • TRI-ML/DCLM-1B

    1B • Updated Jul 25, 2024 • 72 • 13
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs