Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
euclaise
's Collections
MQA
SuperMC
Small-ish SoTA (<5B), (quasi-)base
Interesting smol pretraining expirements
Small-ish SoTA (<5B), (quasi-)base
updated
Aug 10, 2024
Upvote
1
nvidia/Minitron-4B-Base
Text Generation
•
Updated
Feb 14
•
2.5k
•
134
h2oai/h2o-danube3-4b-base
Text Generation
•
4B
•
Updated
Jul 15, 2024
•
1.83k
•
21
stabilityai/stablelm-3b-4e1t
Text Generation
•
3B
•
Updated
Mar 7, 2024
•
13.4k
•
312
Qwen/Qwen2-1.5B
Text Generation
•
2B
•
Updated
Jun 6, 2024
•
91.3k
•
•
94
internlm/internlm2_5-1_8b-chat
Text Generation
•
2B
•
Updated
Mar 13
•
2.98k
•
25
Qwen/Qwen1.5-4B
Text Generation
•
4B
•
Updated
Apr 5, 2024
•
15.5k
•
35
tensoropera/Fox-1-1.6B
Text Generation
•
2B
•
Updated
Nov 21, 2024
•
1.74k
•
33
TRI-ML/DCLM-1B
1B
•
Updated
Jul 25, 2024
•
72
•
13
Upvote
1
Share collection
View history
Collection guide
Browse collections