Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
euclaise
's Collections
MQA
SuperMC
Small-ish SoTA (<5B), (quasi-)base
Interesting smol pretraining expirements
Small-ish SoTA (<5B), (quasi-)base
updated
Aug 10, 2024
Upvote
1
nvidia/Minitron-4B-Base
Text Generation
•
Updated
Feb 14
•
662
•
134
h2oai/h2o-danube3-4b-base
Text Generation
•
Updated
Jul 15, 2024
•
285
•
21
stabilityai/stablelm-3b-4e1t
Text Generation
•
Updated
Mar 7, 2024
•
9.04k
•
310
Qwen/Qwen2-1.5B
Text Generation
•
Updated
Jun 6, 2024
•
184k
•
91
internlm/internlm2_5-1_8b-chat
Text Generation
•
Updated
Mar 13
•
1.44k
•
25
Qwen/Qwen1.5-4B
Text Generation
•
Updated
Apr 5, 2024
•
4.01k
•
36
tensoropera/Fox-1-1.6B
Text Generation
•
Updated
Nov 21, 2024
•
1.31k
•
32
TRI-ML/DCLM-1B
Updated
Jul 25, 2024
•
53
•
13
Upvote
1
Share collection
View history
Collection guide
Browse collections