Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
EssentialAI
's Collections
Essential-Web v1.0
Rethinking Reflection in Pre-Training
Essential-Web v1.0
updated
Jun 18
Upvote
8
Essential-Web v1.0: 24T tokens of organized web data
Paper
•
2506.14111
•
Published
Jun 17
•
42
EssentialAI/essential-web-v1.0
Preview
•
Updated
Jun 22
•
38.3k
•
194
EssentialAI/eai-distill-0.5b
0.6B
•
Updated
Jun 18
•
3.56k
•
22
EssentialAI/eai-taxonomy-math-w-fm
Viewer
•
Updated
Jun 22
•
21.6M
•
3.22k
•
6
EssentialAI/eai-taxonomy-code-w-dclm
Viewer
•
Updated
Jun 22
•
274M
•
8.79k
•
7
EssentialAI/eai-taxonomy-code-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
46.2M
•
431
•
2
EssentialAI/eai-taxonomy-med-w-dclm
Viewer
•
Updated
Jun 22
•
81.2M
•
3.45k
•
8
EssentialAI/eai-taxonomy-med-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
36.6M
•
1.64k
•
2
EssentialAI/eai-taxonomy-stem-w-dclm
Preview
•
Updated
Jun 22
•
5.46k
•
5
EssentialAI/eai-taxonomy-stem-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
35.5M
•
2.49k
•
4
Upvote
8
+4
Share collection
View history
Collection guide
Browse collections