Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Common Pile

Enterprise
community
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

craffel  authored a paper 3 days ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
conceptofmind  authored a paper 7 days ago
Bridging the Data Provenance Gap Across Text, Speech and Video
conceptofmind  authored a paper 7 days ago
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
View all activity

Articles

Announcing the Common Pile and Comma v0.1

23 days ago
• 15

Colin Raffel's profile picture Alon Albalak's profile picture Sebastian Majstorovic's profile picture Lintang Sutawika's profile picture Enrico Shippole's profile picture Luca Soldaini's profile picture Zhenlin Xu's profile picture Nikhil Kandpal's profile picture Brian Lester's profile picture Baber Abbasi's profile picture Stella Biderman's profile picture Aviya Skowron's profile picture John Kirchenbauer's profile picture

common-pile 's models 3

common-pile/comma-v0.1-2t

7B • Updated 24 days ago • 1.51k • 29

common-pile/comma-v0.1-1t

7B • Updated 24 days ago • 2.9k • 20

common-pile/comma-v0.1-2t-checkpoints

Updated 25 days ago • 4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs