WyattTheSkid's Collections

Big pretraining

updated Mar 13

  • HuggingFaceTB/finemath

    Viewer • Updated Feb 6 • 48.3M • 20k • 314

  • bigcode/the-stack-v2-dedup

    Viewer • Updated Apr 23, 2024 • 2.3B • 2.31k • 96

  • Zyphra/Zyda-2

    Viewer • Updated Dec 12, 2024 • 1.62B • 87.7k • 82

  • LLM360/TxT360

    Updated 13 days ago • 26.1k • 236

  • bigcode/the-stack-v2

    Viewer • Updated Apr 23, 2024 • 5.45B • 2.3k • 377

  • bigcode/starcoderdata

    Viewer • Updated May 16, 2023 • 207M • 4.37k • 440

  • HuggingFaceFW/fineweb

    Viewer • Updated Jan 31 • 25B • 387k • 2.19k

  • HuggingFaceFW/fineweb-2

    Viewer • Updated Jan 8 • 12.5B • 46.5k • 489

  • hieuhocnlp/deep-research

    Updated Apr 3, 2023 • 25

  • GeneralReasoning/GeneralThought-195K

    Viewer • Updated Mar 10 • 195k • 169 • 69