Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
euclaise 's Collections
MQA
SuperMC
Small-ish SoTA (<5B), (quasi-)base
Interesting smol pretraining expirements

Interesting smol pretraining expirements

updated Sep 10, 2024
Upvote
-

  • UUFO-Aigis/Pico-OpenLAiNN-250M

    0.3B • Updated Feb 24 • 73 • 3

  • crumb/distilpythia

    Text Generation • 0.1B • Updated Jul 20, 2023 • 24 • 4

  • crumb/GLORT2

    Text Generation • 0.2B • Updated Aug 26, 2024 • 28

  • pszemraj/jamba-900M-v0.13-KIx2

    Text Generation • 0.9B • Updated May 18, 2024 • 13 • 4

  • pszemraj/mega-ar-350m-v0.13

    Text Generation • 0.3B • Updated May 15, 2024 • 2.31k

  • BEE-spoke-data/smol_llama-220M-GQA

    Text Generation • 0.2B • Updated Jun 28, 2024 • 3.48k • 12

  • pszemraj/stablelm-4e1t-2b-v0.1

    Text Generation • 2B • Updated May 20, 2024 • 46

  • Locutusque/TinyMistral-248M-v2

    Text Generation • 0.2B • Updated Jan 8, 2024 • 770 • 17

  • upstage/TinySolar-248m-4k

    Text Generation • 0.2B • Updated Feb 7, 2024 • 2.37k • 7

  • appvoid/arco

    0.5B • Updated Dec 5, 2024 • 17 • 13
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs