Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 3 days ago • 92
RLVR Collection Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated 6 days ago • 10
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 19 days ago • 135
Hamanasu Collection A brand new series of Models from yours truly, Designed for Intelligence, Creativity and Roleplay - R/Locallama keeps DELETING MY GODDAMN COMMENTS • 24 items • Updated 6 days ago • 8
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond Paper • 2503.10460 • Published 24 days ago • 27
DeepHermes Collection Preview models of hybrid reasoner Hermes series • 6 items • Updated 24 days ago • 27
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 25 days ago • 371
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 30 days ago • 76
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Feb 20 • 50