Pleias-RAG Collection New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated 2 days ago • 21
SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users Paper • 2504.10157 • Published 12 days ago • 16
C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing Paper • 2504.07964 • Published 16 days ago • 61
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated 8 days ago • 170
RLVR Collection Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated 26 days ago • 11
Hamanasu Collection A brand new series of Models from yours truly, Designed for Intelligence, Creativity and Roleplay - R/Locallama keeps DELETING MY GODDAMN COMMENTS • 31 items • Updated 19 days ago • 8
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond Paper • 2503.10460 • Published Mar 13 • 28
DeepHermes Collection Preview models of hybrid reasoner Hermes series • 6 items • Updated Mar 13 • 30
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 • 400
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 78