Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen • Jan 15 • 185
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 11 days ago • 118
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published 20 days ago • 76
Article I trained a Language Model to schedule events with GRPO! By anakin87 • Apr 29 • 75
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated Apr 10 • 85
Article You could have designed state of the art positional encoding By FL33TW00D-HF • Nov 25, 2024 • 280
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated 13 days ago • 146
👩‍💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 4 items • Updated 19 days ago • 17
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 13 days ago • 114
Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other • Feb 25 • 161
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published Feb 10 • 61