MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published 12 days ago • 76
Smoothie Qwen3 Collection For more details, please visit https://github.com/dnotitia/smoothie-qwen • 8 items • Updated 16 days ago • 5
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 25 days ago • 43
TxGemma Release Collection Collection of open models to accelerate the development of therapeutics. • 5 items • Updated Apr 3 • 57
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Feb 20 • 56
Gemma-APS Release Collection Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated Apr 3 • 21
Gemma 2 JPN Release Collection A Gemma 2 2B model fine-tuned on Japanese text. It supports the Japanese language the same level of performance of EN only queries on Gemma 2. • 3 items • Updated Apr 3 • 28
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 24 days ago • 303
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Apr 3 • 85
Reranking Model Collection A collection of Korean-specific reranking models • 2 items • Updated Aug 16, 2024 • 3