PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 12 days ago • 119
BhasaAnuvaad Collection A Speech Translation Dataset for 13 Indian Languages • 11 items • Updated 27 days ago • 12
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 3 days ago • 195
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Paper • 2410.02073 • Published Oct 2 • 41
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 28 days ago • 289
🎯DART-Math Collection Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving [NeurIPS 2024] @ https://github.com/hkust-nlp/dart-math • 20 items • Updated Sep 26 • 6
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 20 days ago • 548
Article ColPali: Efficient Document Retrieval with Vision Language Models By manu • Jul 5 • 182
Parler-TTS: fully open-source high-quality TTS Collection If you want to find out more about how these models were trained and even fine-tune them yourself, check out the Parler-TTS repository on GitHub. • 8 items • Updated 24 days ago • 49
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving Paper • 2309.17452 • Published Sep 29, 2023 • 3
AIMO Progress Prize Collection Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated Jul 19 • 11
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 3 days ago • 204
Distil-Whisper Models Collection The first version of the Distil-Whisper models released with the Distil-Whisper paper. • 4 items • Updated Mar 21 • 36