Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap Jun 19 β’ 11
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published 8 days ago β’ 103
Qwen2-VL Collection Vision-language model series based on Qwen2 β’ 16 items β’ Updated 20 days ago β’ 181
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 23 items β’ Updated 12 days ago β’ 119
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published 21 days ago β’ 118
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog β’ 9 items β’ Updated 28 days ago β’ 50
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 45 items β’ Updated 28 days ago β’ 444
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 15 items β’ Updated 3 days ago β’ 195
view article Article How to optimize your data labelling project with custom interfaces By burtenshaw β’ Oct 16 β’ 18
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper β’ 2409.17146 β’ Published Sep 25 β’ 104
Molmo Collection Artifacts for open multimodal language models. β’ 5 items β’ Updated 28 days ago β’ 289
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 13 items β’ Updated Sep 18 β’ 224
Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for Political Text Paper β’ 2409.02078 β’ Published Sep 3 β’ 9