view article Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. By tiiuae and 9 others โข 9 days ago โข 32
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper โข 2505.03335 โข Published 18 days ago โข 159
view article Article LLaMA 4 Fine-Tuning with Mental Health Counseling Data By ImranzamanML โข Apr 14 โข 3
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper โข 2504.10479 โข Published Apr 14 โข 260
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 โข 11 items โข Updated 26 days ago โข 476
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme Paper โข 2504.02587 โข Published Apr 3 โข 30
view article Article Welcome Llama 4 Maverick & Scout on Hugging Face! By burtenshaw and 6 others โข Apr 5 โข 144
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper โข 2411.10440 โข Published Nov 15, 2024 โข 125
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. โข 27 items โข Updated 23 days ago โข 63
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding Paper โข 2503.12797 โข Published Mar 17 โข 30
Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation Paper โข 2503.13070 โข Published Mar 17 โข 9
Gemma 3 Collection All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. โข 50 items โข Updated 22 days ago โข 63
Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment Paper โข 2502.04328 โข Published Feb 6 โข 30
view article Article ๐ชข Langfuse and ๐ค Hugging Face: 5 Ways to use them Together By MJannik โข Mar 14 โข 13
view article Article LeRobot goes to driving school: Worldโs largest open-source self-driving dataset By sandhawalia and 1 other โข Mar 11 โข 80
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others โข Mar 12 โข 421