view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 890
view article Article How I contributed a new model to the Transformers library using Codex nielsr • Mar 30 • 51
view article Article SigLIP 2: A better multilingual vision language encoder +1 ariG23498, merve, qubvel-hf • Feb 21, 2025 • 211
view article Article LoRA training scripts of the world, unite! linoyts, multimodalart • Jan 2, 2024 • 79
view article Article Advanced Flux Dreambooth LoRA Training with 🧨 diffusers linoyts • Oct 21, 2024 • 42
Running Featured 561 Vision Arena (Testing VLMs side-by-side) 🖼 561 Explore AI-powered visual tasks in Vision Arena
Running Agents 351 VBench Leaderboard 📊 351 Submit video model evaluation results to a public benchmark
view article Article Using LoRA for Efficient Stable Diffusion Fine-Tuning pcuenq, sayakpaul • Jan 26, 2023 • 82
Running on Zero Agents Featured 169 IDEFICS2 Playground 🐨 169 Chat with a visual AI assistant using text and images
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community +1 Leyo, HugoLaurencon, VictorSanh • Apr 15, 2024 • 191