view article Article SmolVLM - small yet mighty Vision Language Model By andito and 4 others β’ Nov 26, 2024 β’ 333
view article Article Open-source DeepResearch β Freeing our search agents By m-ric and 4 others β’ Feb 4 β’ 1.27k
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Paper β’ 2501.08326 β’ Published Jan 14 β’ 35
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka β’ Nov 19, 2024 β’ 112
Running 22 22 Hugging Face Values π€ Empower users to use machine learning through an open collaboration platform
view article Article Design choices for Vision Language Models in 2024 By gigant β’ Apr 16, 2024 β’ 29
view article Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x β’ Jun 23, 2024 β’ 93
Running 552 552 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects