Article: SigLIP 2: A better multilingual vision language encoder • By ariG23498 and 2 others • Feb 21 • 174
Article: Llama can now see and run on your device - welcome Llama 3.2 • By merve and 6 others • Sep 25, 2024 • 190
Article: How to generate text: using different decoding methods for language generation with Transformers • By patrickvonplaten • Mar 1, 2020 • 222
Article: Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models • By andito and 2 others • Jun 24, 2024 • 197
Article: PaliGemma – Google's Cutting-Edge Open Vision Language Model • By merve and 2 others • May 14, 2024 • 260
Collection: Zero-Shot Detection and Segmentation • Demos of projects focused on zero-shot detection and segmentation. • 4 items • Updated Feb 7, 2024 • 3
Collection: OpenAI Vision API • Demos of projects using the OpenAI Vision API. • 3 items • Updated Nov 22, 2023 • 3
Collection: LMMs - Large Multimodal Models • Demos of LMM projects. • 5 items • Updated Apr 24, 2024 • 1