Running on Zero 198 198 Better Florence 2 🔥 Interact with Florence-2 to analyze images and generate descriptions
Running on Zero 779 779 Florence 2 📉 Analyze images to generate captions, detect objects, or perform OCR
GIT Collection GIT (Generative Image-to-text Transformer) is a model useful for vision-language tasks such as image/video captioning and question answering. • 18 items • Updated May 1 • 13
view article Article SmolVLM2: Bringing Video Understanding to Every Device By orrzohar and 6 others • Feb 20 • 283