moondream2
a tiny vision language model
a tiny vision language model
Chat about images with AI
MoonDream 2 Vision Model on the Browser: Candle/Rust/WASM
Generate text from images and prompts
Generate images from text prompts with various styles
Meta Llama3 8b with Llava Multimodal capabilities
Generate text and segment images using PaliGemma
Answer questions about images by chatting
Generate image descriptions
Microsoft Phi-3 Vision 128k with Multimodal capabilities
let's talk about the meaning of life
Convert images to grayscale
Analyze images to generate captions, detect objects, or perform OCR
Generate detailed captions for images
Generate detailed captions from images
Interact with Florence-2 to analyze images and generate descriptions
A private and powerful multimodal AI chatbot that runs local
Create images from descriptions or images
Generate text based on an image and prompt
Engage in multi-modal conversations with images and videos
Ask questions about images
Generate text from an image and question
GOT - OCR (from : UCAS, Beijing)
Chat about images by uploading them and typing questions
Huggingface space for JanusFlow-1.3B
Generate text responses using images and text prompts
Generate captions for images