Images to Text - a ijohn07 Collection

ijohn07 's Collections

LoRA

Text to images NSFW

Justines's Llamafiles

Images to Text

updated Jan 18

Running

432

432

moondream2

🌔

a tiny vision language model
Runtime error

124

124

nanoLLaVA-1.5

🚀

Chat about images by uploading them
Running

36

36

Candle Moondream 2

🕯

MoonDream 2 Vision Model on the Browser: Candle/Rust/WASM
Running

145

145

Idefics 8b

🐠

Generate text from images and prompts
Running on CPU Upgrade

1.97k

1.97k

Stable Diffusion XL on TPUv5e

🏋

Generate images from text prompts with various styles
Running on Zero

88

88

Llava Llama-3 8B

🔥

Meta Llama3 8b with Llava Multimodal capabilities
Running

81

81

Paligemma HF

🤗

Generate text and segment images using PaliGemma
Running on Zero

149

149

Llava Next

🔥

Answer questions about images by chatting
Running on Zero

215

215

Microsoft Phi-3-Vision-128k

😻

Generate image descriptions
Running on Zero

46

46

Microsoft Phi-3 Vision 128k

🔥

Microsoft Phi-3 Vision 128k with Multimodal capabilities
Running on Zero

51

51

Contemplative moondream

🌜

let's talk about the meaning of life
Running

3

3

Gradio Lite

🖼

Convert images to grayscale
Running on Zero

759

759

Florence 2

📉

Analyze images to generate captions, detect objects, or perform OCR
Running on Zero

256

256

SD3 Long Captioner

🏃

Generate detailed captions for images
Running

35

35

Florence 2 SD3 Captioner

⚡

Generate detailed captions from images
Running on Zero

196

196

Better Florence 2

🔥

Interact with Florence-2 to analyze images and generate descriptions
Running

18

18

LLaVA WebGPU

🌋

A private and powerful multimodal AI chatbot that runs local
Running on Zero

89

89

AuraFlow-v0.3 with Captioner

🖼

Create images from descriptions or images
Running

102

102

Idefics3

📊

Generate text based on an image and prompt
Running on Zero

30

30

Phi 3.5 Vision

👁

Ask questions about images
Runtime error

220

220

Phi 3.5 Vision

🔥

Generate text from an image and question
Running on Zero

176

176

Tonic's GOT OCR

📲

GOT - OCR (from : UCAS, Beijing)
Runtime error

391

391

Llama-Vision-11B

🚀

Ask questions about images and get detailed answers
Running on Zero

213

213

JanusFlow 1.3B

🏃

Huggingface space for JanusFlow-1.3B
Running on Zero

134

134

SmolVLM

📊

Generate text responses using images and text prompts
Sleeping

1

1

SD3 Long Captioner

🏃

Generate captions for images