Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Benjy
's Collections
Music AI
Agentic
Image-to-Video
Multimodal
Image-to-Image
Image-to-Text
Speech Recognition
Text-to-Video
OCR
Image Models
Leading Research
Coding LLMs
Text-to-Image
Small LLMs
Leading LLMs
Agentic
updated
9 days ago
Upvote
-
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
Updated
9 days ago
•
30.1k
•
221
Upvote
-
Share collection
View history
Collection guide
Browse collections