microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated about 2 hours ago • 411k • 1.1k
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 208