Multimodal GGUFs
Vision and audio models compatible with llama-server and llama-mtmd-cli

Gemma 3
Collection • 4 items • Updated 18 days ago • 15

ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF
Image-Text-to-Text • Updated about 1 month ago • 1.07k • 4

InternVL 3 and InternVL 2.5
Collection • 10 items • Updated 18 days ago

Qwen 2 VL and Qwen 2.5 VL
Collection • 4 items • Updated 18 days ago
VAD
Voice Activity Detection (VAD) models for whisper.cpp.

ggml-org/whisper-vad
Updated 19 days ago • 1