BitNet Collection 🔥BitNet family of large language models (1-bit LLMs). • 6 items • Updated 6 days ago • 28
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 4 items • Updated about 10 hours ago • 91
EXAONE-3.5 Collection EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 10 items • Updated Mar 17 • 112
EXAONE-Deep Collection EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated Mar 18 • 86
Llama Nemotron Collection Open, Production-ready Enterprise Models • 4 items • Updated 1 day ago • 37
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated Mar 19 • 106
Command Models Collection Latest Cohere Labs Command models • 6 items • Updated 9 days ago • 22
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 9 days ago • 68
Sa2VA Model Zoo Collection Huggingace Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research • 4 items • Updated Feb 9 • 35
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated about 12 hours ago • 37
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 27 items • Updated about 12 hours ago • 62
Deepseek V3 (All Versions) Collection Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions. • 6 items • Updated about 12 hours ago • 35
Phi-4 (All Versions) Collection Microsoft's new Phi-4 models including mini in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. • 8 items • Updated about 12 hours ago • 51
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated about 12 hours ago • 219
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 24 days ago • 448
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Feb 26 • 118