AI & ML interests

Open Source AI 💚

Recent Activity

unsloth 's collections 24

Unsloth Dynamic 2.0 Quants
New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance.
Qwen3
Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants.
Llama 4
Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth!
Qwen3-Coder
The Qwen3-Coder models deliver SOTA advancements in agentic coding and code tasks. Includes Qwen3-Coder-480B-A35B.
Unsloth 4-bit Dynamic Quants
Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit
Text-to-Speech (TTS) models
A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now!
Mistral Small 3 (All Versions)
A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more!
Qwen QwQ-32B Collection
Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions.
Llama 3.2 Vision
Meta's Llama 3.2 vision models 11B and 90B. Include 4-bit bnb and original versions.
Llama 3.1 Collection
Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions.
Load 4bit models 4x faster
Native bitsandbytes 4bit pre quantized models
DeepSeek R1 (All Versions)
DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.
Gemma 3n
Google Gemma 3n models, all versions including Dynamic GGUF, 4-bit, 16-bit and formats!
Phi-4 (All Versions)
Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes
Gemma 3
All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats.
Deepseek V3 (All Versions)
Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions.
Llama 3.2
Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions.
Qwen2.5-VL (All Versions)
All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more!
Vision/multimodal Models
Collection of the most popular vision models including Llama 3.2, LlaVa, Qwen2 VL, Pixtral, PaliGemma and more!
Qwen 2.5 Coder
Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats.
Unsloth Dynamic 2.0 Quants
New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance.
DeepSeek R1 (All Versions)
DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.
Qwen3
Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants.
Gemma 3n
Google Gemma 3n models, all versions including Dynamic GGUF, 4-bit, 16-bit and formats!
Llama 4
Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth!
Phi-4 (All Versions)
Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes
Qwen3-Coder
The Qwen3-Coder models deliver SOTA advancements in agentic coding and code tasks. Includes Qwen3-Coder-480B-A35B.
Gemma 3
All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats.
Unsloth 4-bit Dynamic Quants
Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit
Deepseek V3 (All Versions)
Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions.
Text-to-Speech (TTS) models
A collection of 4-bit, Dynamic 4-bit and 16-bit voice models including Sesame-CSM, OpenAI's Whisper, Orpheus. Fine-tune them with Unsloth now!
Mistral Small 3 (All Versions)
A collection of Mistral's new Small 3.2 and 3 models including GGUF, 4-bit and more!
Llama 3.2
Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions.
Qwen2.5-VL (All Versions)
All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more!
Qwen QwQ-32B Collection
Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions.
Vision/multimodal Models
Collection of the most popular vision models including Llama 3.2, LlaVa, Qwen2 VL, Pixtral, PaliGemma and more!
Llama 3.2 Vision
Meta's Llama 3.2 vision models 11B and 90B. Include 4-bit bnb and original versions.
Qwen 2.5 Coder
Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats.
Llama 3.1 Collection
Meta's Llama 3.1 models including 8B, 70B, 405B. Includes 4-bit bnb and original versions.
Load 4bit models 4x faster
Native bitsandbytes 4bit pre quantized models