Gemma-4-31B-IT-unsloth-mlx Collection Gemma-4-31B-IT (dense vision-language) quantized for Apple Silicon (MLX) — Unsloth Dynamic 2.0 with AWQ imatrix pre-scaling. • 9 items • Updated 16 days ago • 1
Gemma-4-26B-IT-unsloth-mlx Collection Gemma-4-26B-A4B-IT MoE quantized for Apple Silicon (MLX) — Unsloth Dynamic 2.0 with AWQ imatrix pre-scaling. • 8 items • Updated 16 days ago • 1
Qwen-3.6-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3-6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 18 items • Updated 17 days ago • 19
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated Apr 22 • 153
Qwen3.5-122B-A10B Collection MINT quantized versions of Qwen3.5-122B-A10B at multiple budget targets (MLX & GGUF) • 4 items • Updated Apr 7 • 2
APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique ( https://github.com/mudler/apex-quant ) • 36 items • Updated 2 days ago • 104
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated Apr 22 • 196
Qwen-3.5-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3-6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 20 items • Updated Mar 29 • 20
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 507