view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 505
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 892
view article Article Liberate your OpenClaw +6 clem, burtenshaw, pcuenq, jeffboudier, merve, nielsr, victor, mishig • Mar 27 • 45
view article Article Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines +2 YiYiXu, OzzyGT, dn6, sayakpaul • Mar 5 • 51
view article Article Introducing Storage Buckets on the Hugging Face Hub +10 Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner • Mar 10 • 194
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency not-lain • Jan 30, 2025 • 327
view article Article Mixture of Experts (MoEs) in Transformers +5 ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 159
view article Article Compute and Competition in AI: Different FlOPs for Different Folks sasha • Feb 12 • 15
view article Article Custom Kernels for All from Codex and Claude +2 burtenshaw, sayakpaul, ariG23498, evalstate • Feb 13 • 75
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 379
view article Article Easily Build and Share ROCm Kernels with Hugging Face +2 badaoui, daniehua, ColorsWind, ftyghome • Nov 17, 2025 • 38
view article Article Running Large Transformer Models on Mobile and Edge Devices tugrulkaya • Nov 3, 2025 • 13
view article Article Get your VLM running in 3 simple steps on Intel CPUs +3 ezelanza, helenai, nikita-savelyev-intel, echarlaix, IlyasMoutawwakil • Oct 15, 2025 • 22
view article Article Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face +2 Jiqing, MatrixYao, kding1, IlyasMoutawwakil • Oct 16, 2025 • 18
view article Article Hugging Face and VirusTotal collaborate to strengthen AI security XciD, bquintero • Oct 22, 2025 • 55
view article Article The GPT-OSS models are here… and they’re energy-efficient! sasha • Aug 7, 2025 • 20