view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • Jan 30 • 117
view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware By RakshitAralimatti • 17 days ago • 17
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 8 days ago • 40
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • 18 days ago • 69
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 657
view article Article ScreenEnv: Deploy your full stack Desktop Agent By A-Mahla and 1 other • Jul 10 • 64