Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • 1 day ago • 42
Vision Language Models (Better, Faster, Stronger) By merve and 4 others • 13 days ago • 366
Introducing Pull Requests and Discussions 🥳 By victor • May 25, 2022 • 13
Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques By jmamou and 8 others • Mar 24 • 18
Rearchitecting Hugging Face Uploads and Downloads By jsulz and 2 others • Nov 26, 2024 • 46
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 421
Welcome to Inference Providers on the Hub 🔥 By julien-c and 6 others • Jan 28 • 479
Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • Jan 23 • 68
Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference By mfuntowicz and 1 other • Jan 16 • 74
Faster Assisted Generation with Dynamic Speculation By jmamou and 6 others • Oct 8, 2024 • 46
Improving Parquet Dedupe on Hugging Face Hub By yuchenglow and 1 other • Oct 5, 2024 • 33
Google releases Gemma 2 2B, ShieldGemma and Gemma Scope By Xenova and 3 others • Jul 31, 2024 • 59
Welcome Gemma 2 - Google's new open LLM By philschmid and 5 others • Jun 27, 2024 • 129
From cloud to developers: Hugging Face and Microsoft Deepen Collaboration By jeffboudier and 1 other • May 21, 2024 • 9