Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published Aug 21, 2025 • 89
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 436
Article SmolLM3: smol, multilingual, long-context reasoner • eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 777
Article Vision Language Models (Better, faster, stronger) • merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 613
Article Introducing HELMET: Holistically Evaluating Long-context Language Models • hyen, gaotianyu1350, houminmin, kding1, danf, moshew, cdq10131 • Apr 16, 2025 • 42
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference • mfuntowicz, hlarcher • Jan 16, 2025 • 76
Article Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding • ofirzaf, echarlaix, imargulis, danielkorat, jmamou, guybd, orenpereg, moshew, Haihao, aayasin, FanZhao • Jan 30, 2024 • 9
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23, 2024 • 18
Article CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG • peterizsak, mber, danf, echarlaix, mfuntowicz, moshew • Mar 15, 2024 • 14