view article Article Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • 1 day ago • 54
EDGE-GRPO: Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity Paper • 2507.21848 • Published 4 days ago • 6
GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface Paper • 2507.18546 • Published 9 days ago • 13
ULD Loss (Universal LLMs Distillation) Collection The ULD loss, based on optimal transport, enables distillation across different LLM families without requiring shared tokenizers. • 14 items • Updated 18 days ago • 2
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published Nov 15, 2024 • 85
ThinkPRM Collection Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 • 8 items • Updated 3 days ago • 3
view article Article Three Mighty Alerts Supporting Hugging Face’s Production Infrastructure By jcudit • 25 days ago • 9
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • 24 days ago • 623
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 25 days ago • 602
view article Article Bringing Fusion Down to Earth: ML for Stellarator Optimization By cgeorgiaw • about 1 month ago • 70
view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • Jun 12 • 116
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • Jun 26 • 113
view article Article xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy By BobWue • Jun 4 • 12
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • Jun 3 • 79
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • May 23 • 152
view article Article Building an Open Ecosystem for Time Series Forecasting: Introducing TimesFM in Hugging Face By Nutanix and 1 other • May 19 • 18
view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • May 21 • 31