Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 2 items • Updated about 21 hours ago • 43
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for edge AI and on-device deployment. • 6 items • Updated 2 days ago • 55
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated about 18 hours ago • 207
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 10 days ago • 45
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 4 days ago • 484
4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture Paper • 2507.05163 • Published 5 days ago • 37
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • Jun 3 • 69
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • 16 days ago • 106
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 475
ERNIE 4.5 Collection collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated 1 day ago • 149
view article Article Deploying TensorFlow Vision Models in Hugging Face with TF Serving By sayakpaul • Jul 25, 2022 • 2
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning Paper • 2506.18841 • Published 19 days ago • 56
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published 16 days ago • 60
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition Paper • 2506.17201 • Published 22 days ago • 52