Fantastic Pretraining Optimizers and Where to Find Them Paper • 2509.02046 • Published 4 days ago • 10
AWorld: Orchestrating the Training Recipe for Agentic AI Paper • 2508.20404 • Published 9 days ago • 37
view article Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ By Wauplin and 2 others • Jul 25 • 80
view article Article NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset By nvidia and 4 others • 17 days ago • 15
view article Article MCP for Research: How to Connect AI to Research Tools By dylanebert • 20 days ago • 44
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published 23 days ago • 56
EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes Paper • 2507.11407 • Published Jul 15 • 57
view article Article How to train a Language Model with Megatron-LM By loubnabnl • Sep 7, 2022 • 18
view article Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks By nvidia and 4 others • 26 days ago • 72
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • Aug 5 • 489
view article Article retrain-pipelines and the almighty function-caller By Aurelien-Morgan • Apr 28 • 8
view article Article Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • Jul 31 • 63
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • Jul 29 • 169
view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • Jul 18 • 47
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 26 days ago • 227