Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • May 28, 2024 • 21
Step 1: Reproducing DeepSeek's Distilled Models Collection Code for training and evaluation: https://github.com/huggingface/open-r1 • 3 items • Updated 3 days ago • 1
Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • 8 days ago • 25
Article NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning By PranjaliJoshi and 1 other • 10 days ago • 24
Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools By albertvillanova • 13 days ago • 29
Article The Transformers Library: standardizing model definitions By lysandre and 3 others • 15 days ago • 104
INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning Paper • 2505.07291 • Published 17 days ago • 11
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning Paper • 2504.11354 • Published Apr 15 • 5
Article Page-to-Video: Generate videos from webpages 🪄🎬 By burtenshaw • 23 days ago • 27
Article How to Build an MCP Server with Gradio By abidlabs and 1 other • 30 days ago • 127
Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate By muellerzr and 3 others • Jun 13, 2024 • 54
Article Empowering Public Organizations: Preparing Your Data for the AI Era By evijit and 1 other • Apr 10 • 15
Article Enabling Long Context Training with Sequence Parallelism in Axolotl By axolotl-ai-co and 1 other • Apr 4 • 8
Article Training Large Language Models with Interpreter Feedback using WebAssembly By axolotl-ai-co and 1 other • Apr 3 • 13