MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper โข 2511.11793 โข Published Nov 14, 2025 โข 170
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper โข 2511.08892 โข Published Nov 12, 2025 โข 203
Running 3.63k The Ultra-Scale Playbook ๐ 3.63k The ultimate guide to training LLM on large GPU Clusters
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning Paper โข 2509.02479 โข Published Sep 2, 2025 โข 83
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper โข 2507.14683 โข Published Jul 19, 2025 โข 134
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy Paper โข 2507.01352 โข Published Jul 2, 2025 โข 56