UniSD: Towards a Unified Self-Distillation Framework for Large Language Models Paper • 2605.06597 • Published 11 days ago • 15
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published 11 days ago • 108
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published 12 days ago • 97
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 628
stefanocarrera/autophagycode_D_train_Qwen3-0.6B_lr0.0001_c142_sem_g4 Viewer • Updated Apr 4 • 103 • 13