CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions By paulml • 7 days ago • 14
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 8 days ago • 16
Gaia2 Leaderboard Update: New Models and New Observations By meta-agents-research-environments and 3 others • 6 days ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 229
CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions By paulml • 7 days ago • 14
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 8 days ago • 16
Gaia2 Leaderboard Update: New Models and New Observations By meta-agents-research-environments and 3 others • 6 days ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 229