JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes Paper • 2505.06771 • Published May 10 • 1
Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers Paper • 2506.14702 • Published 29 days ago • 4
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 14 days ago • 52
Article Common Pitfalls in Sharing Open Source Models on Hugging Face (and How to Dodge Them) By FriendliAI and 2 others • 15 days ago • 21
Article Bringing Fusion Down to Earth: ML for Stellarator Optimization By cgeorgiaw • 15 days ago • 67
Pretrained Transformers as Universal Computation Engines Paper • 2103.05247 • Published Mar 9, 2021 • 1
LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs Paper • 2506.21862 • Published 20 days ago • 35
Approximating Language Model Training Data from Weights Paper • 2506.15553 • Published 28 days ago • 1
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning Paper • 2410.21845 • Published Oct 29, 2024 • 16
Chain-of-Thought Reasoning is a Policy Improvement Operator Paper • 2309.08589 • Published Sep 15, 2023 • 2
Article Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm By nvidia and 4 others • Jun 11 • 68
Layer by Layer: Uncovering Hidden Representations in Language Models Paper • 2502.02013 • Published Feb 4 • 2
Autonomous Improvement of Instruction Following Skills via Foundation Models Paper • 2407.20635 • Published Jul 30, 2024 • 1
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning Paper • 2412.09858 • Published Dec 13, 2024 • 2
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning Paper • 2401.16013 • Published Jan 29, 2024 • 26
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents Paper • 2505.22954 • Published May 29 • 12