Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 14 days ago • 186
SEIF: Self-Evolving Reinforcement Learning for Instruction Following Paper • 2605.07465 • Published 13 days ago • 29
Chain of Evidence: Pixel-Level Visual Attribution for Iterative Retrieval-Augmented Generation Paper • 2605.01284 • Published 19 days ago • 3
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 21 days ago • 57
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 21 days ago • 217
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Paper • 2604.13902 • Published Apr 15 • 62
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics Paper • 2604.08503 • Published Apr 9 • 7
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 629
CARLA-Air: Fly Drones Inside a CARLA World – A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 342
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published Mar 17 • 248