SRT-H: A Hierarchical Framework for Autonomous Surgery via Language Conditioned Imitation Learning Paper • 2505.10251 • Published May 15 • 3 • 3
Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation Paper • 2507.02608 • Published 8 days ago • 20 • 3
SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution Paper • 2506.19838 • Published 16 days ago • 11 • 1
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition Paper • 2506.17201 • Published 20 days ago • 52 • 3
Show-o2: Improved Native Unified Multimodal Models Paper • 2506.15564 • Published 23 days ago • 29 • 3
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs Paper • 2506.14731 • Published 24 days ago • 9 • 2
Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation Paper • 2506.09350 • Published about 1 month ago • 48 • 2
FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control Paper • 2505.22642 • Published May 28 • 3 • 2
DiSA: Diffusion Step Annealing in Autoregressive Image Generation Paper • 2505.20297 • Published May 26 • 2 • 1
Hunyuan-Game: Industrial-grade Intelligent Game Creation Model Paper • 2505.14135 • Published May 20 • 15 • 2
Improving Assembly Code Performance with Large Language Models via Reinforcement Learning Paper • 2505.11480 • Published May 16 • 8 • 2
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image Paper • 2502.12894 • Published Feb 18 • 16 • 3
Aya Vision: Advancing the Frontier of Multilingual Multimodality Paper • 2505.08751 • Published May 13 • 12 • 2
AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale Paper • 2505.08311 • Published May 13 • 18 • 2
Generating Physically Stable and Buildable LEGO Designs from Text Paper • 2505.05469 • Published May 8 • 28 • 2