Exploring the Evolution of Physics Cognition in Video Generation: A Survey • Paper • 2503.21765 • Published 24 days ago • 11 • 2
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction • Paper • 2412.06782 • Published Dec 9, 2024 • 7 • 2
Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration • Paper • 2411.17686 • Published Nov 26, 2024 • 21 • 2
PiTe: Pixel-Temporal Alignment for Large Video-Language Model • Paper • 2409.07239 • Published Sep 11, 2024 • 14 • 2