X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real Paper • 2505.07096 • Published 21 days ago • 3
Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model Paper • 2408.00754 • Published Aug 1, 2024 • 25
ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models Paper • 2406.06133 • Published Jun 10, 2024 • 12
BLINK: Multimodal Large Language Models Can See but Not Perceive Paper • 2404.12390 • Published Apr 18, 2024 • 27
BLINK: Multimodal Large Language Models Can See but Not Perceive Paper • 2404.12390 • Published Apr 18, 2024 • 27
Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video Paper • 2404.09833 • Published Apr 15, 2024 • 31