Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper • 2504.17207 • Published 2 days ago • 18
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Paper • 2504.17192 • Published 2 days ago • 55
DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs Paper • 2504.17040 • Published 3 days ago • 8
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Paper • 2504.17789 • Published 2 days ago • 10
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper • 2504.17207 • Published 2 days ago • 18
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper • 2504.17207 • Published 2 days ago • 18 • 3
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models Paper • 2504.15279 • Published 5 days ago • 61
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published 4 days ago • 49
LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping Paper • 2504.08902 • Published 15 days ago • 8
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Paper • 2504.15280 • Published 5 days ago • 19
IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs Paper • 2504.15415 • Published 5 days ago • 20
Perception Encoder: The best visual embeddings are not at the output of the network Paper • 2504.13181 • Published 9 days ago • 31
DRAGON: Distributional Rewards Optimize Diffusion Generative Models Paper • 2504.15217 • Published 5 days ago • 10
SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation Paper • 2504.14396 • Published 7 days ago • 27
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published 8 days ago • 98
Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published 9 days ago • 48