T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published 2 days ago • 26
UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities Paper • 2504.20734 • Published 5 days ago • 55
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 5 days ago • 76
Stable Video Diffusion Img2Vid ✨ Animate Your Pictures With Stable Video Diffusion • Running on Zero • 59
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published 8 days ago • 40
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper • 2504.17207 • Published 10 days ago • 28
PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models Paper • 2504.16074 • Published 11 days ago • 35