Unicorn: Text-Only Data Synthesis for Vision Language Model Training Paper β’ 2503.22655 β’ Published 6 days ago β’ 31
Efficient Inference for Large Reasoning Models: A Survey Paper β’ 2503.23077 β’ Published 5 days ago β’ 37
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation Paper β’ 2503.24379 β’ Published 3 days ago β’ 52
Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL Paper β’ 2503.23157 β’ Published 5 days ago β’ 4
Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models Paper β’ 2503.22165 β’ Published 6 days ago β’ 17
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors Paper β’ 2504.01016 β’ Published 1 day ago β’ 18
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper β’ 2502.05171 β’ Published Feb 7 β’ 132
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 22 days ago β’ 363