Submitted by zhen-nan 70 TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes · 8 authors 3
Submitted by akhaliq 51 Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model · 6 authors 3
Submitted by DonJoey 40 What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models · 10 authors 2
Submitted by Wizardcoast 32 Unicorn: Text-Only Data Synthesis for Vision Language Model Training · 10 authors 2
Submitted by vanilla1116 24 RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy · 7 authors 2
Submitted by lianganimation 20 TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization · 8 authors 2
Submitted by tongwu2020 16 Effectively Controlling Reasoning Models through Thinking Intervention · 4 authors 3
Submitted by ZhiyuanthePony 13 Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data · 6 authors 2
Submitted by JimmyMa99 9 TeleAntiFraud-28k: A Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection · 10 authors 2
Submitted by jianguozhang 9 ActionStudio: A Lightweight Framework for Data and Training of Large Action Models · 16 authors 2
Submitted by abcorrea 9 Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code · 3 authors 1
Submitted by rover-xingyu 6 Easi3R: Estimating Disentangled Motion from DUSt3R Without Training · 5 authors 2
Submitted by 77Hui 5 UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation · 10 authors 2
Submitted by Lp256 4 MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs · 8 authors 2
Submitted by lastdefiance20 3 KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language · 2 authors 2
Submitted by ZhenyuLiang 3 Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization · 5 authors 3
Submitted by akhaliq 2 DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness · 4 authors 2