RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 4 days ago • 94
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 9 days ago • 310
Appear2Meaning: A Cross-Cultural Benchmark for Structured Cultural Metadata Inference from Images Paper • 2604.07338 • Published 9 days ago • 5
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 15 days ago • 473
Devy1/Qwen2.5-Coder-CONTROL-checkpoints_multi_language_2k-1.5B-Base-3 2B • Updated 15 days ago • 18 • 1
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 263