Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 18 days ago • 157
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder Paper • 2505.07916 • Published 12 days ago • 116
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 4 days ago • 113
Flow-GRPO: Training Flow Matching Models via Online RL Paper • 2505.05470 • Published 16 days ago • 74
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published 17 days ago • 63
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models Paper • 2505.14810 • Published 4 days ago • 51
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space Paper • 2505.15778 • Published 3 days ago • 10
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification Paper • 2505.16938 • Published 2 days ago • 92
Learning to Reason via Mixture-of-Thought for Logical Reasoning Paper • 2505.15817 • Published 3 days ago • 12