Abstract
AlphaOne dynamically modulates reasoning in large models by introducing $\alpha$ moment and Bernoulli process for slow thinking, improving efficiency and capability across diverse domains.
This paper presents AlphaOne (alpha1), a universal framework for modulating reasoning progress in large reasoning models (LRMs) at test time. alpha1 first introduces alpha moment, which represents the scaled thinking phase with a universal parameter alpha. Within this scaled pre-alpha moment phase, it dynamically schedules slow thinking transitions by modeling the insertion of reasoning transition tokens as a Bernoulli stochastic process. After the alpha moment, alpha1 deterministically terminates slow thinking with the end-of-thinking token, thereby fostering fast reasoning and efficient answer generation. This approach unifies and generalizes existing monotonic scaling methods by enabling flexible and dense slow-to-fast reasoning modulation. Extensive empirical studies on various challenging benchmarks across mathematical, coding, and scientific domains demonstrate alpha1's superior reasoning capability and efficiency. Project page: https://alphaone-project.github.io/
Community
Paper: https://arxiv.org/abs/2505.24863
Project: https://alphaone-project.github.io/
Code (Coming soon): https://github.com/ASTRAL-Group/AlphaOne
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- TrimR: Verifier-based Training-Free Thinking Compression for Efficient Test-Time Scaling (2025)
- LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling (2025)
- Scaling over Scaling: Exploring Test-Time Scaling Pareto in Large Reasoning Models (2025)
- Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning (2025)
- ProxyThinker: Test-Time Guidance through Small Visual Reasoners (2025)
- Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting (2025)
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper