Article OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve By codelion • 4 days ago • 11
Article Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training By codelion • 7 days ago • 4
Pivotal Token Search Collection Pivotal Token Search (PTS) identifies tokens in a language model's generation that significantly impact the probability of success • 9 items • Updated 10 days ago • 3
Scalable Chain of Thoughts via Elastic Reasoning Paper • 2505.05315 • Published 16 days ago • 24
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published 17 days ago • 63
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 18 days ago • 157
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 24 days ago • 43
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published 24 days ago • 53
UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities Paper • 2504.20734 • Published 25 days ago • 61
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 25 days ago • 91
ReasonIR: Training Retrievers for Reasoning Tasks Paper • 2504.20595 • Published 25 days ago • 52
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published 30 days ago • 88
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models Paper • 2504.15279 • Published Apr 21 • 74
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities Paper • 2504.16078 • Published Apr 22 • 20