Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques Paper • 2506.08060 • Published 6 days ago • 6
Hunyuan-Game: Industrial-grade Intelligent Game Creation Model Paper • 2505.14135 • Published 26 days ago • 15
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack Paper • 2406.10149 • Published Jun 14, 2024 • 53
ATLAS: Learning to Optimally Memorize the Context at Test Time Paper • 2505.23735 • Published 17 days ago • 23
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 13 days ago • 151
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time Paper • 2505.24863 • Published 16 days ago • 91
view article Article System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience By codelion • 13 days ago • 12
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published 23 days ago • 77
view article Article AutoThink: Adaptive Reasoning for Large Language Models By codelion • 19 days ago • 4
view article Article OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve By codelion • 26 days ago • 22
view article Article Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training By codelion • 29 days ago • 5
Pivotal Token Search Collection Pivotal Token Search (PTS) identifies tokens in a language model's generation that significantly impact the probability of success • 9 items • Updated May 14 • 3
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7 • 64