view article Article Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness By jdelavande and 2 others • 4 days ago • 8
Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs Paper • 2506.05629 • Published 10 days ago • 33
Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design Paper • 2506.04734 • Published 11 days ago • 19
From Token to Action: State Machine Reasoning to Mitigate Overthinking in Information Retrieval Paper • 2505.23059 • Published 18 days ago • 13
view article Article *Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings By manu and 1 other • 14 days ago • 23
Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms Paper • 2505.20322 • Published 23 days ago • 14
GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning Paper • 2505.20355 • Published 21 days ago • 36
s3: You Don't Need That Much Data to Train a Search Agent via RL Paper • 2505.14146 • Published 27 days ago • 17
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published 24 days ago • 77
Think Only When You Need with Large Hybrid-Reasoning Models Paper • 2505.14631 • Published 26 days ago • 19
Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning Paper • 2505.13866 • Published 27 days ago • 16
Improving Assembly Code Performance with Large Language Models via Reinforcement Learning Paper • 2505.11480 • Published about 1 month ago • 8
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning Paper • 2505.11896 • Published 30 days ago • 57
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published May 15 • 119
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning Paper • 2505.01441 • Published Apr 28 • 36