MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning • Paper • 2505.24846 • Published 8 days ago • 15
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind • Paper • 2505.22961 • Published 10 days ago • 8
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution • Paper • 2505.20286 • Published 12 days ago • 6
AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting • Paper • 2505.18822 • Published 14 days ago • 14
Time-R1: Towards Comprehensive Temporal Reasoning in LLMs • Paper • 2505.13508 • Published 23 days ago • 14
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL • Paper • 2505.02391 • Published May 5 • 24
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction • Paper • 2410.19743 • Published Oct 10, 2024 • 1
Self-DC: When to retrieve and When to generate? Self Divide-and-Conquer for Compositional Unknown Questions • Paper • 2402.13514 • Published Feb 21, 2024 • 1