Behavior Knowledge Merge in Reinforced Agentic Models • Paper • 2601.13572 • Published 10 days ago • 23
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting • Paper • 2601.02151 • Published 25 days ago • 104