Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data Paper • 2507.08761 • Published Jul 11, 2025 • 1
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 122
ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection Paper • 2505.15182 • Published May 21, 2025 • 6