neutrino12's Collections: Agent
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper
• 2508.03680
• Published
• 137
Training Long-Context, Multi-Turn Software Engineering Agents with
Reinforcement Learning
Paper
• 2508.03501
• Published
• 59
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from
Experience
Paper
• 2508.04700
• Published
• 52
RoboMemory: A Brain-inspired Multi-memory Agentic Framework for Lifelong
Learning in Physical Embodied Systems
Paper
• 2508.01415
• Published
• 8
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper
• 2508.06471
• Published
• 206
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm
Bridging Foundation Models and Lifelong Agentic Systems
Paper
• 2508.07407
• Published
• 98
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with
Long-Term Memory
Paper
• 2508.09736
• Published
• 58
Memp: Exploring Agent Procedural Memory
Paper
• 2508.06433
• Published
• 36
OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks
Paper
• 2508.05614
• Published
• 20
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent
Foundation Models Training
Paper
• 2508.00414
• Published
• 94
Tool-integrated Reinforcement Learning for Repo Deep Search
Paper
• 2508.03012
• Published
• 20
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution
Paper
• 2507.23348
• Published
• 12
Think in Games: Learning to Reason in Games via Reinforcement Learning
with Large Language Models
Paper
• 2508.21365
• Published
• 29
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex
Dynamic Environment? A Study on τ-bench
Paper
• 2508.20931
• Published
• 16
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper
• 2508.16153
• Published
• 160
Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance
for Text-to-Image Generation
Paper
• 2508.18032
• Published
• 41
AWorld: Orchestrating the Training Recipe for Agentic AI
Paper
• 2508.20404
• Published
• 38
Understanding Tool-Integrated Reasoning
Paper
• 2508.19201
• Published
• 32
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Paper
• 2509.01055
• Published
• 79
EPO: Entropy-regularized Policy Optimization for LLM Agents
Reinforcement Learning
Paper
• 2509.22576
• Published
• 135