RL Agents - a alexngai Collection

alexngai 's Collections

Latent Reasoning

Autonomous Research

Automated Research

Test-Time Compute/Optimal Scaling

Self-Improving Agents

Codegen Benchmarks

RL Agents

updated Apr 5

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 115
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 276