ChengpengLi
AI & ML interests
LLMs for reasoning, reinforcement learning, recommendation systems, diffusion models
Recent Activity
upvoted a paper 1 day ago
Agentic Reinforced Policy Optimization
commented on a paper about 2 months ago
CoRT: Code-integrated Reasoning within Thinking
upvoted a paper 2 months ago
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning
Organizations
None yet