ChengpengLi's picture

3 10 2

ChengpengLi

ChengpengLi

·

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

upvoted a paper 2 days ago

Agentic Reinforced Policy Optimization

commented on a paper about 2 months ago

CoRT: Code-integrated Reasoning within Thinking

upvoted a paper 2 months ago

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published 7 days ago • 114

upvoted a paper 2 months ago

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22 • 57

upvoted a paper 5 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

upvoted 3 papers 7 months ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 76

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 100

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 373

upvoted 2 collections 11 months ago

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated 12 days ago • 83

Qwen2-Math

Math-specific model series based on Qwen2 • 8 items • Updated 12 days ago • 52

upvoted 2 papers about 1 year ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 17

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 21