Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ChengpengLi's picture
3 10 2

ChengpengLi

ChengpengLi
akhaliq's profile picture AndroidGuy's profile picture dongguanting's profile picture
·

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

upvoted a paper 2 days ago
Agentic Reinforced Policy Optimization
commented on a paper about 2 months ago
CoRT: Code-integrated Reasoning within Thinking
upvoted a paper 2 months ago
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning
View all activity

Organizations

None yet

authored a paper 5 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114
authored 3 papers about 1 year ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 166

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 21

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 17
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs