Cheng Qian's picture

2 23

Cheng Qian

chengq9

·

https://qiancheng0.github.io

qiancheng0

AI & ML interests

Agent, Tool Learning

Recent Activity

upvoted a paper 14 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

upvoted a paper 3 months ago

Multimodal Policy Internalization for Conversational Agents

upvoted a paper 3 months ago

Self-Improving LLM Agents at Test-Time

View all activity

Organizations

Collections 1

Papers 17

arxiv:2509.19736

arxiv:2509.09614

arxiv:2507.22034

arxiv:2507.21046

models 3

chengq9/ToolRL-Qwen2.5-1.5B

2B • Updated Apr 22, 2025 • 75

chengq9/ToolRL-Qwen2.5-3B

3B • Updated Apr 22, 2025 • 4.86k • 1

chengq9/ToolRL-Llama3.2-3B

4B • Updated Apr 22, 2025 • 6

datasets 0

None public yet