6 14 44

Dacheng Li

DachengLi

https://dachengli1.github.io

AI & ML interests

None yet

Recent Activity

liked a dataset 16 days ago

glaiveai/reasoning-v1-20m

liked a dataset 18 days ago

EssentialAI/essential-web-v1.0

upvoted a paper 21 days ago

Truncated Proximal Policy Optimization

View all activity

Organizations

liked a dataset 16 days ago

glaiveai/reasoning-v1-20m

Viewer • Updated Mar 19 • 22.2M • 1.52k • 212

liked a dataset 18 days ago

EssentialAI/essential-web-v1.0

Preview • Updated 18 days ago • 442k • 178

upvoted a paper 21 days ago

Truncated Proximal Policy Optimization

Paper • 2506.15050 • Published 22 days ago • 11

upvoted a paper 24 days ago

Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation

Paper • 2506.09991 • Published 29 days ago • 56

upvoted a paper about 1 month ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 133

liked a model about 1 month ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Text Generation • 2B • Updated Jun 5 • 12.4k • • 173

liked a dataset about 1 month ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 6.42k • 524

liked a model about 1 month ago

unsloth/DeepSeek-R1-0528-GGUF

Text Generation • 671B • Updated 25 days ago • 151k • 171

liked a dataset about 1 month ago

open-r1/Mixture-of-Thoughts

Viewer • Updated May 26 • 699k • 17.1k • 255

updated a dataset about 2 months ago

Efficient-Large-Model/worldmodelbench

Updated May 16 • 75

upvoted a paper 3 months ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 60

liked 2 models 3 months ago

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated Jan 12 • 251k • • 505

all-hands/openhands-lm-32b-v0.1

Text Generation • 33B • Updated Apr 16 • 3.4k • • 384

upvoted a paper 3 months ago

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

Paper • 2503.09641 • Published Mar 12 • 40

updated a dataset 4 months ago

DachengLi/d1k

Viewer • Updated Mar 26 • 1k • 29

published a dataset 4 months ago

DachengLi/d1k

Viewer • Updated Mar 26 • 1k • 29

authored a paper 4 months ago

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11 • 41

upvoted a collection 4 months ago

NovaSky Papers

Collection

2 items • Updated Feb 21 • 3

commented a paper 5 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63 •

authored a paper 5 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

Dacheng Li

AI & ML interests

Recent Activity

Organizations

DachengLi's activity