Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
4
17
Liang Ding
alphadl
Follow
21world's profile picture
AdinaY's profile picture
shaopengfu's profile picture
3 followers
·
17 following
https://liamding.cc
liangdingNLP
alphadl
AI & ML interests
Large Language Model, Machine Learning, Machine Translation
Recent Activity
liked
a model
about 11 hours ago
alphadl/ppo-gsm8k-0.5b
new
activity
about 15 hours ago
alphadl/ppo-gsm8k-0.5b:
Question about reference performance
updated
a model
9 days ago
alphadl/ppo-gsm8k-0.5b
View all activity
Organizations
Papers
58
arxiv:
2504.09130
arxiv:
2412.15303
arxiv:
2409.05923
arxiv:
2408.15556
Expand 58 papers
models
4
Sort: Recently updated
alphadl/ppo-gsm8k-0.5b
Text Generation
•
0.6B
•
Updated
9 days ago
•
9
•
2
alphadl/R1-Distill-1.5B-Qwen-GRPO
Text Generation
•
2B
•
Updated
Jun 28
•
5
alphadl/R1-Distill-0.6B-Qwen-GRPO
Text Generation
•
0.6B
•
Updated
Jun 13
•
6
alphadl/R1-Distill-0.6B-Qwen
Text Generation
•
0.6B
•
Updated
Jun 6
•
17
datasets
0
None public yet