Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
9
5
3
Gaotang Li
gaotang
Follow
John6666's profile picture
JohnRoger's profile picture
2 followers
·
0 following
https://gaotangli.github.io/
GaotangLi
AI & ML interests
None yet
Recent Activity
liked
a dataset
7 days ago
xinlai/Math-Step-DPO-10K
upvoted
a
paper
7 days ago
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
updated
a model
7 days ago
gaotang/RM-R1-Qwen2.5-Instruct-14B
View all activity
Organizations
None yet
gaotang
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
gaotang/RM-R1-DeepSeek-Distilled-Qwen-32B
7 days ago
Add pipeline tag and library name
#1 opened 8 days ago by
nielsr
New activity in
gaotang/RM-R1-Qwen2.5-Instruct-7B
7 days ago
Improve model card with pipeline tag and library name
#1 opened 8 days ago by
nielsr
New activity in
gaotang/RM-R1-DeepSeek-Distilled-Qwen-14B
7 days ago
Add pipeline tag, library name, and sample usage
#1 opened 8 days ago by
nielsr
New activity in
gaotang/RM-R1-Qwen2.5-Instruct-14B
7 days ago
Add pipeline tag and library name, add usage example and missing sections
1
#1 opened 8 days ago by
nielsr
New activity in
gaotang/RM-R1-Entire-RLVR-Train
7 days ago
Add task category
#2 opened 8 days ago by
nielsr
New activity in
gaotang/RM-R1-after-Distill-RLVR
7 days ago
Add task category
1
#1 opened 8 days ago by
nielsr
New activity in
gaotang/RM-R1-Distill-SFT
7 days ago
Update dataset card with task category
1
#1 opened 8 days ago by
nielsr
New activity in
gaotang/RM-R1-Qwen2.5-Instruct-32B
7 days ago
Add library_name and pipeline_tag metadata
1
#1 opened 9 days ago by
nielsr
commented
a paper
9 days ago
RM-R1: Reward Modeling as Reasoning
Paper
•
2505.02387
•
Published
10 days ago
•
66
•
1