Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
4
Hanning Zhang
HanningZhang
Follow
RogerZhuo's profile picture
circulartext's profile picture
2 followers
·
7 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
7 days ago
HanningZhang/scalebio_distill_qwen_math_uniform
published
a dataset
7 days ago
HanningZhang/scalebio_distill_qwen_math_uniform
updated
a dataset
7 days ago
HanningZhang/scalebio_distill_qwen_math
View all activity
Organizations
HanningZhang
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a dataset
7 days ago
HanningZhang/scalebio_distill_qwen_math_uniform
Viewer
•
Updated
7 days ago
•
2k
•
52
published
a dataset
7 days ago
HanningZhang/scalebio_distill_qwen_math_uniform
Viewer
•
Updated
7 days ago
•
2k
•
52
updated
a dataset
7 days ago
HanningZhang/scalebio_distill_qwen_math
Viewer
•
Updated
7 days ago
•
2k
•
50
published
a dataset
7 days ago
HanningZhang/scalebio_distill_qwen_math
Viewer
•
Updated
7 days ago
•
2k
•
50
updated
a dataset
11 days ago
HanningZhang/scalebio_r1_distill_math
Viewer
•
Updated
11 days ago
•
1.15k
•
37
published
a dataset
11 days ago
HanningZhang/scalebio_r1_distill_math
Viewer
•
Updated
11 days ago
•
1.15k
•
37
updated
a model
12 days ago
HanningZhang/Llama3.1-RAG-Reward
Text Generation
•
Updated
12 days ago
•
14
published
a model
12 days ago
HanningZhang/Llama3.1-RAG-Reward
Text Generation
•
Updated
12 days ago
•
14
liked
a dataset
13 days ago
nvidia/AceMath-Instruct-Training-Data
Viewer
•
Updated
Jan 17
•
5.56M
•
1.36k
•
46
authored
a paper
13 days ago
Self-rewarding correction for mathematical reasoning
Paper
•
2502.19613
•
Published
14 days ago
•
76
upvoted
a
paper
13 days ago
Self-rewarding correction for mathematical reasoning
Paper
•
2502.19613
•
Published
14 days ago
•
76
updated
a model
17 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step290-Vanilla
Updated
17 days ago
•
15
published
a model
17 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step290-Vanilla
Updated
17 days ago
•
15
updated
a model
17 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step280-Vanilla
Updated
17 days ago
•
14
published
a model
17 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step280-Vanilla
Updated
17 days ago
•
14
updated
a model
17 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step270-Vanilla
Updated
17 days ago
•
8
published
a model
17 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step270-Vanilla
Updated
17 days ago
•
8
updated
a model
17 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step260-Vanilla
Updated
17 days ago
•
12
published
a model
17 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step260-Vanilla
Updated
17 days ago
•
12
updated
a model
17 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step250-Vanilla
Updated
17 days ago
•
10
Load more