Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mytestdpo
Activity Feed
Follow
4
AI & ML interests
None defined yet.
Recent Activity
HanningZhang
authored
a paper
2 days ago
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
1231czx
updated
a dataset
about 2 months ago
mytestdpo/qwmathbase_raw_raft_step160_olympiadbench
1231czx
published
a dataset
about 2 months ago
mytestdpo/qwmathbase_raw_raft_step160_olympiadbench
View all activity
Team members
4
models
7
Sort: Recently updated
mytestdpo/llsfttype12_gsm_auggsm_dpo_sftlossbeta05_step500
Text Generation
•
Updated
Jan 19
•
2
mytestdpo/type2_only_llama3it_gsm8k_beta05_sftloss_step500_dpo
Text Generation
•
Updated
Jan 17
•
2
mytestdpo/type12_only_beta05_sftloss_step500
Text Generation
•
Updated
Jan 17
•
2
mytestdpo/type12_7k_type3_5ktype4_beta05_sftloss_step450_dpo
Text Generation
•
Updated
Jan 17
•
2
mytestdpo/llama3_orm_tmp10_2
Text Generation
•
Updated
Jan 6
•
2
mytestdpo/llama3_orm_tmp10
Text Generation
•
Updated
Jan 6
•
2
mytestdpo/llama3_8b_dpo_iter2
Updated
Dec 26, 2024
datasets
156
Sort: Recently updated
mytestdpo/qwmathbase_raw_raft_step160_olympiadbench
Viewer
•
Updated
Mar 19
•
675
•
25
mytestdpo/qwmathbase_raw_raft_step160_minerva_math
Viewer
•
Updated
Mar 19
•
272
•
31
mytestdpo/qwmathbase_raw_raft_step160_amc23
Viewer
•
Updated
Mar 19
•
40
•
33
mytestdpo/qwmathbase_raw_raft_step160_math500
Viewer
•
Updated
Mar 19
•
500
•
35
mytestdpo/qwmathbase_raw_raft_step160_aime24
Viewer
•
Updated
Mar 19
•
30
•
35
mytestdpo/qwmathbase_raw_raft_step40_olympiadbench
Viewer
•
Updated
Mar 19
•
675
•
33
mytestdpo/qwmathbase_raw_raft_step40_minerva_math
Viewer
•
Updated
Mar 19
•
272
•
35
mytestdpo/qwmathbase_raw_raft_step40_amc23
Viewer
•
Updated
Mar 19
•
40
•
28
mytestdpo/qwmathbase_raw_raft_step40_math500
Viewer
•
Updated
Mar 19
•
500
•
34
mytestdpo/qwmathbase_raw_raft_step40_aime24
Viewer
•
Updated
Mar 19
•
30
•
34
Expand 156 datasets