chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-grpo Text Generation • 8B • Updated Apr 7 • 5
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-ghpo Text Generation • 8B • Updated Apr 8 • 6
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine Text Generation • 8B • Updated Apr 8 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch2 Text Generation • 8B • Updated Apr 9 • 9
chenggong1995/Qwen-2.5-Base-7B-gen8-scale-MATH-lighteval-olympiads_aime-unique-grpo-beta0-epoch2 Text Generation • 8B • Updated Apr 9 • 6
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-v4 Text Generation • 8B • Updated Apr 9 • 7
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-only Text Generation • 8B • Updated Apr 10 • 7
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-noRW-noformat Text Generation • 8B • Updated Apr 14 • 6