chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-3Dhint-prompt1-epoch5-new44 Text Generation • Updated 3 days ago
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-grpo-beta0-epoch5-new44 Text Generation • Updated 4 days ago • 1
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5-ghpo-cold0-3Dhint-prompt1-epoch1 Text Generation • Updated 5 days ago • 2
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5-grpo-beta0-epoch1 Text Generation • Updated 5 days ago • 1
chenggong1995/Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta0-epoch1-v2 Text Generation • Updated 8 days ago • 1
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-hint50-prompt1-redonum-test Text Generation • Updated 9 days ago • 1
chenggong1995/Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta1e-3-epoch1-v2 Text Generation • Updated 9 days ago • 1
chenggong1995/Qwen-2.5-Base-7B-gen8-mix-grpo-CL-beta1e-3-epoch1-v2 Text Generation • Updated 10 days ago • 1
chenggong1995/Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta1e-3-epoch1 Text Generation • Updated 10 days ago • 1