chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-3Dhint-prompt1-epoch5-new44-test Updated 6 days ago
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-grpo-beta0-epoch5-new44-test Updated 6 days ago
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-3Dhint-prompt1-epoch5-new44 Text Generation • 8B • Updated May 21 • 5
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-grpo-beta0-epoch5-new44 Text Generation • 8B • Updated May 20 • 4
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5-ghpo-cold0-3Dhint-prompt1-epoch1 Text Generation • 8B • Updated May 19 • 4
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5-grpo-beta0-epoch1 Text Generation • 8B • Updated May 19 • 5
chenggong1995/Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta0-epoch1-v2 Text Generation • 8B • Updated May 16 • 3
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-hint50-prompt1-redonum-test Text Generation • 8B • Updated May 16 • 4
chenggong1995/Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta1e-3-epoch1-v2 Text Generation • 8B • Updated May 15 • 3
chenggong1995/Qwen-2.5-Base-7B-gen8-mix-grpo-CL-beta1e-3-epoch1-v2 Text Generation • 8B • Updated May 14 • 2
chenggong1995/Qwen-2.5-Base-7B-gen8-mix_hint50-grpo-CL-beta1e-3-epoch1 Text Generation • 8B • Updated May 14 • 2
chenggong1995/Qwen-2.5-Base-7B-gen8-mix-grpo-CL-beta1e-3-epoch1 Text Generation • 8B • Updated May 14 • 2
chenggong1995/Qwen-2.5-Base-7B-gen8-olympiads_aime-grpo-CL-beta1e-3-epoch1 Text Generation • 8B • Updated May 9 • 4
chenggong1995/Qwen-2.5-Base-7B-gen8-olympiads_aime-grpo-CL-epoch1 Text Generation • 8B • Updated May 8 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold0-3Dhint-prompt1-epoch8-old2 8B • Updated May 8 • 4
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5-grpo-epoch5-GPU44 Text Generation • 8B • Updated May 6 • 4
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold20-2Dhint-prompt1-GPU71 Text Generation • 8B • Updated May 6 • 4
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold20-2Dhint-prompt1-epoch4-new Text Generation • 8B • Updated May 5 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold0-hint0.5-prompt0-epoch4 Text Generation • 8B • Updated May 4 • 3
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold20-2Dhint-prompt1-epoch4 8B • Updated May 2 • 3
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold20-hint0.5-prompt1-epoch4 8B • Updated May 2 • 1
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5 Text Generation • 8B • Updated May 1 • 4
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-dui1-epoch5 Text Generation • 8B • Updated May 1 • 4
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold10-3Dhint-prompt1-cosine Text Generation • 8B • Updated Apr 29 • 6
chenggong1995/Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-cold10-hint0.5-prompt1-dp Text Generation • 8B • Updated Apr 29 • 5