morizon/llm-jp-3-13b-instruct2-grpo-MATH-lighteval_step1000 Text Generation • 14B • Updated Feb 19 • 56
daichira/llm-jp-3-13b-instruct2-gpro-0222_OpenMATHinstruct_1800_sft_math-tanuki_adapter_0.9 Updated Feb 24
daichira/llm-jp-3-13b-instruct2-gpro-0222_OpenMATHinstruct_1800_sft_math-tanuki_adapter Updated Feb 24
u-10bei/llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja2000 Text Generation • 14B • Updated Feb 26 • 34
u-10bei/llm-jp-3-13b-instruct2-grpo-0222_lora_step2000_ja5000 Text Generation • 14B • Updated Feb 26 • 7