weizhepei/Qwen2.5-3B-WebArena-Lite-SFT-CoT-QwQ-32B-epoch-2-no-sys-new Text Generation • Updated 6 days ago • 6
weizhepei/Qwen2.5-3B-WebArena-Lite-SFT-CoT-QwQ-32B-epoch-3-no-sys-new Text Generation • Updated 6 days ago • 7
CohenQu/infoseek_v4_Med_MCQA_USMLE_Qwen2.5-3B-Instruct_MedMCQA.11.00 Text Generation • Updated 5 days ago • 4
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_9k_v2 Text Generation • Updated 5 days ago • 8
wcs2024/Qwen2.5-Math-7B-Instruct-pruned-keep-0.00-0.25_0.75-1.00 Text Generation • Updated 3 days ago • 2
flyingbugs/Qwen2.5-Math-7B-math220k-pruned-correctness-ratio Text Generation • Updated about 11 hours ago
chenggong1995/openr1-Qwen-2.5-Base-3B-gen8-scale-NuminaMath-TIR-100-grpo-beta0-epoch2 Text Generation • Updated 4 days ago
flyingbugs/Qwen2.5-Math-7B-Instruct-GeneralThought-tail-only Text Generation • Updated 4 days ago • 2
chenggong1995/openr1-Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-ghpo-epoch3 Text Generation • Updated 2 days ago • 133
chenggong1995/openr1-Qwen-2.5-Base-7B-gen8-math3to5_olympiads_aime-grpo-epoch3 Text Generation • Updated 3 days ago • 2
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW Text Generation • Updated 4 days ago • 507
weizhepei/Qwen2.5-3B-WebArena-Lite-SFT-CoT-QwQ-32B-epoch-1-no-sys-new Text Generation • Updated 4 days ago • 6