RyanYr/brm-respL-dapo-r1-tpldeepseek-r1-plain-qwen2.5math-1.5B-base-lr2.5e-6-beta0.002 Updated May 12 • 3
RyanYr/brm-rsm-lr2.5e-6-beta0.002-stp480-dapo-qwen2.5math-7B-base-lr2.5e-6-beta0.001 Updated May 1 • 1