deepRp-llama31-8b / checkpoint-36
taozi555
Training in progress, step 36, checkpoint
bd3394d verified