Qwen2.5-7B-GRPO-NM-COT-20K-2epoch / train_results.json
Haitao999 · Model save · 05d7c11 (verified)
203 Bytes
{
    "total_flos": 0.0,
    "train_loss": 2.155859272374791e-08,
    "train_runtime": 56373.7111,
    "train_samples": 20000,
    "train_samples_per_second": 0.355,
    "train_steps_per_second": 0.002
}
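
The metrics above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the repository: it assumes `train_runtime` is measured in seconds (as the Hugging Face `Trainer` normally reports it), and the variable names `runtime_hours` and `implied_samples` are illustrative. The JSON is inlined so the snippet runs without the file on disk.

```python
import json

# Contents of train_results.json, inlined so the sketch is self-contained.
raw = """
{
    "total_flos": 0.0,
    "train_loss": 2.155859272374791e-08,
    "train_runtime": 56373.7111,
    "train_samples": 20000,
    "train_samples_per_second": 0.355,
    "train_steps_per_second": 0.002
}
"""

metrics = json.loads(raw)

# Assumption: train_runtime is in seconds, so divide by 3600 for hours.
runtime_hours = metrics["train_runtime"] / 3600

# Throughput times runtime should roughly reproduce the reported sample count.
# The reported throughput is rounded to three decimals, so expect a small gap.
implied_samples = metrics["train_samples_per_second"] * metrics["train_runtime"]

print(f"runtime: {runtime_hours:.2f} h")
print(f"implied samples: {implied_samples:.0f} (reported: {metrics['train_samples']})")
```

Because `train_steps_per_second` is rounded to a single significant digit here, multiplying it by the runtime gives only a coarse estimate of the step count, so the sketch leaves that figure out.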