Llama-3.1-8B-sft-hhrlhf-dpo / last-checkpoint /model.safetensors.index.json

Commit History

Training in progress, epoch 1, checkpoint
3d531cc
verified

AmberYifan commited on