Llama-3.1-8B-sft-hhrlhf-dpo / last-checkpoint
AmberYifan
Training in progress, epoch 2, checkpoint
c63ce2e verified