danushkhanna/DeepSeek-R1-Distill-Llama-8B-anubis_dpo

Generated from Trainer

DeepSeek-R1-Distill-Llama-8B-anubis_dpo / training_loss.png

Upload folder using huggingface_hub

f420eab verified about 1 month ago

27.9 kB