tanliboy/llama-3.2-3b-dpo-2

Text Generation
Transformers
TensorBoard
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
llama-3.2-3b-dpo-2 / runs
  • 1 contributor
History: 7 commits
Latest commit: tanliboy, "End of training", 26bbc31 (verified), 9 months ago
  • Oct01_07-05-14_action-graph-trainer
    End of training 9 months ago