mdagost/SmolLM2-FT-DPO

Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
smol-course
module_1
trl
dpo
conversational
text-generation-inference
SmolLM2-FT-DPO / runs
  • 1 contributor
History: 3 commits
mdagost
End of training
19e1659 verified 6 months ago
  • Dec11_17-32-49_4a5d3c8f7315
    End of training 6 months ago
  • Dec11_23-08-07_ff43bde1d962
    End of training 6 months ago
  • Dec11_23-20-53_ff43bde1d962
    End of training 6 months ago