Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

CharlesLi
/
OpenELM-1_1B-DPO-full-max-8-reward

Text Generation
Transformers
TensorBoard
Safetensors
openelm
trl
dpo
alignment-handbook
Generated from Trainer
conversational
custom_code
Model card Files Files and versions Metrics Training metrics Community
OpenELM-1_1B-DPO-full-max-8-reward / runs
Ctrl+K
Ctrl+K
  • 1 contributor
History: 4 commits
CharlesLi's picture
CharlesLi
Model save
3ee5d97 verified 10 months ago
  • Oct05_14-24-10_xe8545-a100-14
    Model save 10 months ago
  • Oct06_22-40-26_xe8545-a100-22
    Model save 10 months ago
  • Oct07_13-52-56_xe8545-a100-22
    Model save 10 months ago
  • Sep16_21-11-33_xe8545-a100-28
    Model save 11 months ago