Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

David-Xu
/
cira-7b-dpo-lora-merge

PEFT
TensorBoard
Safetensors
llama
alignment-handbook
Generated from Trainer
4-bit precision
bitsandbytes
Model card Files Files and versions Metrics Training metrics Community
cira-7b-dpo-lora-merge / runs
Ctrl+K
Ctrl+K
  • 1 contributor
History: 16 commits
David-Xu's picture
David-Xu
Training in progress, step 1700
156a47b verified over 1 year ago
  • Mar11_07-47-23_b89f062cf3e1
    Training in progress, step 900 over 1 year ago
  • Mar11_09-32-25_b89f062cf3e1
    Training in progress, step 900 over 1 year ago
  • Mar11_09-57-38_b89f062cf3e1
    Training in progress, step 1200 over 1 year ago
  • Mar11_10-38-35_1b08ddff8132
    Training in progress, step 1700 over 1 year ago