gabrielmbmb/smollm2-1.7B-8k-mix7-ep2-v2-qlora-r16-a16-lr3e4-mix1-dpo

PEFT
TensorBoard
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
smollm2-1.7B-8k-mix7-ep2-v2-qlora-r16-a16-lr3e4-mix1-dpo / runs
  • 1 contributor
History: 6 commits
gabrielmbmb
End of training
ab610d7 verified 7 months ago
  • Nov05_10-54-45_ip-26-0-163-236
    Training in progress, step 100 7 months ago
  • Nov05_11-27-51_ip-26-0-167-175
    End of training 7 months ago