Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

gabrielmbmb
/
smollm2-1.7B-8k-mix7-ep2-v2-qlora-r16-a16-lr3e4-mix1-dpo

PEFT
TensorBoard
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
Model card Files Files and versions
xet
Metrics Training metrics Community
smollm2-1.7B-8k-mix7-ep2-v2-qlora-r16-a16-lr3e4-mix1-dpo / runs /Nov05_11-27-51_ip-26-0-167-175
Ctrl+K
Ctrl+K
  • 1 contributor
History: 6 commits
gabrielmbmb's picture
gabrielmbmb
End of training
ab610d7 verified 7 months ago
  • events.out.tfevents.1730806544.ip-26-0-167-175.3215185.0
    42.7 kB
    xet
    Model save 7 months ago
  • events.out.tfevents.1730808720.ip-26-0-167-175.3215185.1
    828 Bytes
    xet
    End of training 7 months ago