Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

gabrielmbmb
/
smollm2-1.7B-8k-mix7-ep2-v2-qlora-r16-a16-lr3e4-mix1-dpo

PEFT
TensorBoard
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
Model card Files Files and versions
xet
Metrics Training metrics Community
smollm2-1.7B-8k-mix7-ep2-v2-qlora-r16-a16-lr3e4-mix1-dpo / runs /Nov05_10-54-45_ip-26-0-163-236
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
gabrielmbmb's picture
gabrielmbmb
Training in progress, step 100
2c8f376 verified 7 months ago
  • events.out.tfevents.1730804610.ip-26-0-163-236.1850469.0
    6.63 kB
    xet
    Training in progress, step 100 7 months ago