Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
chloeli
/
qwen-2.5-0.5B-instruct-sft-lora-countdown-o3-5k
like
0
Text Generation
Transformers
Safetensors
MelinaLaimon/stream-of-search
qwen2
Generated from Trainer
alignment-handbook
trl
sft
conversational
text-generation-inference
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
qwen-2.5-0.5B-instruct-sft-lora-countdown-o3-5k
Commit History
End of training
6d682b2
verified
chloeli
commited on
12 days ago
Model save
09f9928
verified
chloeli
commited on
12 days ago
Training in progress, step 625
d2349ec
verified
chloeli
commited on
12 days ago
Training in progress, step 600
d9abfc3
verified
chloeli
commited on
12 days ago
Training in progress, step 500
76d65cb
verified
chloeli
commited on
12 days ago
Training in progress, step 400
197d26f
verified
chloeli
commited on
12 days ago
Training in progress, step 300
3b506af
verified
chloeli
commited on
12 days ago
Training in progress, step 200
a9bcbf0
verified
chloeli
commited on
12 days ago
Training in progress, step 100
c1cfee5
verified
chloeli
commited on
12 days ago
initial commit
e262a0c
verified
chloeli
commited on
12 days ago