Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
colinpannikkat
/
OpenRS-RLoRA-LoftQ-R32-4
like
0
Text Generation
Transformers
Safetensors
knoveleng/open-rs
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
OpenRS-RLoRA-LoftQ-R32-4
Commit History
End of training
9c80b66
verified
colinpannikkat
commited on
Jun 10
Model save
529bd45
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 500
b6ea48c
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 450
0f3b8f2
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 400
a4e7458
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 350
39d338f
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 300
227bdb0
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 250
c7dedb3
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 200
7e138bc
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 150
f96dfee
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 100
80d3992
verified
colinpannikkat
commited on
Jun 10
Training in progress, step 50
2fd3316
verified
colinpannikkat
commited on
Jun 10
initial commit
9196f12
verified
colinpannikkat
commited on
Jun 10