Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ajarts88
/
OpenRS-RLoRA-LoftQ-R16
like
0
Text Generation
Transformers
Safetensors
knoveleng/open-rs
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
OpenRS-RLoRA-LoftQ-R16
Commit History
End of training
b13d6d9
verified
ajarts88
commited on
Jun 13
Model save
d84f8bc
verified
ajarts88
commited on
Jun 13
Training in progress, step 500
4c8a9d7
verified
ajarts88
commited on
Jun 13
Training in progress, step 450
c7581b0
verified
ajarts88
commited on
Jun 13
Training in progress, step 400
90c7853
verified
ajarts88
commited on
Jun 13
Training in progress, step 350
5b2146a
verified
ajarts88
commited on
Jun 13
Training in progress, step 300
907c7b4
verified
ajarts88
commited on
Jun 13
Training in progress, step 250
7d8f292
verified
ajarts88
commited on
Jun 13
Training in progress, step 200
cd555c6
verified
ajarts88
commited on
Jun 13
Training in progress, step 150
252e161
verified
ajarts88
commited on
Jun 13
Training in progress, step 100
e8983c0
verified
ajarts88
commited on
Jun 13
Training in progress, step 50
833ad56
verified
ajarts88
commited on
Jun 13
initial commit
87cb203
verified
ajarts88
commited on
Jun 13