colinpannikkat/OpenRS-RLoRA-LoftQ-R32-Cosine-Len
Text Generation
Transformers
Safetensors
knoveleng/open-rs
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
End of training (2917d40, verified), colinpannikkat committed on Jun 5
Model save (ea356b9, verified), colinpannikkat committed on Jun 5
Training in progress, step 500 (cacdf9b, verified), colinpannikkat committed on Jun 5
Training in progress, step 450 (774fed6, verified), colinpannikkat committed on Jun 5
Training in progress, step 400 (499593c, verified), colinpannikkat committed on Jun 5
Training in progress, step 350 (c537f90, verified), colinpannikkat committed on Jun 5
Training in progress, step 300 (5ad2d72, verified), colinpannikkat committed on Jun 5
Training in progress, step 250 (f03e874, verified), colinpannikkat committed on Jun 5
Training in progress, step 200 (97d2d6d, verified), colinpannikkat committed on Jun 5
Training in progress, step 150 (145d606, verified), colinpannikkat committed on Jun 5
Training in progress, step 100 (ba7dcb8, verified), colinpannikkat committed on Jun 5
Training in progress, step 50 (bc5ee43, verified), colinpannikkat committed on Jun 5
initial commit (54e4846, verified), colinpannikkat committed on Jun 5