longlian/Qwen2-0.5B-GRPO-demo
Tags: Text Generation, Transformers, Safetensors, qwen2, Generated from Trainer, trl, grpo, conversational, text-generation-inference, Inference Endpoints, arxiv:2402.03300
Commit History (main branch)
Model save (fb9c3e6, verified): longlian committed 18 days ago
Training in progress, step 226 (3e63327, verified): longlian committed 18 days ago
Training in progress, step 220 (4320130, verified): longlian committed 18 days ago
Training in progress, step 210 (746e71f, verified): longlian committed 18 days ago
Training in progress, step 200 (9c78ada, verified): longlian committed 18 days ago
Training in progress, step 190 (57e7e60, verified): longlian committed 18 days ago
Training in progress, step 180 (a08d857, verified): longlian committed 18 days ago
Training in progress, step 170 (b9c0653, verified): longlian committed 18 days ago
Training in progress, step 160 (696779f, verified): longlian committed 18 days ago
Training in progress, step 150 (4c0ca55, verified): longlian committed 18 days ago
Training in progress, step 140 (a0ed3c8, verified): longlian committed 18 days ago
Training in progress, step 130 (5dc9936, verified): longlian committed 18 days ago
Training in progress, step 120 (962fcf3, verified): longlian committed 18 days ago
Training in progress, step 110 (a37d5f8, verified): longlian committed 18 days ago
Training in progress, step 100 (3857c6f, verified): longlian committed 18 days ago
Training in progress, step 90 (5b1cc46, verified): longlian committed 18 days ago
Training in progress, step 80 (05175d9, verified): longlian committed 18 days ago
Training in progress, step 70 (685aad0, verified): longlian committed 18 days ago
Training in progress, step 60 (30f8ebd, verified): longlian committed 18 days ago
Training in progress, step 50 (8300ad3, verified): longlian committed 18 days ago
Training in progress, step 40 (a50a8d3, verified): longlian committed 18 days ago
Training in progress, step 30 (a272c8d, verified): longlian committed 18 days ago
Training in progress, step 20 (09e30e9, verified): longlian committed 18 days ago
Training in progress, step 10 (08f31c7, verified): longlian committed 18 days ago
Training in progress, step 10 (9583747, verified): longlian committed 18 days ago
Model save (7103a30, verified): longlian committed 19 days ago
Training in progress, step 113 (33fbcfa, verified): longlian committed 19 days ago
Training in progress, step 110 (49db5bd, verified): longlian committed 19 days ago
Training in progress, step 100 (f4d935c, verified): longlian committed 19 days ago
Training in progress, step 90 (cfca36a, verified): longlian committed 19 days ago
Training in progress, step 80 (99d6035, verified): longlian committed 19 days ago
Training in progress, step 70 (6ed9dcb, verified): longlian committed 19 days ago
Training in progress, step 60 (b5e8c48, verified): longlian committed 19 days ago
Training in progress, step 50 (15e9abb, verified): longlian committed 19 days ago
Training in progress, step 40 (c5ea79c, verified): longlian committed 19 days ago
Training in progress, step 30 (6e3cdb0, verified): longlian committed 19 days ago
Training in progress, step 20 (321f9b2, verified): longlian committed 19 days ago
Training in progress, step 10 (21269d9, verified): longlian committed 19 days ago
initial commit (5b46519, verified): longlian committed 19 days ago