Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
KMasaki
/
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
like
0
Text Generation
Transformers
Safetensors
open-r1/OpenR1-Math-220k
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Commit History
End of training
a0a30e7
verified
KMasaki
commited on
11 days ago
Model save
093d421
verified
KMasaki
commited on
11 days ago
Training in progress, step 3347
1182be3
verified
KMasaki
commited on
11 days ago
End of training
8578253
verified
KMasaki
commited on
11 days ago
Model save
b33b9f2
verified
KMasaki
commited on
11 days ago
Training in progress, step 3347
b4e46c3
verified
KMasaki
commited on
11 days ago
Training in progress, step 3200
87298c4
verified
KMasaki
commited on
11 days ago
Training in progress, step 3200
805daa7
verified
KMasaki
commited on
11 days ago
End of training
1b3a6df
verified
KMasaki
commited on
12 days ago
Model save
526ad82
verified
KMasaki
commited on
12 days ago
Training in progress, step 3347
cbcb7cc
verified
KMasaki
commited on
12 days ago
Training in progress, step 3200
6b75204
verified
KMasaki
commited on
12 days ago
Training in progress, step 2800
2320156
verified
KMasaki
commited on
12 days ago
Training in progress, step 2800
aaf4893
verified
KMasaki
commited on
12 days ago
Training in progress, step 2800
5f811e2
verified
KMasaki
commited on
14 days ago
Training in progress, step 2400
0a480eb
verified
KMasaki
commited on
14 days ago
Training in progress, step 2400
7a14703
verified
KMasaki
commited on
14 days ago
Training in progress, step 2400
1c54f8b
verified
KMasaki
commited on
14 days ago
Training in progress, step 2400
5bd9aca
verified
KMasaki
commited on
14 days ago
Training in progress, step 2400
1626b78
verified
KMasaki
commited on
14 days ago
Training in progress, step 2000
bbea4c9
verified
KMasaki
commited on
14 days ago
Training in progress, step 2000
d0cd53a
verified
KMasaki
commited on
14 days ago
Training in progress, step 2000
dcc88cb
verified
KMasaki
commited on
14 days ago
Training in progress, step 2000
a49ea20
verified
KMasaki
commited on
14 days ago
Training in progress, step 2000
371a47e
verified
KMasaki
commited on
15 days ago
Training in progress, step 1600
87114a4
verified
KMasaki
commited on
15 days ago
Training in progress, step 1600
075e89d
verified
KMasaki
commited on
15 days ago
Training in progress, step 1600
fb873a8
verified
KMasaki
commited on
15 days ago
Training in progress, step 1600
15e8321
verified
KMasaki
commited on
15 days ago
Training in progress, step 1600
66c1909
verified
KMasaki
commited on
15 days ago
Training in progress, step 1200
766419f
verified
KMasaki
commited on
15 days ago
Training in progress, step 1200
f3e5987
verified
KMasaki
commited on
15 days ago
Training in progress, step 1200
cf9350f
verified
KMasaki
commited on
15 days ago
Training in progress, step 1200
a82fd5c
verified
KMasaki
commited on
15 days ago
Training in progress, step 1200
6bd233c
verified
KMasaki
commited on
15 days ago
Training in progress, step 800
4247202
verified
KMasaki
commited on
15 days ago
Training in progress, step 800
ee17c08
verified
KMasaki
commited on
15 days ago
Training in progress, step 800
2480319
verified
KMasaki
commited on
15 days ago
Training in progress, step 800
81595e3
verified
KMasaki
commited on
15 days ago
Training in progress, step 800
89de184
verified
KMasaki
commited on
16 days ago
Training in progress, step 400
2f06533
verified
KMasaki
commited on
16 days ago
Training in progress, step 400
cbcacef
verified
KMasaki
commited on
16 days ago
Training in progress, step 400
28f6382
verified
KMasaki
commited on
16 days ago
Training in progress, step 400
aed6d0d
verified
KMasaki
commited on
16 days ago
End of training
9f545cf
verified
KMasaki
commited on
18 days ago
Model save
694f08c
verified
KMasaki
commited on
18 days ago
Training in progress, epoch 0
b482d92
verified
KMasaki
commited on
18 days ago
End of training
249dd33
verified
KMasaki
commited on
19 days ago
Model save
c736381
verified
KMasaki
commited on
19 days ago
Training in progress, epoch 0
e447237
verified
KMasaki
commited on
19 days ago
Previous
1
2
Next