Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mlxha
/
Qwen3-4B-grpo-medmcqa
like
1
Text Generation
Transformers
Safetensors
mlxha/medmcqa-grpo
qwen3
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
Qwen3-4B-grpo-medmcqa
Commit History
End of training
b8a3a76
verified
mlxha
commited on
May 12
Model save
fd1449a
verified
mlxha
commited on
May 12
Training in progress, step 1904
c8bbfdc
verified
mlxha
commited on
May 12
Training in progress, step 1900
e1f4c90
verified
mlxha
commited on
May 12
Training in progress, step 1850
da72552
verified
mlxha
commited on
May 12
Training in progress, step 1800
5f97930
verified
mlxha
commited on
May 12
Training in progress, step 1750
0f7971e
verified
mlxha
commited on
May 12
Training in progress, step 1700
de8e8d9
verified
mlxha
commited on
May 11
Training in progress, step 1650
2acd610
verified
mlxha
commited on
May 11
Training in progress, step 1600
8b91008
verified
mlxha
commited on
May 11
Training in progress, step 1550
2457cfb
verified
mlxha
commited on
May 11
Training in progress, step 1500
29b5862
verified
mlxha
commited on
May 11
Training in progress, step 1450
ab289f2
verified
mlxha
commited on
May 11
Training in progress, step 1400
ead2726
verified
mlxha
commited on
May 10
Training in progress, step 1350
ddfedaa
verified
mlxha
commited on
May 10
Training in progress, step 1300
31242ba
verified
mlxha
commited on
May 10
Training in progress, step 1250
bd2d1c5
verified
mlxha
commited on
May 10
Training in progress, step 1200
958ea98
verified
mlxha
commited on
May 10
Training in progress, step 1150
9fcf2ef
verified
mlxha
commited on
May 10
Training in progress, step 1100
60ef28d
verified
mlxha
commited on
May 10
Training in progress, step 1050
cd0dc6c
verified
mlxha
commited on
May 10
Training in progress, step 1000
b992f8c
verified
mlxha
commited on
May 9
Training in progress, step 950
f992c77
verified
mlxha
commited on
May 9
Training in progress, step 900
84f2529
verified
mlxha
commited on
May 9
Training in progress, step 850
09cf976
verified
mlxha
commited on
May 9
Training in progress, step 800
9890107
verified
mlxha
commited on
May 9
Training in progress, step 750
34ec3a7
verified
mlxha
commited on
May 9
Training in progress, step 700
c9bfa59
verified
mlxha
commited on
May 9
Training in progress, step 650
c4c1eb3
verified
mlxha
commited on
May 9
Training in progress, step 600
eaf6935
verified
mlxha
commited on
May 9
Training in progress, step 550
1127bf5
verified
mlxha
commited on
May 9
Training in progress, step 500
ee3b956
verified
mlxha
commited on
May 8
Training in progress, step 450
c50bda0
verified
mlxha
commited on
May 8
Training in progress, step 400
cc66861
verified
mlxha
commited on
May 8
Training in progress, step 350
3d0dfc0
verified
mlxha
commited on
May 8
Training in progress, step 300
9e3a888
verified
mlxha
commited on
May 8
Training in progress, step 250
fad2ef1
verified
mlxha
commited on
May 7
Training in progress, step 200
9dab78a
verified
mlxha
commited on
May 7
Training in progress, step 150
cfd7f91
verified
mlxha
commited on
May 7
Training in progress, step 100
0f3673d
verified
mlxha
commited on
May 7
Training in progress, step 50
13de164
verified
mlxha
commited on
May 7
initial commit
1071e6a
verified
mlxha
commited on
May 6