RyanYr/ppo-respL-dapo-r1-qwen2.5math-1.5B-base-lr-mbs64_critic
ppo-respL-dapo-r1-qwen2.5math-1.5B-base-lr-mbs64_critic/optim_world_size_4_rank_2.pt (main)
Commit History
Save model at global step 400 (16b1863, verified): RyanYr committed on May 13
Save model at global step 360 (6b43047, verified): RyanYr committed on May 13
Save model at global step 320 (711ed28, verified): RyanYr committed on May 13
Save model at global step 280 (4dc90be, verified): RyanYr committed on May 13
Save model at global step 240 (75e09eb, verified): RyanYr committed on May 13
Save model at global step 200 (7a60498, verified): RyanYr committed on May 12
Save model at global step 160 (6944f26, verified): RyanYr committed on May 12
Save model at global step 120 (424b06c, verified): RyanYr committed on May 12
Save model at global step 80 (c203df7, verified): RyanYr committed on May 12
Save model at global step 40 (6e59593, verified): RyanYr committed on May 12
Save model at global step 80 (9bbe4da, verified): RyanYr committed on May 12
Save model at global step 40 (04840f9, verified): RyanYr committed on May 12