Yurun Yuan
RyanYr
Recent Activity
updated a model less than a minute ago: RyanYr/brm-dapo-llama3.2-3B-it_lr2.5e-6_rspL4096-mbs1024-beta0.001
published a model about 4 hours ago: RyanYr/brm-dapo-llama3.2-3B-it_lr2.5e-6_rspL4096-mbs1024-beta0.001
updated a dataset about 4 hours ago: RyanYr/brm-dapo-llama3.2-3B-it_lr2.5e-6_rspL4096-mbs1024-beta0.002_matheval
Collections 2
models 399
RyanYr/brm-dapo-llama3.2-3B-it_lr2.5e-6_rspL4096-mbs1024-beta0.001 • Updated
RyanYr/brm-dapo-llama3.2-3B-it_lr2.5e-6_rspL4096-mbs1024-beta0.002 • Updated
RyanYr/brm-dapo-llama3.2-3B-it_lr2e-6_rspL4096-mbs1024-beta0.001 • Updated • 49
RyanYr/brm-dapo-llama3.2-3B-it_lr1.5e-6_rspL4096-mbs1024-beta0.001 • Updated • 46
RyanYr/brm-dapo-llama3.2-3B-it_lr1e-6_rspL4096-mbs1024-beta0.001 • Updated • 2
RyanYr/ppo-dapo-llama3.2-3B-it-lr-bs1024-mbs256_critic • Updated • 5
RyanYr/ppo-dapo-llama3.2-3B-it-lr-bs1024-mbs256_actor • Updated • 87
RyanYr/brm-dapo-qwen2.5math-1.5B-base-lr2.5e-6-bfsz1024-mbs1024-beta0.002 • Updated • 4
RyanYr/ppo-DAPONumina-llama3.2-3B-it-lr-mbs64_critic • Updated • 9
RyanYr/ppo-DAPONumina-llama3.2-3B-it-lr-mbs64_actor • Updated • 45
datasets 906
RyanYr/brm-dapo-llama3.2-3B-it_lr2.5e-6_rspL4096-mbs1024-beta0.002_matheval • Updated
RyanYr/brm-dapo-llama3.2-3B-it_lr2e-6_rspL4096-mbs1024-beta0.001_matheval • Updated • 53
RyanYr/brm-dapo-llama3.2-3B-it_lr1.5e-6_rspL4096-mbs1024-beta0.001_matheval • Updated • 58
RyanYr/ppo-dapo-llama3.2-3B-it-lr-bs1024-mbs256_actor_matheval • Updated • 243
RyanYr/brm-dapo-qwen2.5math-1.5B-base-lr2.5e-6-bfsz1024-mbs1024-beta0.002_matheval • Viewer • Updated • 1.55k • 51
RyanYr/brm-dapo-llama3.2-3B-it_lr1e-6_rspL4096-beta0.001_mixed16_matheval • Viewer • Updated • 28.9k • 140
RyanYr/ppo-dapo-llama3.2-3B-it-lr-mbs64_actor_mixed16_matheval • Viewer • Updated • 28.9k • 137
RyanYr/ppo-DAPONumina-llama3.2-3B-it-lr-mbs64_actor_matheval • Updated • 180
RyanYr/test • Viewer • Updated • 100 • 97
RyanYr/brm-dapo-qwen2.5math-7B-base-lr2e-6-beta0.002_mixed16_matheval • Updated • 144