Yiping Wang

ypwang61

https://ypwang61.github.io/

AI & ML interests

machine learning

Recent Activity

upvoted an article 3 days ago

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

upvoted a collection 29 days ago

Spurious Rewards

updated a collection 30 days ago

One-Shot RLVR

Organizations

None yet

upvoted an article 3 days ago

Article

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

and 17 others • 3 days ago • 33

upvoted a collection 29 days ago

Spurious Rewards

Collection

Spurious Rewards: Rethinking Training Signals in RLVR • 14 items • Updated 29 days ago • 2

updated a collection 30 days ago

One-Shot RLVR

Collection

Collections of models and papers for works: "Reinforcement Learning for Reasoning in Large Language Models with One Training Example" • 14 items • Updated 30 days ago • 1

updated a model about 1 month ago

ypwang61/intermediate-qwen25-7b-step300

8B • Updated Jun 12 • 5

published a model about 1 month ago

ypwang61/intermediate-qwen25-7b-step300

8B • Updated Jun 12 • 5

updated a collection about 1 month ago

One-Shot RLVR

Collection

Collections of models and papers for works: "Reinforcement Learning for Reasoning in Large Language Models with One Training Example" • 14 items • Updated 30 days ago • 1

upvoted a paper about 1 month ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 243

updated a model about 1 month ago

ypwang61/One-Shot-RLVR-Qwen2.5-7B-1.2k-dsr-sub

8B • Updated Jun 8 • 21

published a model about 1 month ago

ypwang61/One-Shot-RLVR-Qwen2.5-7B-1.2k-dsr-sub

8B • Updated Jun 8 • 21

updated a model about 1 month ago

ypwang61/One-Shot-RLVR-Qwen2.5-7B-pi1

8B • Updated Jun 8 • 63

published a model about 1 month ago

ypwang61/One-Shot-RLVR-Qwen2.5-7B-pi1

8B • Updated Jun 8 • 63

updated 2 models about 1 month ago

ypwang61/sharp_s180

8B • Updated Jun 3 • 2

ypwang61/sharp_s1560

2B • Updated Jun 3 • 1

published 2 models about 1 month ago

ypwang61/sharp_s180

8B • Updated Jun 3 • 2

ypwang61/sharp_s1560

2B • Updated Jun 3 • 1

updated a model about 1 month ago

ypwang61/One-Shot-RLVR-R1-Distill-1.5B-4-shot

2B • Updated Jun 3 • 21
