Hanbin Wang

hanbin

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

updated a model 5 days ago
PRIME-RL/Eurus-2-7B-PRIME
updated a model 5 days ago
PRIME-RL/Eurus-2-7B-SFT
updated a dataset 6 days ago
PRIME-RL/Eurus-2-RL-Data

Articles

Organizations

OpenBMB, PRIME

hanbin's activity

New activity in PRIME-RL/Eurus-2-7B-PRIME 8 days ago

Evaluation

6
#1 opened 9 days ago by tugstugi
upvoted an article 9 days ago

Process Reinforcement through Implicit Rewards

By ganqu
15