Hanbin Wang

hanbin

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

updated a model 5 days ago
PRIME-RL/Eurus-2-7B-PRIME
updated a model 5 days ago
PRIME-RL/Eurus-2-7B-SFT
updated a dataset 6 days ago
PRIME-RL/Eurus-2-RL-Data
View all activity

Articles

Organizations

OpenBMB's profile picture PRIME's profile picture

hanbin's activity

upvoted an article 9 days ago
view article
Article

Process Reinforcement through Implicit Rewards

By ganqu
15