20 8 5

Hanbin Wang

hanbin

https://wanghanbinpanda.github.io/

wanghanbinpanda

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

liked a model about 1 month ago

openbmb/MiniCPM-o-4_5

upvoted a paper 5 months ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

upvoted a paper 6 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

View all activity

Organizations

liked a model about 1 month ago

openbmb/MiniCPM-o-4_5

Any-to-Any • 9B • Updated 4 days ago • 75.4k • 905

upvoted a paper 5 months ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29, 2025 • 22

upvoted 2 papers 6 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 125

published a dataset 7 months ago

hanbin/dolmino-mix-1124-pes2o-hf-3

Updated Aug 25, 2025 • 6

updated a dataset 7 months ago

hanbin/dolmino-mix-1124-pes2o-hf

Viewer • Updated Aug 25, 2025 • 1.09M • 30

published a dataset 7 months ago

hanbin/dolmino-mix-1124-pes2o-hf

Viewer • Updated Aug 25, 2025 • 1.09M • 30

updated 2 models 7 months ago

hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B_oasst1_wildchat

Text Generation • 8B • Updated Jul 29, 2025 • 1

hanbin/Llama-3.1-8B-pes2o-anneal-2.7B_oasst1_wildchat

Text Generation • 8B • Updated Jul 29, 2025 • 1

published 2 models 7 months ago

hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B_oasst1_wildchat

Text Generation • 8B • Updated Jul 29, 2025 • 1

hanbin/Llama-3.1-8B-pes2o-anneal-2.7B_oasst1_wildchat

Text Generation • 8B • Updated Jul 29, 2025 • 1

updated 2 models 8 months ago

hanbin/Llama-3.1-8B-pes2o-anneal-2.7B

Text Generation • 8B • Updated Jul 28, 2025 • 2

hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B

Text Generation • 8B • Updated Jul 28, 2025 • 1

published 2 models 8 months ago

hanbin/Llama-3.1-8B-pretrain-1-pes2o-anneal-1B

Text Generation • 8B • Updated Jul 28, 2025 • 1

hanbin/Llama-3.1-8B-pes2o-anneal-2.7B

Text Generation • 8B • Updated Jul 28, 2025 • 2

updated a model 8 months ago

hanbin/Qwen2.5-7B-pattern-mixed-6epoch

Text Generation • 8B • Updated Jul 23, 2025 • 1

published a model 8 months ago

hanbin/Qwen2.5-7B-pattern-mixed-6epoch

Text Generation • 8B • Updated Jul 23, 2025 • 1

updated a model 8 months ago

hanbin/Llama-3.1-8B-pretrain-1

Text Generation • 8B • Updated Jul 14, 2025 • 1

published a model 8 months ago

hanbin/Llama-3.1-8B-pretrain-1

Text Generation • 8B • Updated Jul 14, 2025 • 1

updated a model 12 months ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

Text Generation • 8B • Updated Mar 14, 2025 • 2 • 2

Hanbin Wang

AI & ML interests

Recent Activity

Organizations

hanbin's activity