16 30 16

Yuzhen Huang

yuzhen17

https://hyz17.github.io

HYZ17

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

upvoted a paper 10 days ago

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

authored a paper 10 days ago

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

View all activity

Organizations

yuzhen17's activity

upvoted 2 papers 10 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 10 days ago • 116

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Paper • 2505.19641 • Published 13 days ago • 64

authored a paper 10 days ago

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Paper • 2505.22203 • Published 11 days ago • 6

upvoted 2 papers 10 days ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published 13 days ago • 101

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Paper • 2505.22203 • Published 11 days ago • 6

commented a paper 10 days ago

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Paper • 2505.22203 • Published 11 days ago • 6 •

updated a dataset 11 days ago

hkust-nlp/rl-verifier-pitfalls_hacking_data

Viewer • Updated 11 days ago • 6.12k • 84 • 1

published a dataset 11 days ago

hkust-nlp/rl-verifier-pitfalls_hacking_data

Viewer • Updated 11 days ago • 6.12k • 84 • 1

updated a dataset 11 days ago

hkust-nlp/deepscaler_simplelr

Viewer • Updated 11 days ago • 40.3k • 44

published a dataset 11 days ago

hkust-nlp/deepscaler_simplelr

Viewer • Updated 11 days ago • 40.3k • 44

published a model 11 days ago

hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier

Reinforcement Learning • Updated 11 days ago • 7

updated a model 11 days ago

hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier

Reinforcement Learning • Updated 11 days ago • 7

published a model 11 days ago

hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B

Reinforcement Learning • Updated 11 days ago • 5

updated a model 11 days ago

hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B

Reinforcement Learning • Updated 11 days ago • 5

published a model 11 days ago

hkust-nlp/Qwen-2.5-7B-Verifier-HF

Reinforcement Learning • Updated 11 days ago • 5

updated a model 11 days ago

hkust-nlp/Qwen-2.5-7B-Verifier-HF

Reinforcement Learning • Updated 11 days ago • 5

published a model 11 days ago

hkust-nlp/R1-Distill-Verifier-1.5B

Updated 11 days ago • 11

updated a model 11 days ago

hkust-nlp/R1-Distill-Verifier-1.5B

Updated 11 days ago • 11

published a model 11 days ago

hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B

Reinforcement Learning • Updated 11 days ago • 8

updated a model 11 days ago

hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B

Reinforcement Learning • Updated 11 days ago • 8