Shenzhi Wang

shenzhi-wang

https://shenzhi-wang.netlify.app/

ShenzhiWang_THU

AI & ML interests

Large Language Model, Reinforcement Learning, and AI Agents

Recent Activity

upvoted a paper 8 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 9 days ago • 257

upvoted a paper about 1 month ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 252

authored a paper about 2 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 176

upvoted a paper about 2 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 176

commented on a paper about 2 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 176

upvoted a paper 2 months ago

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19 • 26

upvoted 2 papers 3 months ago

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 269

authored a paper 3 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 181

upvoted 2 papers 3 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 181

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 133

authored 2 papers 4 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

upvoted a paper 4 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

upvoted 2 papers 5 months ago

4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models

Paper • 2503.10437 • Published Mar 13 • 33

ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

Paper • 2502.18364 • Published Feb 25 • 37

updated a model 6 months ago

xwen-team/Xwen-0.5B-Chat

0.6B • Updated Feb 12 • 4

published a model 6 months ago

xwen-team/Xwen-0.5B-Chat

0.6B • Updated Feb 12 • 4

upvoted a collection 6 months ago

Xwen-Chat

Collection

6 items • Updated Feb 3 • 12

updated a model 6 months ago

xwen-team/Xwen-7B-Chat

Text Generation • 8B • Updated Feb 4 • 15 • 32
