Shenzhi Wang
shenzhi-wang
AI & ML interests
Large Language Model, Reinforcement Learning, and AI Agents
Recent Activity
upvoted a paper 7 days ago
Group Sequence Policy Optimization
upvoted a paper about 1 month ago
Reinforcement Pre-Training
authored a paper about 2 months ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning