taicheng guo
taicheng
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 month ago
Qwen/Qwen3-0.6B
upvoted
a
paper
about 1 month ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective
Reinforcement Learning for LLM Reasoning
upvoted
a
paper
about 1 month ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language
Models