zhu
xuekai
AI & ML interests
None yet
Recent Activity
upvoted a paper 29 days ago
Reasoning with Exploration: An Entropy Perspective
upvoted a paper about 1 month ago
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
upvoted a paper about 2 months ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models