Xiangyu Hong
lilhong
AI & ML interests
NLP
Recent Activity
upvoted
a
paper
1 day ago
SSRL: Self-Search Reinforcement Learning
upvoted
a
paper
29 days ago
On the token distance modeling ability of higher RoPE attention
dimension
upvoted
a
paper
3 months ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language
Models
Organizations
None yet