Xinyu Zhu
TianHongZXY
AI & ML interests
Large Language Models; Reasoning; Reinforcement Learning
Recent Activity
upvoted a paper about 2 hours ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
updated a model 14 days ago
meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval
published a model 23 days ago
meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval