2 32 5

Yuxin Zuo

yuxinzuo

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

upvoted a paper 8 days ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

upvoted a paper 18 days ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published 8 days ago • 18

upvoted a paper 8 days ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published 9 days ago • 107

upvoted a paper 18 days ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published 18 days ago • 78

liked a model 21 days ago

internlm/Intern-S1

Image-Text-to-Text • 241B • Updated 12 days ago • 33k • 178

liked a dataset 27 days ago

AI-MO/NuminaMath-1.5

Viewer • Updated Feb 10 • 896k • 3.7k • 156

liked a dataset about 2 months ago

ChuGyouk/MedXpertQA

Viewer • Updated Jun 15 • 4.45k • 33 • 5

upvoted a collection 3 months ago

SimpleVLA-RL

Collection

6 items • Updated Jun 15 • 2

authored a paper 3 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 128

upvoted 3 papers 3 months ago

liked a model 3 months ago

Intelligent-Internet/II-Medical-8B

Text Generation • 8B • Updated 4 days ago • 47k • • 168

upvoted 2 papers 3 months ago

X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

Paper • 2505.03981 • Published May 6 • 15

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 182

upvoted a paper 4 months ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 97

commented a paper 4 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120 •

authored 2 papers 4 months ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 22

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

upvoted 2 papers 4 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

PaperBench: Evaluating AI's Ability to Replicate AI Research

Paper • 2504.01848 • Published Apr 2 • 37

Yuxin Zuo

AI & ML interests

Recent Activity

Organizations

yuxinzuo's activity