2 32 5

Yuxin Zuo

yuxinzuo

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

upvoted a paper 8 days ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

upvoted a paper 18 days ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published 8 days ago • 18

upvoted a paper 8 days ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published 9 days ago • 107

upvoted a paper 18 days ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published 18 days ago • 78

upvoted a collection 3 months ago

SimpleVLA-RL

Collection

6 items • Updated Jun 15 • 2

upvoted 5 papers 3 months ago

upvoted 3 papers 4 months ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 97

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

PaperBench: Evaluating AI's Ability to Replicate AI Research

Paper • 2504.01848 • Published Apr 2 • 37

upvoted 3 papers 5 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 29

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 88

MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning

Paper • 2503.07459 • Published Mar 10 • 16

upvoted 5 papers 6 months ago

Optimal Brain Apoptosis

Paper • 2502.17941 • Published Feb 25 • 10

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Feb 26 • 64

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Paper • 2502.13347 • Published Feb 19 • 29

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Paper • 2502.13145 • Published Feb 18 • 38

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 165

Yuxin Zuo

AI & ML interests

Recent Activity

Organizations

yuxinzuo's activity