Yongliang Shen

tricktreat

AI & ML interests

None yet

Recent Activity

liked a dataset about 7 hours ago

microsoft/synthetic-computers-at-scale

upvoted a paper about 8 hours ago

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

upvoted a paper about 8 hours ago

Co-Evolving Policy Distillation

Organizations

submitted 2 papers to Daily Papers 17 days ago

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published 19 days ago • 141

authored 2 papers 18 days ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published 23 days ago • 47

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published 19 days ago • 141

authored 6 papers 25 days ago

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models

Paper • 2510.08531 • Published Oct 9, 2025 • 12

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Paper • 2602.06960 • Published Feb 6 • 14

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Paper • 2603.02578 • Published Mar 3 • 25

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Paper • 2603.15611 • Published Mar 16 • 10

CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution

Paper • 2603.17775 • Published Mar 18 • 2

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published about 1 month ago • 99

authored 3 papers 7 months ago

IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video?

Paper • 2509.24709 • Published Sep 29, 2025 • 7

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

Paper • 2509.25160 • Published Sep 29, 2025 • 32

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

Paper • 2509.25175 • Published Sep 29, 2025 • 31

authored 2 papers 8 months ago

EviNote-RAG: Enhancing RAG Models via Answer-Supportive Evidence Notes

Paper • 2509.00877 • Published Aug 31, 2025 • 3

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 50

authored 5 papers 9 months ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published Aug 12, 2025 • 37

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published Aug 7, 2025 • 17

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Paper • 2508.05614 • Published Aug 7, 2025 • 20

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published Aug 7, 2025 • 22

MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task

Paper • 2502.11684 • Published Feb 17, 2025 • 2
