Hao Li's picture

20

Hao Li

Richardleee

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure

upvoted a paper 20 days ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

upvoted a paper 20 days ago

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

View all activity

Organizations

Richardleee's activity

upvoted a paper 2 days ago

Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure

Paper • 2506.12278 • Published 6 days ago • 15

upvoted 3 papers 20 days ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published 23 days ago • 66

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published 22 days ago • 45

Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Paper • 2505.23747 • Published 22 days ago • 67

upvoted a paper 21 days ago

SWE-bench Goes Live!

Paper • 2505.23419 • Published 22 days ago • 21

upvoted 2 papers 29 days ago

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published about 1 month ago • 74

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published about 1 month ago • 54

upvoted 9 papers 3 months ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31 • 38

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published Mar 31 • 77

Z1: Efficient Test-time Scaling with Code

Paper • 2504.00810 • Published Apr 1 • 26

Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation

Paper • 2503.22675 • Published Mar 28 • 35

PHYSICS: Benchmarking Foundation Models on University-Level Physics Problem Solving

Paper • 2503.21821 • Published Mar 26 • 17

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 91

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 45

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6 • 69

IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval

Paper • 2503.04644 • Published Mar 6 • 21

upvoted 4 papers 5 months ago

GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21 • 15

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Paper • 2501.12375 • Published Jan 21 • 22

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published Jan 21 • 47

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 86