4 20 5

Kanzhi Cheng

cckevinn

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios

updated a dataset about 1 month ago

cckevinn/GUI-Actor-Data

liked a dataset about 1 month ago

cckevinn/GUI-Actor-Data

View all activity

Organizations

upvoted a paper 17 days ago

From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios

Paper • 2506.20279 • Published 22 days ago • 18

updated a dataset about 1 month ago

cckevinn/GUI-Actor-Data

Updated about 1 month ago • 639 • 8

liked a dataset about 1 month ago

cckevinn/GUI-Actor-Data

Updated about 1 month ago • 639 • 8

published a dataset about 1 month ago

cckevinn/GUI-Actor-Data

Updated about 1 month ago • 639 • 8

authored a paper about 1 month ago

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3 • 48

upvoted 2 papers about 1 month ago

A Controllable Examination for Long-Context Language Models

Paper • 2506.02921 • Published Jun 3 • 32

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3 • 48

liked a model about 1 month ago

microsoft/GUI-Actor-7B-Qwen2-VL

Image-Text-to-Text • 8B • Updated Jun 10 • 2.51k • 35

authored a paper about 2 months ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 102

upvoted a paper about 2 months ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 102

upvoted a paper 3 months ago

Could Thinking Multilingually Empower LLM Reasoning?

Paper • 2504.11833 • Published Apr 16 • 29

authored a paper 3 months ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published Apr 11 • 55

upvoted 2 papers 3 months ago

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published Apr 11 • 55

Breaking the Data Barrier -- Building GUI Agents Through Task Generalization

Paper • 2504.10127 • Published Apr 14 • 17

upvoted 3 papers 4 months ago

STEVE: AStep Verification Pipeline for Computer-use Agent Training

Paper • 2503.12532 • Published Mar 16 • 17

φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Paper • 2503.13288 • Published Mar 17 • 52

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 26

commented a paper 4 months ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 26 •

authored a paper 4 months ago

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era

Paper • 2503.12329 • Published Mar 16 • 26

liked a Space 4 months ago

CapArena Auto 1

🥇

Display Leaderboard of LLM Model Evaluations

Kanzhi Cheng

AI & ML interests

Recent Activity

Organizations

cckevinn's activity

CapArena Auto 1