1 10 2

Kaiyuan Chen

Lucky2022

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

upvoted a paper 3 days ago

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

liked a dataset 3 days ago

xbench/ScienceQA

View all activity

Organizations

Lucky2022's activity

authored a paper 2 days ago

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Paper • 2506.13651 • Published 5 days ago • 9

upvoted a paper 3 days ago

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Paper • 2506.13651 • Published 5 days ago • 9

liked 2 datasets 3 days ago

xbench/ScienceQA

Viewer • Updated 3 days ago • 100 • 198 • 7

xbench/DeepSearch

Viewer • Updated 3 days ago • 100 • 259 • 4

upvoted 3 papers about 1 month ago

upvoted a paper about 2 months ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21 • 75

upvoted a paper 2 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 85

upvoted 3 papers 4 months ago

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Paper • 2312.00849 • Published Dec 1, 2023 • 12

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 85

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

Paper • 2502.17262 • Published Feb 24 • 21

commented a paper 4 months ago

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

Paper • 2502.17262 • Published Feb 24 • 21 •

authored 2 papers 4 months ago

CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding

Paper • 2405.02384 • Published May 3, 2024

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

Paper • 2502.17262 • Published Feb 24 • 21

upvoted a paper 4 months ago

MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion

Paper • 2502.04235 • Published Feb 6 • 22

updated a dataset over 1 year ago

Lucky2022/coda

Viewer • Updated Jan 26, 2024 • 302 • 50

updated a Space over 1 year ago

AutoTrain Advanced

🚀