Shiyu Huang's picture

7 6 14

Shiyu Huang

ShiyuHuang

·

https://huangshiyu13.github.io/

AI & ML interests

RL, Game AI, NLP, CV

Recent Activity

upvoted a paper about 1 month ago

The Leaderboard Illusion

commented on a paper 3 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

updated a collection 3 months ago

video_benchmark

View all activity

Organizations

ShiyuHuang's activity

upvoted a paper about 1 month ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

commented a paper 3 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 27 •

updated a collection 3 months ago

video_benchmark

3 items • Updated Feb 27

upvoted a paper 3 months ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 86

updated a collection 3 months ago

Reasoning

2 items • Updated Feb 27

New activity in THUDM/cogvlm2-llama3-caption 5 months ago

keep mentioning "bilibili" watermark

#6 opened 7 months ago by

中文效果怎么样呢

#1 opened 9 months ago by

authored a paper 5 months ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 45

liked a dataset 5 months ago

THUDM/MotionBench

Viewer • Updated Jan 8 • 5k • 4.44k • 2

upvoted a paper 5 months ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 45

authored a paper 5 months ago

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Paper • 2412.21059 • Published Dec 30, 2024 • 19

liked a dataset 5 months ago

AIWinter/LVBench

Updated Sep 13, 2024 • 329 • 6

updated a Space 5 months ago

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

liked a model 5 months ago

THUDM/VisionReward-Video

Text Generation • Updated Jan 1 • 517 • 5

liked a Space 6 months ago

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

updated 3 Spaces 6 months ago

LVBench Leaderboard

Submit model evaluations to a leaderboard

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard