
HAODONG DUAN

KennyUTC

https://kennymckormick.github.io

AI & ML interests

Video Understanding; Multi-Modal Learning

Recent Activity


upvoted a paper 3 days ago

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26, 2024 • 35

upvoted a paper 4 days ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published 4 days ago • 218

upvoted a paper 19 days ago

CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

Paper • 2508.03686 • Published 20 days ago • 33

upvoted a paper 21 days ago

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published 24 days ago • 62

upvoted 2 papers about 1 month ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 85

CompassJudger-2: Towards Generalist Judge Model via Verifiable Rewards

Paper • 2507.09104 • Published Jul 12 • 17

upvoted a collection about 1 month ago

OpenCompass Multi-Modal Leaderboards

Collection

6 items • Updated Jul 24 • 2

upvoted a paper 2 months ago

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published Jun 24 • 26

upvoted 3 papers 3 months ago

upvoted 2 papers 4 months ago

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6 • 94

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 281

upvoted 7 papers 5 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 48

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published Apr 10 • 34

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 197

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3 • 69

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published Mar 25 • 36

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 49

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13 • 37
