46 24 36

yinanhe

ynhe

AI & ML interests

None yet

Recent Activity

updated a dataset 4 days ago

Vchitect/VBench-2.0_human_annotation

updated a Space 5 days ago

Vchitect/VBench_Leaderboard

updated a dataset 6 days ago

Vchitect/VBench_human_annotation

View all activity

Organizations

authored 7 papers 10 months ago

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12, 2024 • 31

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Paper • 2303.16727 • Published Mar 29, 2023

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 34

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Paper • 2501.00574 • Published Dec 31, 2024 • 6

InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling

Paper • 2501.12386 • Published Jan 21, 2025 • 1

DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency

Paper • 2501.10110 • Published Jan 17, 2025 • 1

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Paper • 2501.08453 • Published Jan 14, 2025 • 1

authored 7 papers almost 2 years ago

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Paper • 2212.03191 • Published Dec 6, 2022

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Paper • 2401.15071 • Published Jan 26, 2024 • 37

VideoMamba: State Space Model for Efficient Video Understanding

Paper • 2403.06977 • Published Mar 11, 2024 • 29

authored 2 papers about 2 years ago

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark

Paper • 2311.17005 • Published Nov 28, 2023 • 2

VBench: Comprehensive Benchmark Suite for Video Generative Models

Paper • 2311.17982 • Published Nov 29, 2023 • 9

authored 4 papers over 2 years ago

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

Paper • 2309.15103 • Published Sep 26, 2023 • 42

VideoChat: Chat-Centric Video Understanding

Paper • 2305.06355 • Published May 10, 2023 • 3

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

Paper • 2307.06942 • Published Jul 13, 2023 • 23

InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language

Paper • 2305.05662 • Published May 9, 2023 • 4

yinanhe

AI & ML interests

Recent Activity

Organizations

ynhe's activity