Long Video Benchmark

non-profit

Recent Activity

Shitao

authored a paper about 1 month ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23 • 73

yzwang

authored a paper about 1 month ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23 • 73

JUNJIE99

authored 4 papers about 1 month ago

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos

Paper • 2502.12558 • Published Feb 18

Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval

Paper • 2502.11431 • Published Feb 17

VideoDeepResearch: Long Video Understanding With Agentic Tool Using

Paper • 2506.10821 • Published Jun 12 • 20

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23 • 73

sy1998

authored a paper 2 months ago

EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation with Large Multimodal Models

Paper • 2506.01667 • Published Jun 2 • 21

yzwang

authored a paper 2 months ago

MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos

Paper • 2502.12558 • Published Feb 18

sy1998

authored 4 papers 3 months ago

Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding

Paper • 2503.18478 • Published Mar 24 • 1

yzwang

authored a paper 6 months ago

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

Paper • 2502.06788 • Published Feb 10 • 13

yzwang

authored a paper 7 months ago

Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions

Paper • 2406.10638 • Published Jun 15, 2024

Shitao

authored a paper 8 months ago

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published Dec 19, 2024 • 55

yzwang

authored a paper 8 months ago

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published Dec 19, 2024 • 55

JUNJIE99

authored 2 papers 8 months ago

Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding

Paper • 2409.14485 • Published Sep 22, 2024 • 2

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published Dec 19, 2024 • 55

yzwang

authored 2 papers 10 months ago

Fine-Grained Visual Prompting

Paper • 2306.04356 • Published Jun 7, 2023

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 96

Team members 4
