Dense World

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

LXT authored a paper 23 days ago

Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models

LXT authored a paper 23 days ago

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions

LXT authored a paper 23 days ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

View all activity

LXT

authored 3 papers 23 days ago

Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models

Paper • 2505.24164 • Published May 30

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions

Paper • 2506.13691 • Published Jun 16 • 2

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published 24 days ago • 46

onion-liu

authored a paper about 1 month ago

Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset

Paper • 2506.18851 • Published Jun 23 • 29

LXT

authored 10 papers about 2 months ago

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Paper • 2505.23606 • Published May 29 • 14

Conditional Panoramic Image Generation via Masked Autoregressive Modeling

Paper • 2505.16862 • Published May 22

MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query

Paper • 2506.03144 • Published Jun 3 • 3

BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation

Paper • 2505.12620 • Published May 19

CyberV: Cybernetics for Test-time Scaling in Video Understanding

Paper • 2506.07971 • Published Jun 9 • 4

DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers

Paper • 2505.21541 • Published May 24 • 7

HarborYuan

authored a paper 3 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 83

LXT

authored 2 papers 3 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 83

DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency

Paper • 2504.12080 • Published Apr 16 • 8

daviddousa

authored 3 papers 3 months ago

Detection and Tracking Meet Drones Challenge

Paper • 2001.06303 • Published Jan 16, 2020

Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark

Paper • 2105.02440 • Published May 6, 2021

Vidi: Large Multimodal Models for Video Understanding and Editing

Paper • 2504.15681 • Published Apr 22 • 15

AI & ML interests

Recent Activity

Team members 8

Dense-World's activity