Xiangyu Yue's picture

1

Xiangyu Yue

xyyue

·

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

authored a paper 29 days ago

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

authored a paper 3 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

View all activity

Organizations

None yet

authored a paper 1 day ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published 3 days ago • 23

authored a paper 29 days ago

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

Paper • 2505.21327 • Published 30 days ago • 83

authored 2 papers 3 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 79

Unleashing Vecset Diffusion Model for Fast Shape Generation

Paper • 2503.16302 • Published Mar 20 • 44

authored 2 papers 4 months ago

Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model

Paper • 2502.16779 • Published Feb 24 • 3

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

Paper • 2502.16707 • Published Feb 23 • 13

authored 2 papers 7 months ago

Chimera: Improving Generalist Model with Domain-Specific Experts

Paper • 2412.05983 • Published Dec 8, 2024 • 9

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

authored a paper 8 months ago

Remember, Retrieve and Generate: Understanding Infinite Visual Concepts as Your Personalized Assistant

Paper • 2410.13360 • Published Oct 17, 2024 • 9

authored a paper 9 months ago

Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations

Paper • 2410.08049 • Published Oct 10, 2024 • 8