16 14 5

Haobo Yuan

HarborYuan

https://yuanhaobo.me

AI & ML interests

computer vision

Recent Activity

upvoted a paper 6 days ago

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

updated a model 13 days ago

HarborYuan/Sa2VA-LLaVA-1.5-7B

published a model 13 days ago

HarborYuan/Sa2VA-LLaVA-1.5-7B

View all activity

Organizations

upvoted a paper 6 days ago

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Paper • 2606.19534 • Published 11 days ago • 63

updated a model 13 days ago

HarborYuan/Sa2VA-LLaVA-1.5-7B

Image-Text-to-Text • 7B • Updated 13 days ago • 18

published a model 13 days ago

HarborYuan/Sa2VA-LLaVA-1.5-7B

Image-Text-to-Text • 7B • Updated 13 days ago • 18

published a model 18 days ago

HarborYuan/Sa2VA-Qwen3-VL-4B-SAM3

Image-Text-to-Text • 5B • Updated 18 days ago • 55

updated a model 18 days ago

HarborYuan/Sa2VA-Qwen3-VL-4B-SAM3

Image-Text-to-Text • 5B • Updated 18 days ago • 55

updated a model 27 days ago

HarborYuan/dcvr2-gaussian

Updated 27 days ago

published a model 27 days ago

HarborYuan/dcvr2-gaussian

Updated 27 days ago

updated a dataset about 2 months ago

HarborYuan/VRT-Eval

Updated May 7 • 23

upvoted a paper 5 months ago

SAMTok: Representing Any Mask with Two Words

Paper • 2601.16093 • Published Jan 22 • 44

updated a dataset 7 months ago

HarborYuan/subench

Updated Dec 9, 2025 • 145

published a dataset 7 months ago

HarborYuan/VRT-Eval

Updated May 7 • 23

updated 2 models 7 months ago

HarborYuan/R-Sa2VA-Qwen3VL-4B-SFT

5B • Updated Dec 5, 2025 • 4

HarborYuan/R-Sa2VA-Qwen3VL-4B-RL

5B • Updated Dec 5, 2025 • 8

published 2 models 7 months ago

HarborYuan/R-Sa2VA-Qwen3VL-4B-SFT

5B • Updated Dec 5, 2025 • 4

HarborYuan/R-Sa2VA-Qwen3VL-4B-RL

5B • Updated Dec 5, 2025 • 8

published 2 datasets 7 months ago

HarborYuan/VisualReasoningTracer

Updated Oct 15, 2025 • 456

HarborYuan/subench

Updated Dec 9, 2025 • 145

upvoted a paper 7 months ago

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Paper • 2511.09611 • Published Nov 12, 2025 • 72

upvoted 2 papers 8 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 56

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 37

Haobo Yuan

AI & ML interests

Recent Activity

Organizations

HarborYuan's activity