zhumuzhi

Z-MU-Z

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago

Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting

authored a paper about 1 month ago

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

authored a paper about 1 month ago

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Organizations

None yet

authored 2 papers about 1 month ago

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Paper • 2505.20256 • Published May 26 • 17

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Paper • 2505.21457 • Published May 27 • 14

authored 7 papers 4 months ago

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Paper • 2503.08625 • Published Mar 11 • 26

De novo protein design using geometric vector field networks

Paper • 2310.11802 • Published Oct 18, 2023

Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

Paper • 2305.13310 • Published May 22, 2023

SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning

Paper • 2308.06531 • Published Aug 12, 2023

DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data

Paper • 2405.10185 • Published May 16, 2024

Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation

Paper • 2410.02369 • Published Oct 3, 2024 • 1

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Paper • 2502.17157 • Published Feb 24 • 53