zhumuzhi's picture

3 18 5

zhumuzhi

Z-MU-Z

·

Z-MU-Z

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

upvoted a paper 29 days ago

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

upvoted a paper about 1 month ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

View all activity

Organizations

None yet

commented 2 papers 3 months ago

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Paper • 2505.21457 • Published May 27 • 14 •

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Paper • 2505.20256 • Published May 26 • 17 •

commented a paper 5 months ago

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Paper • 2503.08625 • Published Mar 11 • 27 •