You Li

Michael4933

AI & ML interests

NLP, Multi-modal LLM

Recent Activity

upvoted a paper 4 days ago

Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL

upvoted a paper 4 days ago

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

updated a Space 16 days ago

Michael4933/Migician

Organizations

None yet

Michael4933's activity

upvoted 2 papers 4 days ago

Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL

Paper • 2505.15436 • Published 18 days ago • 1

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

Paper • 2505.14362 • Published 19 days ago • 1

upvoted a paper 3 months ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published Mar 17 • 30

upvoted a paper 5 months ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Paper • 2501.05767 • Published Jan 10 • 30