Donghao Zhou

donghao-zhou

https://correr-zhou.github.io

Correr-Zhou

AI & ML interests

Generative AI

Organizations

None yet

donghao-zhou's activity

upvoted 2 papers 4 days ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published 4 days ago • 57

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published 5 days ago • 31

upvoted 2 papers 5 days ago

UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation

Paper • 2505.24521 • Published 9 days ago • 15

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

Paper • 2505.24862 • Published 8 days ago • 31

upvoted a paper 9 days ago

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Paper • 2505.23606 • Published 9 days ago • 14

upvoted a paper 11 days ago

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Paper • 2505.20292 • Published 12 days ago • 52

upvoted a paper 16 days ago

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Paper • 2505.16707 • Published 16 days ago • 42

upvoted a paper 20 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 25 days ago • 183

upvoted 2 papers 26 days ago

DanceGRPO: Unleashing GRPO on Visual Generation

Paper • 2505.07818 • Published 26 days ago • 29

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published 27 days ago • 143

upvoted a paper 29 days ago

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published about 1 month ago • 78

upvoted 2 papers about 1 month ago

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Paper • 2505.00703 • Published May 1 • 42

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24 • 88

upvoted 3 papers about 2 months ago

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 50

Seedream 3.0 Technical Report

Paper • 2504.11346 • Published Apr 15 • 64

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 63

upvoted a paper 2 months ago

ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation

Paper • 2503.22194 • Published Mar 28 • 24

upvoted 3 papers 3 months ago