Rui Zhao

ruizhaocv

https://ruizhaocv.github.io/

AI & ML interests

Multimodal and GenAI

ruizhaocv's activity

upvoted 3 papers 9 days ago

BrushEdit: All-In-One Image Inpainting and Editing

Paper • 2412.10316 • Published 13 days ago • 33

Wonderland: Navigating 3D Scenes from a Single Image

Paper • 2412.12091 • Published 9 days ago • 14

ColorFlow: Retrieval-Augmented Image Sequence Colorization

Paper • 2412.11815 • Published 10 days ago • 26

upvoted 3 papers 10 days ago

InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption

Paper • 2412.09283 • Published 14 days ago • 19

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 12 days ago • 131

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 13 days ago • 84

upvoted 3 papers 13 days ago

LoRACLR: Contrastive Adaptation for Customization of Diffusion Models

Paper • 2412.09622 • Published 13 days ago • 7

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published 13 days ago • 21

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published 14 days ago • 41

upvoted a paper 14 days ago

StyleMaster: Stylize Your Video with Artistic Generation and Translation

Paper • 2412.07744 • Published 15 days ago • 19

upvoted 7 papers 15 days ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published 20 days ago • 54

MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance

Paper • 2412.05355 • Published 19 days ago • 7

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

Paper • 2412.06781 • Published 16 days ago • 18

AMO Sampler: Enhancing Text Rendering with Overshooting

Paper • 2411.19415 • Published 27 days ago • 3

ObjCtrl-2.5D: Training-free Object Control with Camera Poses

Paper • 2412.07721 • Published 16 days ago • 8

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 16 days ago • 45

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics

Paper • 2412.07774 • Published 15 days ago • 25

upvoted a paper 28 days ago

ROICtrl: Boosting Instance Control for Visual Generation

Paper • 2411.17949 • Published 29 days ago • 82

upvoted a paper about 2 months ago

ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning

Paper • 2411.05003 • Published Nov 7 • 70

upvoted a paper 2 months ago

Retrieval Head Mechanistically Explains Long-Context Factuality

Paper • 2404.15574 • Published Apr 24 • 2