ColorFlow: Retrieval-Augmented Image Sequence Colorization Paper • 2412.11815 • Published 10 days ago • 26
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Paper • 2412.09283 • Published 14 days ago • 19
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 12 days ago • 131
LoRACLR: Contrastive Adaptation for Customization of Diffusion Models Paper • 2412.09622 • Published 13 days ago • 7
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM Paper • 2412.09618 • Published 13 days ago • 21
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published 14 days ago • 41
StyleMaster: Stylize Your Video with Artistic Generation and Translation Paper • 2412.07744 • Published 15 days ago • 19
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance Paper • 2412.05355 • Published 19 days ago • 7
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation Paper • 2412.06781 • Published 16 days ago • 18
AMO Sampler: Enhancing Text Rendering with Overshooting Paper • 2411.19415 • Published 27 days ago • 3
ObjCtrl-2.5D: Training-free Object Control with Camera Poses Paper • 2412.07721 • Published 16 days ago • 8
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published 16 days ago • 45
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper • 2412.07774 • Published 15 days ago • 25
ROICtrl: Boosting Instance Control for Visual Generation Paper • 2411.17949 • Published 29 days ago • 82
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning Paper • 2411.05003 • Published Nov 7 • 70
Retrieval Head Mechanistically Explains Long-Context Factuality Paper • 2404.15574 • Published Apr 24 • 2