ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning Paper • 2411.05003 • Published 2 days ago • 56
VEnhancer: Generative Space-Time Enhancement for Video Generation Paper • 2407.07667 • Published Jul 10 • 12
Training-free Regional Prompting for Diffusion Transformers Paper • 2411.02395 • Published 5 days ago • 22
NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks Paper • 2410.20650 • Published 13 days ago • 14
MarDini: Masked Autoregressive Diffusion for Video Generation at Scale Paper • 2410.20280 • Published 14 days ago • 21
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation Paper • 2410.18666 • Published 16 days ago • 17
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer Paper • 2410.10812 • Published 26 days ago • 14
DragAnything: Motion Control for Anything using Entity Representation Paper • 2403.07420 • Published Mar 12 • 13
Improving Long-Text Alignment for Text-to-Image Diffusion Models Paper • 2410.11817 • Published 25 days ago • 14
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper • 2410.10792 • Published 26 days ago • 26
Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention Paper • 2410.10774 • Published 26 days ago • 23
ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion Paper • 2410.08168 • Published about 1 month ago • 7
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis Paper • 2410.08261 • Published about 1 month ago • 48
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control Paper • 2403.04880 • Published Mar 7 • 53
DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models Paper • 2410.08207 • Published about 1 month ago • 18
Progressive Autoregressive Video Diffusion Models Paper • 2410.08151 • Published about 1 month ago • 15
ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler Paper • 2410.05651 • Published Oct 8 • 13
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation Paper • 2410.07171 • Published Oct 9 • 41