DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Paper • 2412.18597 • Published 1 day ago • 12
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up Paper • 2412.16112 • Published 6 days ago • 17
Efficient Generative Modeling with Residual Vector Quantization-Based Tokens Paper • 2412.10208 • Published 13 days ago • 19
FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers Paper • 2412.09611 • Published 13 days ago • 9
Learning Flow Fields in Attention for Controllable Person Image Generation Paper • 2412.08486 • Published 15 days ago • 32
StyleMaster: Stylize Your Video with Artistic Generation and Translation Paper • 2412.07744 • Published 15 days ago • 19
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision Paper • 2411.07199 • Published Nov 11 • 45
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published 16 days ago • 45
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper • 2412.07774 • Published 15 days ago • 25
Negative Token Merging: Image-based Adversarial Feature Guidance Paper • 2412.01339 • Published 24 days ago • 21
NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training Paper • 2412.02030 • Published 23 days ago • 18
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model Paper • 2411.17459 • Published 30 days ago • 10
CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes Paper • 2411.00771 • Published Nov 1 • 9
MarDini: Masked Autoregressive Diffusion for Video Generation at Scale Paper • 2410.20280 • Published Oct 26 • 23
Unbounded: A Generative Infinite Game of Character Life Simulation Paper • 2410.18975 • Published Oct 24 • 35
FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model Paper • 2410.13925 • Published Oct 17 • 22
Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion Paper • 2410.13674 • Published Oct 17 • 15
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning Paper • 2410.06373 • Published Oct 8 • 35