WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation Paper • 2505.01490 • Published 9 days ago • 4
Improving Editability in Image Generation with Layer-wise Memory Paper • 2505.01079 • Published 10 days ago • 27
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing Paper • 2505.02823 • Published 6 days ago • 5
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing Paper • 2505.02370 • Published 7 days ago • 12
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation Paper • 2505.04512 • Published 5 days ago • 32
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published 13 days ago • 40
In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer Paper • 2504.20690 • Published 13 days ago • 18
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models Paper • 2504.15271 • Published 20 days ago • 65
Personalized Text-to-Image Generation with Auto-Regressive Models Paper • 2504.13162 • Published 24 days ago • 19
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published 19 days ago • 60
DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning Paper • 2504.14509 • Published 22 days ago • 50
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published 18 days ago • 88
Subject-driven Video Generation via Disentangled Identity and Motion Paper • 2504.17816 • Published 19 days ago • 11
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework Paper • 2504.12395 • Published 25 days ago • 17
Cobra: Efficient Line Art COlorization with BRoAder References Paper • 2504.12240 • Published 26 days ago • 28