FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers Paper • 2507.12956 • Published 23 days ago • 22
FLEXITOKENS: Flexible Tokenization for Evolving Language Models Paper • 2507.12720 • Published 23 days ago • 8
SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation Paper • 2507.09862 • Published 26 days ago • 48
Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published Jun 24 • 40
Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images Paper • 2506.22960 • Published Jun 28 • 6
HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling Paper • 2506.20452 • Published Jun 25 • 18
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model Paper • 2506.15682 • Published Jun 18 • 5