Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Paper • 2505.02707 • Published 4 days ago • 76
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing Paper • 2505.02823 • Published 4 days ago • 5
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published 11 days ago • 40
Improving Editability in Image Generation with Layer-wise Memory Paper • 2505.01079 • Published 8 days ago • 26
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution Paper • 2505.00497 • Published 8 days ago • 13
Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions Paper • 2504.19056 • Published 13 days ago • 15
ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction Paper • 2504.21855 • Published 9 days ago • 12