Computer-Vision - a cedhons Collection

updated 3 days ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play
Paper • 2505.02707 • Published 4 days ago • 76

MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
Paper • 2505.02823 • Published 4 days ago • 5

PixelHacker: Image Inpainting with Structural and Semantic Consistency
Paper • 2504.20438 • Published 11 days ago • 40

Improving Editability in Image Generation with Layer-wise Memory
Paper • 2505.01079 • Published 8 days ago • 26

A Survey of Interactive Generative Video
Paper • 2504.21853 • Published 9 days ago • 43

KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Paper • 2505.00497 • Published 8 days ago • 13

Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions
Paper • 2504.19056 • Published 13 days ago • 15

ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction
Paper • 2504.21855 • Published 9 days ago • 12