Faster Video Diffusion with Trainable Sparse Attention Paper • 2505.13389 • Published 5 days ago • 34
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Paper • 2505.02707 • Published 20 days ago • 80