TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting Paper • 2503.17032 • Published Mar 21 • 27
Common Pile v0.1 Raw Data Collection 8TB of public domain and openly licensed text • 30 items • Updated Jun 6 • 14
BigVGAN: A Universal Neural Vocoder with Large-Scale Training Paper • 2206.04658 • Published Jun 9, 2022 • 4
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data Paper • 2402.08093 • Published Feb 12, 2024 • 62
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published Jan 10 • 52