EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis • Paper • 2503.20168 • Published Mar 26
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training • Paper • 2502.11196 • Published Feb 16 • 22
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization • Paper • 2410.12957 • Published Oct 16, 2024 • 9
B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests • Paper • 2409.08692 • Published Sep 13, 2024 • 28
VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters • Paper • 2408.17253 • Published Aug 30, 2024 • 40
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling • Paper • 2408.16532 • Published Aug 29, 2024 • 51
MulliVC: Multi-lingual Voice Conversion With Cycle Consistency • Paper • 2408.04708 • Published Aug 8, 2024 • 8
RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback • Paper • 2403.06840 • Published Mar 11, 2024 • 1
ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis • Paper • 2403.06932 • Published Mar 11, 2024 • 1
SPA: Towards A Computational Friendly Cloud-Base and On-Devices Collaboration Seq2seq Personalized Generation • Paper • 2403.07088 • Published Mar 11, 2024 • 1
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts • Paper • 2307.07218 • Published Jul 14, 2023 • 27
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias • Paper • 2306.03509 • Published Jun 6, 2023 • 5
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis • Paper • 2306.03504 • Published Jun 6, 2023 • 8
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training • Paper • 2305.10763 • Published May 18, 2023 • 3