Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis Paper • 2504.10352 • Published 5 days ago
CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought Paper • 2409.19510 • Published Sep 29, 2024
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR Paper • 2406.06619 • Published Jun 7, 2024