Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation Paper โข 2305.18474 โข Published May 29, 2023 โข 3
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Paper โข 2408.16532 โข Published Aug 29 โข 47
UniAudio: An Audio Foundation Model Toward Universal Audio Generation Paper โข 2310.00704 โข Published Oct 1, 2023 โข 21
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation Paper โข 2305.18474 โข Published May 29, 2023 โข 3
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training Paper โข 2305.10763 โข Published May 18, 2023 โข 3