Malaysian-Podcast-Dia-1.6B

Full-parameter finetuning of nari-labs/Dia-1.6B on the Malaysian Podcast data from mesolitica/Malaysian-Emilia, where the permutation for voice conversion selects only pairs that are 80% similar.

A complete usage tutorial is available at mesolitica/malaya-speech/Dia-TTS.

How we trained it

  1. The finetuning was done with FP32-BF16 mixed precision training.
  2. Multipacking was used for the encoder-decoder.
  3. Wandb logs are at https://wandb.ai/huseinzol05/dia-tts-malaysian-emilia-full-mixed-precision-podcast
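
The card does not describe how multipacking was implemented, so the following is only a minimal sketch of the general idea, not the code used for this model. It assumes each example has an encoder (text) length and a decoder (audio token) length, and greedily packs several examples into one training row so that both length budgets hold. The names `Pack`, `multipack`, `segment_ids`, `max_enc_len` and `max_dec_len` are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Pack:
    """One packed training row holding several (encoder, decoder) examples."""
    indices: list = field(default_factory=list)
    enc_used: int = 0
    dec_used: int = 0


def multipack(lengths, max_enc_len, max_dec_len):
    """Greedy first-fit-decreasing packing under two length budgets.

    lengths: list of (encoder_len, decoder_len) pairs, one per example.
    Returns a list of Pack objects; each pack respects both budgets.
    """
    for enc_len, dec_len in lengths:
        if enc_len > max_enc_len or dec_len > max_dec_len:
            raise ValueError("example longer than the packing budget")

    # Place the longest decoder sequences first; they are the hardest to fit.
    order = sorted(range(len(lengths)), key=lambda i: lengths[i][1], reverse=True)

    packs = []
    for i in order:
        enc_len, dec_len = lengths[i]
        for pack in packs:
            # Reuse the first existing pack with room on both sides.
            if (pack.enc_used + enc_len <= max_enc_len
                    and pack.dec_used + dec_len <= max_dec_len):
                break
        else:
            # No existing pack fits, so open a new one.
            pack = Pack()
            packs.append(pack)
        pack.indices.append(i)
        pack.enc_used += enc_len
        pack.dec_used += dec_len
    return packs


def segment_ids(pack_lengths):
    """Per-token segment ids for one packed row, e.g. [3, 2] -> [0, 0, 0, 1, 1]."""
    ids = []
    for seg, n in enumerate(pack_lengths):
        ids.extend([seg] * n)
    return ids
```

In a setup like this, the segment ids would typically be used to build attention masks so that tokens from one packed example do not attend to tokens from another, on both the encoder and the decoder side. The actual training code is in the repository linked under Source code.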

Source code

Source code at https://github.com/mesolitica/malaya-speech/tree/master/session/dia-tts

Acknowledgement

Special thanks to https://www.sns.com.my and Nvidia for the 8x H100 node!
