Text-to-Speech
Transformers
Safetensors
parler_tts
text2text-generation
annotation

Doubts and queries about the model

#12
by mukherjeesougata2399 - opened

I have a few doubts and queries regarding this TTS model, which are as follows:

  1. Language Identification for Common Scripts:
    How does the model identify the language when multiple languages share the same or nearly identical scripts? For example:

    • Bengali and Assamese use the same script.
    • Hindi and Sanskrit have almost identical scripts.
  2. Under the "Tips" section, there is a statement:

    "The remaining speech features (gender, speaking rate, pitch, and reverberation) can be controlled directly
    through the prompt."

    Can you provide an example usage for this?

  3. Why is it necessary to fine-tune on a subset of the same dataset used to train the pre-trained model?

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment