Hongbing Li committed on
Commit db9a17a · 1 Parent(s): 98a6a84

Add model weights and configuration files

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ *.wav filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,145 @@
- ---
- license: mit
- ---
+ ---
+ license: apache-2.0
+ library_name: transformers.js
+ language:
+ - en
+ base_model:
+ - hexgrad/Kokoro-82M
+ pipeline_tag: text-to-speech
+ ---
+
+ # Kokoro TTS
+
+ Kokoro is a frontier TTS model for its size: just 82 million parameters (text in, audio out).
+
+ ## Table of contents
+
+ - [Usage](#usage)
+   - [JavaScript](#javascript)
+   - [Python](#python)
+ - [Voices/Samples](#voicessamples)
+ - [Quantizations](#quantizations)
+
+ ## Usage
+
+ ### JavaScript
+
+ First, install the `kokoro-js` library from [NPM](https://npmjs.com/package/kokoro-js) using:
+ ```bash
+ npm i kokoro-js
+ ```
+
+ You can then generate speech as follows:
+
+ ```js
+ import { KokoroTTS } from "kokoro-js";
+
+ const model_id = "onnx-community/Kokoro-82M-ONNX";
+ const tts = await KokoroTTS.from_pretrained(model_id, {
+   dtype: "q8", // Options: "fp32", "fp16", "q8", "q4", "q4f16"
+ });
+
+ const text = "Life is like a box of chocolates. You never know what you're gonna get.";
+ const audio = await tts.generate(text, {
+   // Use `tts.list_voices()` to list all available voices
+   voice: "af_bella",
+ });
+ audio.save("audio.wav");
+ ```
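+
+ Each `dtype` value corresponds to one of the ONNX files in this repository's `onnx/` folder (for example, `"fp16"` should load `model_fp16.onnx`); see [Quantizations](#quantizations) below for their sizes and audio samples.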
+
+ ### Python
+
+ ```python
+ import os
+ import numpy as np
+ from onnxruntime import InferenceSession
+
+ # You can generate token ids as follows:
+ #  1. Convert input text to phonemes using https://github.com/hexgrad/misaki
+ #  2. Map phonemes to ids using https://huggingface.co/hexgrad/Kokoro-82M/blob/785407d1adfa7ae8fbef8ffd85f34ca127da3039/config.json#L34-L148
+ tokens = [50, 157, 43, 135, 16, 53, 135, 46, 16, 43, 102, 16, 56, 156, 57, 135, 6, 16, 102, 62, 61, 16, 70, 56, 16, 138, 56, 156, 72, 56, 61, 85, 123, 83, 44, 83, 54, 16, 53, 65, 156, 86, 61, 62, 131, 83, 56, 4, 16, 54, 156, 43, 102, 53, 16, 156, 72, 61, 53, 102, 112, 16, 70, 56, 16, 138, 56, 44, 156, 76, 158, 123, 56, 16, 62, 131, 156, 43, 102, 54, 46, 16, 102, 48, 16, 81, 47, 102, 54, 16, 54, 156, 51, 158, 46, 16, 70, 16, 92, 156, 135, 46, 16, 54, 156, 43, 102, 48, 4, 16, 81, 47, 102, 16, 50, 156, 72, 64, 83, 56, 62, 16, 156, 51, 158, 64, 83, 56, 16, 44, 157, 102, 56, 16, 44, 156, 76, 158, 123, 56, 4]
+
+ # Context length is 512, but leave room for the pad token 0 at the start & end
+ assert len(tokens) <= 510, len(tokens)
+
+ # Select the style vector based on len(tokens); ref_s has shape (1, 256)
+ voices = np.fromfile('./voices/af.bin', dtype=np.float32).reshape(-1, 1, 256)
+ ref_s = voices[len(tokens)]
+
+ # Add the pad ids; onnxruntime expects a numpy int64 array of shape (1, <=512)
+ tokens = np.array([[0, *tokens, 0]], dtype=np.int64)
+
+ model_name = 'model.onnx'  # Options: model.onnx, model_fp16.onnx, model_quantized.onnx, model_q8f16.onnx, model_uint8.onnx, model_uint8f16.onnx, model_q4.onnx, model_q4f16.onnx
+ sess = InferenceSession(os.path.join('onnx', model_name))
+
+ audio = sess.run(None, dict(
+     input_ids=tokens,
+     style=ref_s,
+     speed=np.ones(1, dtype=np.float32),
+ ))[0]
+ ```
+
+ Optionally, save the audio to a file:
+ ```py
+ import scipy.io.wavfile as wavfile
+ wavfile.write('audio.wav', 24000, audio[0])
+ ```
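+
+ To produce `tokens` from raw text instead of hard-coding them, you can phonemize the text and look each phoneme up in the model's vocabulary, as the `inference.py` added in this commit does. A minimal sketch (it assumes the `config_kokoro.json` from this repository, the `phonemizer` package, and a system install of `espeak-ng` for the espeak backend):
+
+ ```python
+ import json
+ from phonemizer import phonemize
+
+ # Phoneme-to-id map shipped in this repository
+ with open("config_kokoro.json", encoding="utf-8") as f:
+     vocab = json.load(f)["vocab"]
+
+ text = "Life is like a box of chocolates."
+ phonemes = phonemize(text, language="en-us", backend="espeak",
+                      strip=True, preserve_punctuation=True, with_stress=True)
+
+ # Keep only symbols the vocabulary covers, then map phonemes to token ids
+ tokens = [vocab[p] for p in phonemes if p in vocab]
+ ```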
+
+ ## Voices/Samples
+
+ > Life is like a box of chocolates. You never know what you're gonna get.
+
+ | Name | Nationality | Gender | Sample |
+ | ------------ | ----------- | ------ | ------ |
+ | **af_heart** | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/S_9tkA75BT_QHKOzSX6S-.wav"></audio> |
+ | af_alloy | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/wiZ3gvlL--p5pRItO4YRE.wav"></audio> |
+ | af_aoede | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/Nv1xMwzjTdF9MR8v0oEEJ.wav"></audio> |
+ | af_bella | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/sWN0rnKU6TlLsVdGqRktF.wav"></audio> |
+ | af_jessica | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/2Oa4wITWAmiCXJ_Q97-7R.wav"></audio> |
+ | af_kore | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/AOIgyspzZWDGpn7oQgwtu.wav"></audio> |
+ | af_nicole | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/EY_V2OGr-hzmtTGrTCTyf.wav"></audio> |
+ | af_nova | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/X-xdEkx3GPlQG5DK8Gsqd.wav"></audio> |
+ | af_river | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/ZqaV2-xGUZdBQmZAF1Xqy.wav"></audio> |
+ | af_sarah | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/xzoJBl1HCvkE8Fl8Xu2R4.wav"></audio> |
+ | af_sky | American | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/ubebYQoaseyQk-jDLeWX7.wav"></audio> |
+ | am_adam | American | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/tvauhDVRGvGK98I-4wv3H.wav"></audio> |
+ | am_echo | American | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/qy_KuUB0hXsu-u8XaJJ_Z.wav"></audio> |
+ | am_eric | American | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/JhqPjbpMhraUv5nTSPpwD.wav"></audio> |
+ | am_fenrir | American | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/c0R9caBdBiNjGUUalI_DQ.wav"></audio> |
+ | am_liam | American | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/DFHvulaLeOjXIDKecvNG3.wav"></audio> |
+ | am_michael | American | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/IPKhsnjq1tPh3JmHH8nEg.wav"></audio> |
+ | am_onyx | American | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/ov0pFDfE8NNKZ80LqW6Di.wav"></audio> |
+ | am_puck | American | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/MOC654sLMHWI64g8HWesV.wav"></audio> |
+ | am_santa | American | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/LzA6JmHBvQlhOviy8qVfJ.wav"></audio> |
+ | bf_alice | British | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/9mnYZ3JWq7f6U12plXilA.wav"></audio> |
+ | bf_emma | British | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/_fvGtKMttRI0cZVGqxMh8.wav"></audio> |
+ | bf_isabella | British | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/VzlcJpqGEND_Q3duYnhiu.wav"></audio> |
+ | bf_lily | British | Female | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/qZCoartohiRlVamY8Xpok.wav"></audio> |
+ | bm_daniel | British | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/Eb0TLnLXHDRYOA3TJQKq3.wav"></audio> |
+ | bm_fable | British | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/NT9XkmvlezQ0FJ6Th5hoZ.wav"></audio> |
+ | bm_george | British | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/y6VJbCESszLZGupPoqNkF.wav"></audio> |
+ | bm_lewis | British | Male | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/RlB5BRvLt-IFvTjzQNxCh.wav"></audio> |
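+
+ Each name in the table maps to a `voices/<name>.bin` file in this repository, which can be loaded exactly like `voices/af.bin` in the Python example above. A sketch:
+
+ ```python
+ import numpy as np
+
+ voice = "bm_george"  # any name from the table above
+ voices = np.fromfile(f"./voices/{voice}.bin", dtype=np.float32).reshape(-1, 1, 256)
+ ref_s = voices[len(tokens)]  # style vector indexed by token count, as before
+ ```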
+
+ ## Quantizations
+
+ The model is resilient to quantization, enabling efficient high-quality speech synthesis at a fraction of the original model size.
+
+ > How could I know? It's an unanswerable question. Like asking an unborn child if they'll lead a good life. They haven't even been born.
+
+ | Model | Size (MB) | Sample |
+ | ----- | --------- | ------ |
+ | model.onnx (fp32) | 326 | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/njexBuqPzfYUvWgs9eQ-_.wav"></audio> |
+ | model_fp16.onnx (fp16) | 163 | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/8Ebl44hMQonZs4MlykExt.wav"></audio> |
+ | model_quantized.onnx (8-bit) | 92.4 | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/9SLOt6ETclZ4yRdlJ0VIj.wav"></audio> |
+ | model_q8f16.onnx (Mixed precision) | 86 | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/gNDMqb33YEmYMbAIv_Grx.wav"></audio> |
+ | model_uint8.onnx (8-bit & mixed precision) | 177 | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/tpOWRHIWwEb0PJX46dCWQ.wav"></audio> |
+ | model_uint8f16.onnx (Mixed precision) | 114 | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/vtZhABzjP0pvGD7dRb5Vr.wav"></audio> |
+ | model_q4.onnx (4-bit matmul) | 305 | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/8FVn0IJIUfccEBWq8Fnw_.wav"></audio> |
+ | model_q4f16.onnx (4-bit matmul & fp16 weights) | 154 | <audio controls src="https://cdn-uploads.huggingface.co/production/uploads/61b253b7ac5ecaae3d1efe0c/7DrgWC_1q00s-wUJuG44X.wav"></audio> |
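+
+ All of these variants ship in the `onnx/` folder of this repository, so switching quantizations in the Python example above only requires changing `model_name`. For example (a sketch; the size and quality trade-offs are as listed in the table):
+
+ ```python
+ from onnxruntime import InferenceSession
+
+ # model_q8f16.onnx is the smallest variant in the table above (86 MB)
+ sess = InferenceSession("onnx/model_q8f16.onnx")
+ ```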
audio.wav ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:226266faab69075bc17eb83d3d8256d0dfa4df25eb6bb323c783c6a4c57e2107
+ size 374458
config.json ADDED
@@ -0,0 +1,3 @@
+ {
+   "model_type": "style_text_to_speech_2"
+ }
config_kokoro.json ADDED
@@ -0,0 +1,150 @@
+ {
+   "istftnet": {
+     "upsample_kernel_sizes": [20, 12],
+     "upsample_rates": [10, 6],
+     "gen_istft_hop_size": 5,
+     "gen_istft_n_fft": 20,
+     "resblock_dilation_sizes": [
+       [1, 3, 5],
+       [1, 3, 5],
+       [1, 3, 5]
+     ],
+     "resblock_kernel_sizes": [3, 7, 11],
+     "upsample_initial_channel": 512
+   },
+   "dim_in": 64,
+   "dropout": 0.2,
+   "hidden_dim": 512,
+   "max_conv_dim": 512,
+   "max_dur": 50,
+   "multispeaker": true,
+   "n_layer": 3,
+   "n_mels": 80,
+   "n_token": 178,
+   "style_dim": 128,
+   "text_encoder_kernel_size": 5,
+   "plbert": {
+     "hidden_size": 768,
+     "num_attention_heads": 12,
+     "intermediate_size": 2048,
+     "max_position_embeddings": 512,
+     "num_hidden_layers": 12,
+     "dropout": 0.1
+   },
+   "vocab": {
+     ";": 1,
+     ":": 2,
+     ",": 3,
+     ".": 4,
+     "!": 5,
+     "?": 6,
+     "—": 9,
+     "…": 10,
+     "\"": 11,
+     "(": 12,
+     ")": 13,
+     "“": 14,
+     "”": 15,
+     " ": 16,
+     "\u0303": 17,
+     "ʣ": 18,
+     "ʥ": 19,
+     "ʦ": 20,
+     "ʨ": 21,
+     "ᵝ": 22,
+     "\uAB67": 23,
+     "A": 24,
+     "I": 25,
+     "O": 31,
+     "Q": 33,
+     "S": 35,
+     "T": 36,
+     "W": 39,
+     "Y": 41,
+     "ᵊ": 42,
+     "a": 43,
+     "b": 44,
+     "c": 45,
+     "d": 46,
+     "e": 47,
+     "f": 48,
+     "h": 50,
+     "i": 51,
+     "j": 52,
+     "k": 53,
+     "l": 54,
+     "m": 55,
+     "n": 56,
+     "o": 57,
+     "p": 58,
+     "q": 59,
+     "r": 60,
+     "s": 61,
+     "t": 62,
+     "u": 63,
+     "v": 64,
+     "w": 65,
+     "x": 66,
+     "y": 67,
+     "z": 68,
+     "ɑ": 69,
+     "ɐ": 70,
+     "ɒ": 71,
+     "æ": 72,
+     "β": 75,
+     "ɔ": 76,
+     "ɕ": 77,
+     "ç": 78,
+     "ɖ": 80,
+     "ð": 81,
+     "ʤ": 82,
+     "ə": 83,
+     "ɚ": 85,
+     "ɛ": 86,
+     "ɜ": 87,
+     "ɟ": 90,
+     "ɡ": 92,
+     "ɥ": 99,
+     "ɨ": 101,
+     "ɪ": 102,
+     "ʝ": 103,
+     "ɯ": 110,
+     "ɰ": 111,
+     "ŋ": 112,
+     "ɳ": 113,
+     "ɲ": 114,
+     "ɴ": 115,
+     "ø": 116,
+     "ɸ": 118,
+     "θ": 119,
+     "œ": 120,
+     "ɹ": 123,
+     "ɾ": 125,
+     "ɻ": 126,
+     "ʁ": 128,
+     "ɽ": 129,
+     "ʂ": 130,
+     "ʃ": 131,
+     "ʈ": 132,
+     "ʧ": 133,
+     "ʊ": 135,
+     "ʋ": 136,
+     "ʌ": 138,
+     "ɣ": 139,
+     "ɤ": 140,
+     "χ": 142,
+     "ʎ": 143,
+     "ʒ": 147,
+     "ʔ": 148,
+     "ˈ": 156,
+     "ˌ": 157,
+     "ː": 158,
+     "ʰ": 162,
+     "ʲ": 164,
+     "↓": 169,
+     "→": 171,
+     "↗": 172,
+     "↘": 173,
+     "ᵻ": 177
+   }
+ }
inference.py ADDED
@@ -0,0 +1,53 @@
+ import os
+ import json
+ import numpy as np
+ import scipy.io.wavfile as wavfile
+ from onnxruntime import InferenceSession
+ from phonemizer import phonemize
+
+ # === Step 1: Load phoneme-to-ID vocabulary ===
+ CONFIG_PATH = "./config_kokoro.json"  # Included in this repo; vocab mirrors hexgrad/Kokoro-82M config.json
+ with open(CONFIG_PATH, "r", encoding="utf-8") as f:
+     config = json.load(f)
+ phoneme_to_id = config["vocab"]
+
+ # === Step 2: Convert text to phonemes using espeak-ng ===
+ text = "Hi, how are you? What is your name? Tell me something."
+
+ phonemes = phonemize(
+     text,
+     language="en-us",
+     backend="espeak",
+     strip=True,
+     preserve_punctuation=True,
+     with_stress=True,
+ )
+
+ # === Step 3: Filter out unsupported phonemes and convert to token IDs ===
+ phonemes = "".join(p for p in phonemes if p in phoneme_to_id)
+ print("Phonemes:", phonemes)
+
+ tokens = [phoneme_to_id[p] for p in phonemes]
+ print("Token IDs:", tokens)
+
+ # === Step 4: Prepare style embedding and input IDs ===
+ assert len(tokens) <= 510, "Token sequence too long (max 510 phonemes)"
+
+ voices = np.fromfile('./voices/af.bin', dtype=np.float32).reshape(-1, 1, 256)
+ ref_s = voices[len(tokens)]  # Select style vector based on token length
+
+ # Pad token 0 at the beginning and end; onnxruntime expects a numpy int64 array
+ input_ids = np.array([[0, *tokens, 0]], dtype=np.int64)
+
+ # === Step 5: Run ONNX model inference ===
+ model_name = 'model.onnx'
+ sess = InferenceSession(os.path.join('onnx', model_name))
+
+ audio = sess.run(None, {
+     'input_ids': input_ids,
+     'style': ref_s,
+     'speed': np.ones(1, dtype=np.float32),
+ })[0]
+
+ # === Step 6: Save output audio as a 24kHz WAV file ===
+ wavfile.write('audio.wav', 24000, audio[0])
+ print("✅ Audio saved to audio.wav")
onnx/model.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8fbea51ea711f2af382e88c833d9e288c6dc82ce5e98421ea61c058ce21a34cb
+ size 325532232
onnx/model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:ba4527a874b42b21e35f468c10d326fdff3c7fc8cac1f85e9eb6c0dfc35c334a
+ size 163234740
onnx/model_q4.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:04cf570cf9c4153694f76347ed4b9a48c1b59ff1de0999e6605d123966b197c7
+ size 305215966
onnx/model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d1a508a6a29671ead84fac99c7401fbd3c21a583fc6ed1406d1ec974d53bf45f
+ size 154586422
onnx/model_q8f16.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:04c658aec1b6008857c2ad10f8c589d4180d0ec427e7e6118ceb487e215c3cd0
+ size 86033585
onnx/model_quantized.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:fbae9257e1e05ffc727e951ef9b9c98418e6d79f1c9b6b13bd59f5c9028a1478
+ size 92361116
onnx/model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6607a397d77b8514065420b7c1e7320117f7aabfdb45ce15f0050c5b0fe75aea
+ size 177464632
onnx/model_uint8f16.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:883333e03c597584b532eebea0f8310f25f0c9ade58fe864792c12d969944a9a
+ size 114209226
requirement.txt ADDED
@@ -0,0 +1,5 @@
+ phonemizer==3.2.1
+ espeakng==1.0.1
+ numpy>=1.21
+ onnxruntime>=1.16.0
+ scipy>=1.7
tokenizer.json ADDED
@@ -0,0 +1,175 @@
+ {
+   "version": "1.0",
+   "truncation": null,
+   "padding": null,
+   "added_tokens": [],
+   "normalizer": {
+     "type": "Replace",
+     "pattern": {
+       "Regex": "[^$;:,.!?\u2014\u2026\"()\u201c\u201d \u0303\u02a3\u02a5\u02a6\u02a8\u1d5d\uab67AIOQSTWY\u1d4aabcdefhijklmnopqrstuvwxyz\u0251\u0250\u0252\u00e6\u03b2\u0254\u0255\u00e7\u0256\u00f0\u02a4\u0259\u025a\u025b\u025c\u025f\u0261\u0265\u0268\u026a\u029d\u026f\u0270\u014b\u0273\u0272\u0274\u00f8\u0278\u03b8\u0153\u0279\u027e\u027b\u0281\u027d\u0282\u0283\u0288\u02a7\u028a\u028b\u028c\u0263\u0264\u03c7\u028e\u0292\u0294\u02c8\u02cc\u02d0\u02b0\u02b2\u2193\u2192\u2197\u2198\u1d7b]"
+     },
+     "content": ""
+   },
+   "pre_tokenizer": {
+     "type": "Split",
+     "pattern": {
+       "Regex": ""
+     },
+     "behavior": "Isolated",
+     "invert": false
+   },
+   "post_processor": {
+     "type": "TemplateProcessing",
+     "single": [
+       {
+         "SpecialToken": {
+           "id": "$",
+           "type_id": 0
+         }
+       },
+       {
+         "Sequence": {
+           "id": "A",
+           "type_id": 0
+         }
+       },
+       {
+         "SpecialToken": {
+           "id": "$",
+           "type_id": 0
+         }
+       }
+     ],
+     "special_tokens": {
+       "$": {
+         "id": "$",
+         "ids": [
+           0
+         ],
+         "tokens": [
+           "$"
+         ]
+       }
+     }
+   },
+   "decoder": null,
+   "model": {
+     "vocab": {
+       "$": 0,
+       ";": 1,
+       ":": 2,
+       ",": 3,
+       ".": 4,
+       "!": 5,
+       "?": 6,
+       "\u2014": 9,
+       "\u2026": 10,
+       "\"": 11,
+       "(": 12,
+       ")": 13,
+       "\u201c": 14,
+       "\u201d": 15,
+       " ": 16,
+       "\u0303": 17,
+       "\u02a3": 18,
+       "\u02a5": 19,
+       "\u02a6": 20,
+       "\u02a8": 21,
+       "\u1d5d": 22,
+       "\uab67": 23,
+       "A": 24,
+       "I": 25,
+       "O": 31,
+       "Q": 33,
+       "S": 35,
+       "T": 36,
+       "W": 39,
+       "Y": 41,
+       "\u1d4a": 42,
+       "a": 43,
+       "b": 44,
+       "c": 45,
+       "d": 46,
+       "e": 47,
+       "f": 48,
+       "h": 50,
+       "i": 51,
+       "j": 52,
+       "k": 53,
+       "l": 54,
+       "m": 55,
+       "n": 56,
+       "o": 57,
+       "p": 58,
+       "q": 59,
+       "r": 60,
+       "s": 61,
+       "t": 62,
+       "u": 63,
+       "v": 64,
+       "w": 65,
+       "x": 66,
+       "y": 67,
+       "z": 68,
+       "\u0251": 69,
+       "\u0250": 70,
+       "\u0252": 71,
+       "\u00e6": 72,
+       "\u03b2": 75,
+       "\u0254": 76,
+       "\u0255": 77,
+       "\u00e7": 78,
+       "\u0256": 80,
+       "\u00f0": 81,
+       "\u02a4": 82,
+       "\u0259": 83,
+       "\u025a": 85,
+       "\u025b": 86,
+       "\u025c": 87,
+       "\u025f": 90,
+       "\u0261": 92,
+       "\u0265": 99,
+       "\u0268": 101,
+       "\u026a": 102,
+       "\u029d": 103,
+       "\u026f": 110,
+       "\u0270": 111,
+       "\u014b": 112,
+       "\u0273": 113,
+       "\u0272": 114,
+       "\u0274": 115,
+       "\u00f8": 116,
+       "\u0278": 118,
+       "\u03b8": 119,
+       "\u0153": 120,
+       "\u0279": 123,
+       "\u027e": 125,
+       "\u027b": 126,
+       "\u0281": 128,
+       "\u027d": 129,
+       "\u0282": 130,
+       "\u0283": 131,
+       "\u0288": 132,
+       "\u02a7": 133,
+       "\u028a": 135,
+       "\u028b": 136,
+       "\u028c": 138,
+       "\u0263": 139,
+       "\u0264": 140,
+       "\u03c7": 142,
+       "\u028e": 143,
+       "\u0292": 147,
+       "\u0294": 148,
+       "\u02c8": 156,
+       "\u02cc": 157,
+       "\u02d0": 158,
+       "\u02b0": 162,
+       "\u02b2": 164,
+       "\u2193": 169,
+       "\u2192": 171,
+       "\u2197": 172,
+       "\u2198": 173,
+       "\u1d7b": 177
+     }
+   }
+ }
tokenizer_config.json ADDED
@@ -0,0 +1,6 @@
+ {
+   "model_max_length": 512,
+   "pad_token": "$",
+   "tokenizer_class": "PreTrainedTokenizer",
+   "unk_token": "$"
+ }
voices/af.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a4f11d9d055a12bfa0db2668a3e4f0ef8fd1f1ccca69494479718e44dbf9e41a
+ size 524288
voices/af_alloy.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c4a6b876047fd7fb472edf4ebd63cfac7c3b958a7cae7c106e8f038ca6308c45
+ size 522240
voices/af_aoede.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4a004c33430762e2461eedb2013fad808ef4ab3121f5300f554476caf58d8361
+ size 522240
voices/af_bella.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f69d836209b78eb8c66e75e3cda491e26ea838a3674257e9d4e5703cbaf55c8b
+ size 522240
voices/af_heart.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d583ccff3cdca2f7fae535cb998ac07e9fcb90f09737b9a41fa2734ec44a8f0b
+ size 522240
voices/af_jessica.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a240a5e3c15b43563d6e923bdca8ef5613a23471d9b77653694012435df23bd8
+ size 522240
voices/af_kore.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9be5221b6a941c04b561959b8ff0b06e809444dcc4ab7e75a7b23606f691819e
+ size 522240
voices/af_nicole.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:cd2191ab31b914ed7b318416b0e4440fdf392ddad9106a060819aa600a64f59a
+ size 522240
voices/af_nova.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:18778272caa0d0eebaea251c35fd635f038434f9eee5e691d02a174bd328414f
+ size 522240
voices/af_river.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:00a2bcf82b1d86e8f19902ede58c65ccf6c0e43b44b7d74fad54e5d8933c9c30
+ size 522240
voices/af_sarah.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4409fbc125afabacc615d94db5398d847006a737b0247d6892b7a9a0007a2f0a
+ size 522240
voices/af_sky.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4435255c9744f3f31659e0d714ab7689bf65d9e77ec1cce060f083912614f0b9
+ size 522240
voices/am_adam.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:162b035ed91cfc48b6046982184c645f72edcdd1b82843347f605d7bf7b15716
+ size 522240
voices/am_echo.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3968b92c3c4cd1c4416dbded36c13eaa388a90d5788d02a13e4d781f5f8cf3c3
+ size 522240
voices/am_eric.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e8b5be17edd1e3636901ce7598baafe2dc8dd8ff707a0c23bf9e461add7e2832
+ size 522240
voices/am_fenrir.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c27989f741f7ee34d273a39d8a595cc0837d35f5ced9a29b7cc162614616df43
+ size 522240
voices/am_liam.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:52403be32fd047c6a44517cb0bcd6b134f2a18baa73e70ef41651e0eab921ade
+ size 522240
voices/am_michael.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1d1f21dd8da39c30705cd4c75d039d265e9bc4a2a93ed09bc9e1b1225eb95ba1
+ size 522240
voices/am_onyx.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:da5d135b424164916d75a68ffb4c2abce3d7d5ccc82dd1ee6cf447ce286145e6
+ size 522240
voices/am_puck.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:fcf73c989033e9233e0b98713eca600c8c74dcc1614b37009d5450ff4a2274a0
+ size 522240
voices/am_santa.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:61150cf726ab6c5ed7a99f90a304f91f5a72c00c592e89ec94e5df11c319227a
+ size 522240
voices/bf_alice.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:08afa6ba24da61ea5e8efa139e5aadc938d83f0a6da5a900adaf763ac1da5573
+ size 522240
voices/bf_emma.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:669fe0647f9dd04fcab92f1439a40eeb4c8b4ab1f82e4996fe3d918ce4a63b73
+ size 522240
voices/bf_isabella.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3754352c4aaa46d17f27654ab7518d65b62ad6163a0f55a5f4330c2da2c4e94f
+ size 522240
voices/bf_lily.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5e0ee32ebe64a467124976b14e69590746f1c4ce41a12b587a50c862edfea335
+ size 522240
voices/bm_daniel.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6b3194bbceffb746733cbc22c8f593dd44e401a71d53895a2dca891bc595a1e8
+ size 522240
voices/bm_fable.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f889083196807b4adb15e9204252165f503b8d33d3982e681c52443c49d798f1
+ size 522240
voices/bm_george.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c4b235a4c1f2cd3b939fed08b899ce9385638b763f7b73a59616c4fc9bd6c9bc
+ size 522240
voices/bm_lewis.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b8f671cef828c30e66fdf0b0756a76bba58f6bb3398cbbf27058642acbcedb97
+ size 522240
voices/ef_dora.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f66ec66bd295acb18372e37008533a9a3228483ccd294e7538d5d9294ac9a532
+ size 522240
voices/em_alex.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:27809e9eafdcbcfff90a3016c697568676531de2a2c39cee29c96c7bd6b83e95
+ size 522240
voices/em_santa.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:ad43b774e1ca24d05c6161297d8aeb770ac3d29bb95daf516727af5f7d543683
+ size 522240
voices/ff_siwis.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a35f5675ad08948e326ae75fd0ea16ba5d0042e4f76b5f3d1df77d0a48c54861
+ size 522240