| license: apache-2.0 | |
| library_name: kittentts | |
| # Kitten TTS π» | |
| Kitten TTS is an open-source realistic text-to-speech model with just 15 million parameters, designed for lightweight deployment and high-quality voice synthesis. | |
| *Currently in developer preview* | |
| ## β¨ Features | |
| - **Ultra-lightweight**: Model size less than 25MB | |
| - **CPU-optimized**: Runs without GPU on any device | |
| - **High-quality voices**: Several premium voice options available | |
| - **Fast inference**: Optimized for real-time speech synthesis | |
| ## π Quick Start | |
| ### Installation | |
| ``` | |
| pip install https://github.com/KittenML/KittenTTS/releases/download/0.1/kittentts-0.1.0-py3-none-any.whl | |
| ``` | |
| ### Basic Usage | |
| ``` | |
| from kittentts import KittenTTS | |
| m = KittenTTS("KittenML/kitten-tts-nano-0.1") | |
| audio = m.generate("This high quality TTS model works without a GPU") | |
| # Save the audio | |
| import soundfile as sf | |
| sf.write('output.wav', audio, 24000) | |
| ``` | |
| ## π» System Requirements | |
| Works literally everywhere | |
| ## Checklist | |
| - [x] Release a preview model | |
| - [ ] Release the fully trained model weights | |
| - [ ] Release mobile SDK | |
| - [ ] Release web version |