Smoliakov PRO

Yehor

AI & ML interests

Speech-to-Text, Text-to-Speech, Voice over Internet Protocol

Organizations

Speech-UK initiative, MedVoice, Call Recognition System

Posts 2

Published a stable version of the Ukrainian Text-to-Speech library on GitHub and PyPI.

Features:

- Multi-speaker model: 2 female (Tetiana, Lada) + 1 male (Mykyta) voices;
- Fine-grained control over speech parameters, including duration, fundamental frequency (F0), and energy;
- High-fidelity speech generation using the RAD-TTS++ acoustic model;
- Fast vocoding using Vocos;
- Synthesizes long sentences effectively;
- Supports a sampling rate of 44.1 kHz;
- Tested on Linux environments and Windows/WSL;
- Python API (requires Python 3.9 or later);
- CUDA-enabled for GPU acceleration.
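
As a rough illustration of the requirements listed above, here is a minimal preflight sketch that uses only the Python standard library. It does not call the library itself: every function name below is hypothetical and is not part of the library's API. The `nvidia-smi` lookup is only a heuristic for spotting an NVIDIA driver, and the byte count assumes 16-bit mono PCM, which the post does not specify.

```python
import shutil
import sys

SAMPLE_RATE_HZ = 44_100  # sampling rate stated in the feature list
MIN_PYTHON = (3, 9)      # minimum Python version stated in the feature list


def python_is_supported(version=None):
    """Return True if the (major, minor) version meets the 3.9+ requirement."""
    major, minor = (version or sys.version_info)[:2]
    return (major, minor) >= MIN_PYTHON


def nvidia_driver_visible():
    """Heuristic: an `nvidia-smi` binary on PATH suggests an NVIDIA driver.

    This does not prove that CUDA is usable from Python.
    """
    return shutil.which("nvidia-smi") is not None


def samples_for(seconds, sample_rate=SAMPLE_RATE_HZ):
    """Number of samples in `seconds` of single-channel audio."""
    return round(seconds * sample_rate)


def pcm16_bytes_for(seconds, sample_rate=SAMPLE_RATE_HZ):
    """Raw size in bytes, assuming 16-bit (2-byte) mono PCM samples."""
    return samples_for(seconds, sample_rate) * 2


if __name__ == "__main__":
    print("Python 3.9+:", python_is_supported())
    print("NVIDIA driver visible:", nvidia_driver_visible())
    print("Samples in 10 s of audio:", samples_for(10))
    print("Bytes in 10 s of 16-bit mono PCM:", pcm16_bytes_for(10))
```

At 44.1 kHz, each second of output carries 44,100 samples, which is useful to keep in mind when budgeting memory for long sentences.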

Repository: https://github.com/egorsmkv/tts_uk