F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Demo for OpenF5-TTS
Ultra-fast Whisper Turbo inference ⚡
Dia - 1.6B Text-to-Dialogue Model
A demo of OpenDalle V1.1 on a ZERO GPU.
Robust, duration-controllable voice-cloning TTS
Generate a visual waveform video from audio
Demo for StepFun's Step Audio TTS 3B mode
Did StyleTTS 2 generate that audio?!?
Unofficial demo for TB-OCR (OCR for documents)
Fast & efficient ASR outperforming Whisper!
Generate MIDI music using RWKV v4!
Experiment26 7B GPU Demo
Sync F5-TTS demo
Search GitHub discussions for Hugging Face repositories
Search for audiobooks by keywords
Fast, efficient, & multilingual text-to-speech
Display top leaderboards and arenas
Automatically add on-screen subs to your videos
Request a reboot for OpenDalle v1.1 GPU Demo