Audio Spaces
- Runtime error70π
- Running on T4947π
Seamless M4T
- Running on A10G4.5kπ΅
MusicGen
- Running on A10G774π
Audioldm Text To Audio Generation
- Sleeping284π
AudioLDM2 Text2Audio Text2Music Generation
- Runtime error220π
AudioSep
- Running154π΅π΅π΅
Lp Music Caps
- Running on T4238π’
Tortoise Tts
ExpressivText-to-Speech
- Sleeping13π
All In One
- Running on T42.06kπΈ
XTTS
- Paused188πΈπΆ
Coqui Bark Voice Cloning
- Running on A10G343π
VALL E X
- Sleeping189π₯
WavJourney
- Paused266πΆπ
Music To Image
- Running on A10G263π
MMS
- Running536π£οΈ
ElevenLabs TTS
- Build error287π
AudioGPT
- Running on T42.04kπΆ
Bark
- Runtime error36π©βπ€
SpeechT5 Speech Recognition Demo
- Runtime error172πΈ
CoquiTTS (Official)
- Running on L41.82kπ
Whisper
- Running on CPU Upgrade600πποΈ
Moe TTS
- Build error17π₯
YourTTS
- Running538π
Talking Face Generation with Multilingual TTS
- Runtime error563π
OpenAI TTS New
- Sleeping162π’
Mustango
- Sleeping55π
OWSM Demo
- Running on T4582π£οΈ
StyleTTS 2
Efficient, fast, and natural text to speech with StyleTTS 2!
- Running on T4357β‘
HierSpeech++ (Zero-shot TTS)
- Sleeping18π
Video2music
- Running on T4185π€«
Whisper Large V2
- Running on T456π
Musicgen Prompt Upsampling
- Running on A10G47π€
Qwen-Audio
- Runtime error514π
Seamless M4T v2
- Running on T4239π
Seamless Streaming
- Runtime error47π΅
Matcha TTS
- Running on Zero235π₯
MusicGen Streaming
- Running on T4281π
Resemble Enhance
- Running on A10G228πΌ
Singing Voice Conversion
- Sleeping50π§
NaturalSpeech2
- Paused21π₯
Create Your Own TTS Dataset
- Sleepingπ’
Podcast Transcription
- Running977π€
OpenVoice
- Runtime error94π»
M2UGen Demo
- Runtime error70π
Pheme
- Sleeping5π
ESPnet2 TTS
- Running13π
Whisper-WebUI
- Paused172π
Image2SFX Comparison
Generates audio environment from an image
- Running on T4380π¬οΈπ¬π
WhisperSpeech
- Build error146π£οΈ
MetaVoice 1B
A demo of MetaVoice 1B, a new TTS model by MetaVoice.
- Running on CPU Upgrade495π
TTS Arena
Vote on the latest TTS models!
- Running167π½
Whisper Speech X DreamTalk
Combine voice cloning and portrait lipsync animation
- Sleeping173π€
Canary 1b
- Paused75β‘
SALMONN Audio Questioning
Deeply interrogate audio file content
- Running on T4386π£οΈ
MeloTTS
Fast, efficient, & multilingual text-to-speech
- Running on Zero264π§
Audio Editing
Edit audios with text prompts
- Runtime error18π»
ChatMusician
- Running on CPU Upgrade61π§ββοΈπ§ββοΈπ§ββοΈ
xVASynth TTS
CPU powered, low RTF, emotional, multilingual TTS
- Running on Zero164π
NaturalSpeech3 FACodec
- Sleeping22βοΈ
Hey Gemma
- Configuration error68π£οΈποΈ
Ratchet + Whisper
- Paused3π
AutoSubs
Automatically add on-screen subs to your videos
- Build error162π
VoiceCraft
- Running on Zero119π
Tango2
Fast Text to Audio Generator
- Running on Zero735π₯
Parler-TTS
High-fidelity Text-To-Speech
- Running on A10G179π₯
Sing an idea β‘οΈ Music
Bring song ideas to life
- Running on Zero55π
Musicgen Songstarter Demo
- Paused91π
Whisper JAX
- Running on Zero16π’
AudioLCM
- Running on Zero155π»
Stable Audio Live Multiplayer
- Running on Zero346π₯
Stable Audio Open Zero
- Running on Zero12π
Make An Audio 3
- Sleeping60π
Mars5 Space
- Runtime error5π΅
Tango Music AF
Text to Music Generator
- Runtime error6π₯
Tango AF
Text to Audio Generator
- Running90π
BigVGAN
- Running on Zero64π
SenseVoice
- Running on Zero53π
CosyVoice 300M
- Running on Zero22π
PicoAudio
- Sleeping29πͺ©
MusiConGen
- Running14π
Mms Zeroshot
- Running138π
Qwen2 Audio Instruct Demo
- Running on Zero72π€
GPT SoVITS V2
- Running on Zero245π£
EzAudio
- Running on Zero205πΆ
OpenMusic
- Running on Zero439πΌπΆ
Midi Music Generator
- Running on Zero585π€―
Whisper Turbo
- Running on Zero254π€―
Realtime Whisper Turbo
Realtime implementation of Whisper large turbo
- Running116π
Whisper Large V3 Turbo WebGPU
ML-powered speech recognition directly in your browser
- Running on Zero4π
Text2midi
- Running on A10G287π
Fish Speech 1
- Running90π€π
TTS Spaces Arena
Vote on the top HF TTS models!
- Running on Zero15π£οΈ
Diva Realtime Chat
- Running on Zero1.17kπ£οΈ
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Running on Zero215π»
MaskGCT TTS Demo
MaskGCT TTS Demo
- Running on L40S91π¬
Fish Agent
An end-to-end (e2e) Voice Language Model by Fish Audio.