Audio Spaces
- 71📈
- 951
Seamless M4T
📞 - 4.95k
MusicGen
🎵Generate music from text descriptions
- 810
Audioldm Text To Audio Generation
🔊Generate audio from text descriptions
- 305
AudioLDM2 Text2Audio Text2Music Generation
🔊Generate audio and waveform video from text
- 221
AudioSep
🐠 - 165
Lp Music Caps
🎵Create music captions from audio files
- 301
Tortoise Tts
🐢ExpressivText-to-Speech
- 22
All In One
📊 - 2.59k
XTTS
🐸Generate realistic voice synthesis using text and reference audio
- 189
Coqui Bark Voice Cloning
🐸 - 359
VALL E X
🎙Generate audio from text using voice prompts
- 192
WavJourney
🔥 - 264
Music To Image
🎶 - 278
MMS
🌍Transform and identify speech with MMS
- 586
ElevenLabs TTS
🗣Generate realistic voices from text
- 288
AudioGPT
🚀 - 2.3k
Bark
🐶Generate realistic audio from text
- 36
SpeechT5 Speech Recognition Demo
👩 - 173
CoquiTTS (Official)
🐸 - 2.24k
Whisper
📉Transcribe audio from microphone, files, or YouTube
- 635
Moe TTS
😊Generate and convert speech using text and audio inputs
- 17
YourTTS
🔥 - 553
Talking Face Generation with Multilingual TTS
👄Generate a talking face video from text
- 562
OpenAI TTS New
📊 - 167
Mustango
🐢 - 55
OWSM Demo
🔊 - 650
StyleTTS 2
🗣Efficient, fast, and natural text to speech with StyleTTS 2!
- 393
HierSpeech++ (Zero-shot TTS)
⚡Generate high-quality speech from text using a prompt audio
- 21
Video2music
📚Generate music for a video based on its content and key
- 187
Whisper Large V2
🤫 - 64
Musicgen Prompt Upsampling
🌖Generate music from text prompts 🎶
- 67
Qwen-Audio
🎤Interact with a chatbot using text and audio
- 515
Seamless M4T v2
📞 - 296
Seamless Streaming
📞Translate text into different languages
- 51
Matcha TTS
🍵Generate speech from text input
- 270
MusicGen Streaming
🔥Generate music from text prompts
- 352
Resemble Enhance
🚀Enhance and clean audio files
- 259
Singing Voice Conversion
🎼Transform your voice into a singer's
- 50
NaturalSpeech2
🎧 - 21
Create Your Own TTS Dataset
🔥 Podcast Transcription
🐢- 1.06k
OpenVoice
🤗 - 95
M2UGen Demo
💻 - 69
Pheme
📊 - 6
ESPnet2 TTS
📈Generate speech from text in multiple languages
- 22
Whisper-WebUI
🚀Generate subtitles and translate them
- 170
Image2SFX Comparison
👂Generates audio environment from an image
- 380
WhisperSpeech
🌬 - 146
MetaVoice 1B
🗣A demo of MetaVoice 1B, a new TTS model by MetaVoice.
- 750
TTS Arena V2
🏆Vote on the latest TTS models!
- 172
Whisper Speech X DreamTalk
😽Combine voice cloning and portrait lipsync animation
- 198
Canary 1b
🐤Transcribe and translate audio into text
- 456
MeloTTS
🗣Fast, efficient, & multilingual text-to-speech
- 291
Audio Editing
🎧Edit audios with text prompts
- 18
ChatMusician
💻 - 70
xVASynth TTS
🧝CPU powered, low RTF, emotional, multilingual TTS
- 179
NaturalSpeech3 FACodec
🏃Convert and reconstruct speech files
- 25
Hey Gemma
☎ - 70
Ratchet + Whisper
🗣 - 3
AutoSubs
📜Automatically add on-screen subs to your videos
- 161
VoiceCraft
📈 - 307
TangoFlux
🚀Text to Audio (Sound SFX) Generator
- 824
Parler-TTS
🥖High-fidelity Text-To-Speech
- 184
Sing an idea ➡️ Music
🔥Bring song ideas to life
- 74
Musicgen Songstarter Demo
👁Generate music using descriptions and optional melody audio
- 145
Whisper JAX
👀Transcribe or translate audio from microphone, file, or YouTube
- 21
AudioLCM
🏢Generate audio from text
- 159
Stable Audio Live Multiplayer
💻Generate audio from text prompts
- 428
Stable Audio Open Zero
🔥Generate audio from text prompts
- 13
Make An Audio 3
🐠Generate audio from text
- 60
Mars5 Space
📉 - 5
Tango Music AF
🎵Text to Music Generator
- 100
BigVGAN
🔊Generate high-fidelity audio from input audio waveforms
- 90
SenseVoice
🐠Transcribe audio with emotions and events
- 59
CosyVoice 300M
📉 - 26
PicoAudio
📈Generate audio from text descriptions with timestamps
- 6
Audio Flamingo Demo
📚 - 29
MusiConGen
🪩 - 17
Mms Zeroshot
🌍Generate transcript from audio input
- 189
Qwen2 Audio Instruct Demo
🌍Interact with a multimodal chatbot using text and audio
- 147
GPT SoVITS V2
🤗Generate voice from text using reference audio
- 263
EzAudio
🟣Generate and edit audio from text prompts
- 216
OpenMusic
🎶Generate high-quality music from text descriptions
- 515
Midi Music Generator
🎼Generate MIDI music from prompts
- 902
Whisper Turbo
🤯Transcribe audio or YouTube videos to text
- 312
Realtime Whisper Turbo
🤯Realtime implementation of Whisper large turbo
- 157
Whisper Large V3 Turbo WebGPU
🚀ML-powered speech recognition directly in your browser
- 503
Fish Speech 1
🏆Generate audio from text with voice customization
- 367
TTS Spaces Arena
🤗Blind vote on HF TTS models!
- 18
Diva Realtime Chat
🗣Convert spoken words to text and voice assistant responses
- 2.33k
F5-TTS
🗣F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- 257
MaskGCT TTS Demo
😻MaskGCT TTS Demo
- 91
MelodyFlow
🎵Generate music from text and melody
- 143
Fish Agent
💬An end-to-end (e2e) Voice Language Model by Fish Audio.
- 65
Nexa Omni Demo
🎧Generate text from audio input
- 208
CosyVoice2-0.5B
🥳Generate realistic voice audio from text and audio prompts
- 2.61k
Kokoro TTS
❤Upgraded to v1.0!
- 108
Make Custom Voices With KokoroTTS
⚡Make Custom Voices With KokoroTTS
- 299
Llasa 3b Tts
🔥Zero Shot voice cloning with llasa 3b (Unofficial Demo)
- 12
Llasa 1b Multilingual TTS
🌍Generate speech from text with or without cloning a voice
- 314
Kokoro Text-to-Speech (WebGPU)
🗣High-quality speech synthesis powered by Kokoro TTS
- 40
Hibiki Simple
👄High-Fidelity Simultaneous Speech-To-Speech Translation
- 377
Zonos
🌍Generate high-quality audio from text using various controls
- 66
Kokoro Web
🗣ML-powered speech synthesis directly in your browser
- 585
Di♪♪Rhythm
🎶Blazingly Fast and Embarrassingly Simple Song Generation
- 20
Audiobox Aesthetics
📚Demo for audiobox-aesthetics
- 230
Spark TTS
🌖A text-to-speech model powered by SparkAudio and Mobvoi.
- 791
Sesame CSM
🌱Conversational speech generation
- 198
Orpheus TTS
🚀Try Orpheus TTS here
- 31
Canary 1B Flash
🐤Canary 1B Flash demo
- 68
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
🎙Generate customized voice音频 from text
- 5
AudioMorphix
🌊Setup and run a Gradio app
- 73
MegaTTS3 Demo
👋 - 113
AudioX
👀Generate audio and video from text prompts
- 86
Vevo for Zero-shot VC, TTS, and More
🐠Controllable Zero-Shot Voice Imitation
- 1.37k
Dia 1.6B
👯Generate realistic dialogue from a script, using Dia!
- 39
Aero 1 Audio Demo
💬Demo for Aero-1-Audio
- 35
Voila Demo
💻Chat with a voice-clone AI
- 399
ACE Step
😻A Step Towards Music Generation Foundation Model
- 1
Audio Difficulty Estimator
🎹Estimate piano difficulty from audio