AICoverGen
Run image generation application
Run image generation application
Generate sexual voice sounds from text
Generate voice from text with style
Generate and convert speech using text and audio inputs
Generate speech from text
Launch a web interface for text generation
Convert audio to different voices
Combine and process audio files
Vocal and background audio separator
Generate audio from text using a voice synthesis model
[Chinese/English/Japanese] multilingual text-to-speech
Generate audio from text prompts
A simple, high-quality voice conversion tool
Clone voice to say text
Voice conversion framework based on VITS
Generate anime character voice from text
Launch a web-based user interface
Install dependencies and start an audio application
Download and prepare voice conversion models
Generate audio from text using voice synthesis
Generate anime character voice from text
Generate audio from text using selected speaker and language
Generate audio from text using VITS
Generate Japanese speech from text
Generate speech from text using various voice models
Generate MIDI music from prompts
Generate Japanese lyrics
Generate audio from text with a custom voice
Generate Japanese voice from text
Generate Japanese audio from text
Convert text to speech using multiple school voice models
Generate audio from text with ChatGPT integration
Generate realistic audio from text
Generate customized spoken audio from text and voice reference
Convert text to speech in multiple languages
Generate music from text and melody descriptions
Reconstruct and convert voice audio
Generate Talking avatars from Text-to-Speech
Convert audio voices using models
Generate sound effects for silent videos
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
Execute dynamic code
Get a music sample inspired by the mood of an image
In-browser speech recognition w/ word-level timestamps
Vote on the latest TTS models!
Text-To-Speech (TTS) Evaluation using objective metrics.
Search and explore LAKH MIDI dataset with MidiCaps
Generate audio from text descriptions
Search and explore 179k+ MIDI titles
Transcribe audio with emotions and events
Transcribe audio to text with speaker diarization
Separate vocals from background in audio
Generate audiobook-style speech from text
Convert text to speech using band character voices
Generate audio from text with speaker selection and language translation
Convert and reconstruct speech files
Convert text to speech
Vote on the top Japanese TTS models!
Convert text to voice using a musical model
Genshin Impact Game Style Music Generator
Generate speech from text using various voices
Easy training helper for RVC
Generate talking face video from image and audio
Convert and manipulate audio voices
Classify audio into NSFW categories
Generate voice with Style-Bert-VITS2
A demo of RVC pip
Clone voices from audio files
Generate speech from text in multiple languages
Chat with a bot using text and audio
Harmonize and mix any MIDI melody
Create a spectrogram and get audio info
Generate and modify audio with models
Separate vocal and instrumental tracks from audio
Generate speech from text with reference audio
Generate audio from text or PDF
Remove vocals from your music tracks easily
High-fidelity Text-To-Speech
Generate Japanese audio from text
Generate audio from text for anime characters
Generate Animalese audio from text
Convert text to Animalese using sound models
Generate audio from text prompts
Generate audio from text with voice customization
Transcribe and summarize YouTube videos or audio files
Launch a web interface for downloading and managing YouTube videos
Convert text to animal-like speech
Start web UI for image generation
Convert audio and images to different formats
Convert and train voice models
An easy-to-use voice conversion framework based on VITS.
Clone voices by typing text and providing a reference audio file
Generate speech and translate audio using AI models
Transform a report or document into an interview/discussion
Super-fast voice assistant
Convert text to speech
Generate music using descriptions and optional melody audio
Generate audio with voice conversion
Transform and render any MIDI
Generate POP music medley with Imagen diffusion transformer
Classify absolutely any MIDI by genre, song and artist
Intelligently compare any pair of MIDIs
Browse and download ChatTTS speaker embeddings
Generate a seamless bridge between two composition parts
Generate speech from text
Generate speech from text with customizable parameters
Fixed fork of the original AudioSR!
Convert voice to match another using reference audio
Generate audio responses from uploaded or recorded audio
Retrieval augmented harmonization of any MIDI melody
Add a unique melody to any MIDI file
Mix chords from one MIDI to another MIDI
Convert Morse code to audio
Convert and modify voices in audio files
Get lyrics from a Genius link
Groq API Playground
Generate speech quality score from audio
LMSYS bench for audio agents
Combine audio with a video or image to create a lip-synched video
Create lifelike animated videos using a photo and audio
Create a video by syncing spoken audio to an image
Description of Matcha TTS Japanese
Generate clean audio from noisy recordings
High-fidelity Text-To-Speech
Generate and edit audio from text prompts
Transcribe audio to text with timestamps
Benchmark model load and TTS time
Give your space a voice! (Demo)
Answer questions about audio
Analyze audio and answer questions about it
Generate high-quality music from text descriptions
Generate a 2-speaker podcast from text input or documents!
Enjoy TTS Chat
Create interactive spoken dialogue using audio input
Generate audio with text and reference audio
Controlled source augmented rock music transformer
Long-form Musicgen
Convert text to speech in multiple languages
Generate realistic-sounding AI voice from text
Generate speech in the style of Yoshihide Suga (菅義偉) from text
Generate lip-synced talking head video from audio
Generate detailed script for podcast or lecture from text input
Request evaluation results for a speech model
Personalised Podcasts For All - Available in 13 Languages
Transcribe and translate Japanese & English audio
Fast, efficient, & multilingual text-to-speech
Transcribe and translate audio into text
Generate audio from text
Transcribe or translate audio and YouTube videos
Realtime implementation of Whisper large turbo
ML-powered speech recognition directly in your browser
Expressive Text-to-Speech
Generate speech from text with accentuation
Download video or audio from URL
Unlimited audio generation with a few added features
Restore degraded audio using a Transformer-based model
whisper3 turbo
Generate audio from text using selected character voices
Generate audio from text using a customizable voice model
Transform audio with pre-trained models and customize settings
Transcribe audio to text with style options
Separate vocals and instruments from audio
Enhance and clean audio files
Transcribe audio to text
Convert audio voices to match a chosen model
Generate a podcast from text, URLs, PDFs, and images
Generate and apply matching music background to video shot
Generates a sound effect that matches video shot
Generates audio environment from an image
Clone voices for custom TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Blind vote on HF TTS models!
CPU powered, low RTF, emotional, multilingual TTS
Generate music powered by AI
Co-Speech Gesture Video Generation
Transcribe Japanese audio to text
Generate text from audio recordings
Whisper model to transcribe Japanese audio to katakana.
Better AI powered platform to purify your speech signal
Text | Image | Audio | Video to Spectrogram || Steganography
Generate images using various models
Separate audio tracks from music files
Convert spoken words to text and voice assistant responses
Transcribe and diarize your audio recordings
Stable audio open model from Synthio paper.
Fast & efficient ASR outperforming Whisper!
Generate a video from audio with customizable waveform
Generate MIDI music using RWKV v4!
Transcribe MP3 files with Whisper; use a GPU to convert faster!
Efficient, fast, and natural text to speech with StyleTTS 2!
MaskGCT TTS Demo
Generate music from text and melody
Transcribe audio or YouTube videos
Self-correcting multi-instrumental chords transformer
Chords-conditioned music transformer
Ultra-fast Whisper Turbo inference ⚡
Generate a video waveform from text-based audio descriptions
In-Browser Audio Wake-Word Spotting
Streamlit pianoroll playback element
Audio-Separator by Politrees
Fast multi-instrumental music transformer
Streamlit browser for piano music datasets.
Demo of masking tasks from the PIANO dataset
An end-to-end (e2e) Voice Language Model by Fish Audio.
Separate audio stems and convert to MIDI
Generate podcasts with AI avatars
Create personalized voice clips with emotion
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate MIDI music sequences
Did StyleTTS 2 generate that audio?!?
base model for mono-channel completion
Create and clone voices for text-to-speech conversion
Launch web-based text-to-speech interface
Upgraded to v1.0!
Convert text to speech in multiple languages
Generate text from audio input
MaskGCT TTS Demo
Generate audio from text
Generate Voice Clones
Spanish finetune for the original F5 model.
Generate speech from text
Convert audio to lip-sync data
Generate speech from text in multiple languages
I made an AI voice synthesis model of Shalltear.
I made an AI voice synthesis model of Ranma Saotome (female form).
I made an AI voice synthesis model of Beatrice.
Talk to Fixie.ai's Ultravox with WebRTC ⚡️
Estimate physical properties merely from pouring sound!
Create interactive HTML web pages with your voice
Generate videos by adding speech to images or videos
Record audio, then use AI to transcribe and translate it.
Large and fast music transformer for pitches inpainting
Generate speech from text using selected language and speaker
Generate audio from text with voice control
Generate music based on text and melody
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a
Versatile audio super resolution (any -> 48kHz) with AudioSR
Generate human-like speech from text
TTS tool
I made an AI voice synthesis model of Mayu Nekoyashiki.
Search for similar game voice samples
A demo of Indic Parler-TTS
Transform text to speech and speech to text
Verify speakers using voice samples
Target Speaker Extraction with WeSep
Transcribe speech into text
WebGPU text-to-Speech powered by OuteTTS and Transformers.js
A home for scoring speech quality
Unofficial benchmark by Fish Speech
Generate chupa sounds from text or audio
Generate Japanese speech from text
Detect emotions from an audio file
Generate audio from video or text prompts
Clone a voice with text input
Text to Audio (Sound SFX) Generator
Talk to Kyutai's moshi - powered by Gradio WebRTC!
Generate high-quality speech from text using a prompt audio
Talk to the Gradio docs! Powered by Pydantic and WebRTC ⚡️
One-minute creation by AI Coding Autonomous Agent MOUSE-I
Generate music from text prompts
Generate sound from video/text and search
Classify audio sounds and voices
Real-time in-browser speech recognition
Talk with OpenAI's new Realtime Voice API
Separate noisy audio into clean speaker tracks
Extract sounds from audio using text prompts
Generate edited English speech from audio and text
Music Genre Classifier
Guzheng Performance Technique Recognizer
Chinese Traditional Instrument Sound Retriever
Chinese Music Pentatonic Mode Detector
Manipulate audio properties like speed, volume, and format
Video to Audio
Transcribe audio to text from URLs or uploads
Convert your audio to 8D
Audio-Separator Demo
Yet another Real-time Whisper with WebGPU, written in Vue
Identify any MIDI
Yet another Real-time in-browser STT, re-implemented in Vue
Airi VTuber. LLM-powered Live2D/VRM living character.
Figured bass calculator
Added improvements, 1107+ languages supported
V1.0: Convert any ebook to audiobook with XTTS + voice cloning!
Converts Ebooks into audiobooks with piper-tts
First ebook2audiobook Dockerfile test
Audio Visualization Circle Effect Tool
Ready-to-play synth instrument!
Genshin Impact & Honkai Star Rail Game Character Voice TTS
Erhu Performance Technique Recognizer
Discriminator of Bel Canto and Chinese Folk Singing
Piano Sound Quality Classifier
Discriminator of Chest Voice and Falsetto
Generate realistic voice audio from text and audio prompts
Ultra-fast and very well fitted solo Piano music transformer
I made an AI voice synthesis model of Hestia.
I made an AI voice synthesis model of Freya.
Hands-Free AI Voice Chat with a Retro Vibe
Hands-Free AI Voice Chat with a Retro Vibe
Hands-Free AI Voice Chat with a Retro Vibe
One-minute creation by AI Coding Autonomous Agent MOUSE-I
Separate music and vocals from audio
A benchmark for open-source multi-dialect Arabic ASR models
Generate music from text prompts
High-fidelity Text-To-Speech
Audio Conditioned LipSync with Latent Diffusion Models
Transform your voice into a singer's
Generate speech from text with different speakers
Deepfake Detection
Audio edit
✨[With v1.0.0] Accelerated TTS on Kokoro-82M
Accessible PDF and pasted-text to speech converter with gTTS
Communicate with an AI assistant and convert text to speech
Convert text to audio
Audio Visualizer
Text to Audio (Sound SFX) Generator
Generate audio from text or modify voice pitch
Search and play Karaoke MIDI by title, lyrics, or summary
Search music using keywords
G2P
Better AI powered platform to purify your speech signal
I made an AI voice synthesis model of 結束いのり.
I made an AI voice synthesis model of the female Hero from Dragon Quest III.
I made an AI voice synthesis model of 喜屋武飛夏.
One-minute creation by AI Coding Autonomous Agent MOUSE-I
Generate speech from text with customizable voices
Korean speech transcription (text) and English translation (Korean)
Demo for Jasco Model Music Stems Generation
High-quality speech synthesis powered by Kokoro TTS
Transcribe and summarise audio files using AI.
NetEase Cloud Music MP3 Direct URL Parser
GPT-SoVITS for MITA!
Guided melody accompaniment generation with transformers
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Interact with a chatbot using text and audio
A humble space for trying EGTTS V0.1
Generate music from lyrics and genre tags
work in progress
Make Custom Voices With KokoroTTS
Mix random MIDI loops into one coherent music composition
Convert text to speech online
Convert spoken words to text
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Generate soundfonts with latent flow matching
beepbox
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
Audio Gen, Audio Style Transfer and Audio InPainting
Talk to Fixie.ai's Ultravox with WebRTC ⚡️