vocal-separation / README.md
github-actions[bot]
Sync to HuggingFace Spaces
039e024
---
title: Vocal Separation SOTA
emoji: 🎤
colorFrom: red
colorTo: gray
sdk: gradio
sdk_version: 4.37.2
app_file: app.py
pinned: false
license: mit
---
# Vocal Separation SOTA
[![Open in Spaces](https://huggingface.co/datasets/huggingface/badges/resolve/main/open-in-hf-spaces-lg-dark.svg)](https://huggingface.co/spaces/JacobLinCool/vocal-separation)
This is a demo for SOTA vocal separation models. Upload an audio file and the model will separate the vocals from the background music.
Based on the result of [MDX23](https://www.aicrowd.com/challenges/sound-demixing-challenge-2023/problems/music-demixing-track-mdx-23/leaderboards), the current SOTA model is [BS-RoFormer](https://arxiv.org/abs/2309.02612).
For comparison, you can also try the Mel-RoFormer model (a variant of BS-RoFormer) and the popular HTDemucs FT model.
## Models
- BS-RoFormer
- Mel-RoFormer
- HTDemucs FT
> The models are trained by the [UVR project](https://github.com/Anjok07/ultimatevocalremovergui).
> The code of this app is available on [GitHub](https://github.com/JacobLinCool/vocal-separation), any contributions should go there. Hugging Face Space is force pushed by GitHub Actions.