PC-DDSP-LoFiVocoder Model Family

Overview

Welcome to the official Hugging Face repository for the PC-DDSP-LoFiVocoder Model Family, a collection of vocoder models for DiffSinger voicebanks used in OpenUTAU. The repository provides two model checkpoints taken at different stages of training, so users can choose the version that best suits their needs.

This repository was last updated on August 10, 2025.

Both versions are available for download as ZIP files, including the necessary model weights, configuration files, and associated documentation.
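
As a convenience, a downloaded archive can be unpacked and its contents listed with a few lines of Python. This is only a minimal sketch using the standard library: the function name `extract_vocoder` is hypothetical, and the layout of the files inside the ZIP is not assumed.

```python
import zipfile
from pathlib import Path


def extract_vocoder(zip_path, dest_dir):
    """Extract a downloaded vocoder ZIP into dest_dir.

    Returns the extracted file paths (relative to dest_dir), sorted,
    so the weights, configuration, and documentation can be located.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
    # List every regular file under the destination folder.
    return sorted(
        p.relative_to(dest).as_posix() for p in dest.rglob("*") if p.is_file()
    )
```

For example, `extract_vocoder("downloaded_vocoder.zip", "vocoders/ver_a")` would unpack the archive and return its file list; both the archive name and the target folder here are placeholders, so substitute the actual file you downloaded.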

Differences Between Version A and Version B

Ver A: Based on the latest checkpoint, trained up to August 10, 2025. Use this one for a cleaner output.
Ver B: Based on an earlier checkpoint. Use this one if you want a slightly more robotic output.

Ethical Considerations

This model is distributed under the CreativeML Open RAIL-M License, which promotes responsible AI use. Please adhere to the following:

Use the model only for lawful purposes and avoid harmful applications (e.g., exploitation, defamation, or generating false information; see LICENSE.md for full restrictions).
Include attribution to the original resources and this repository in any derivative works or redistributions.

Attribution

When using or redistributing this vocoder in your voicebanks, please credit the author usamireko and include both the LICENSE.md and NOTICE.md files. A suggested citation is:

"PC-DDSP-LoFiVocoder by usamireko, trained using resources from Scarfmonster/HiFiPLN (MIT), VocalSet (CC-BY 4.0), Cantoría Dataset (CC-BY 4.0), and a private dataset by Spoopy☆Ace/SpoopyAce. Available at https://huggingface.co/usamireko/PC-DDSP-LoFiVocoder."

Training Resources

This model was developed using the following datasets and codebases:

Code: Based on Scarfmonster/HiFiPLN, a community vocoder framework for DiffSinger, licensed under the MIT License.
VocalSet Dataset: DOI: 10.5281/zenodo.1442513, licensed under Creative Commons Attribution 4.0 International (CC-BY 4.0), provided by Julia Wilkins et al. at Northwestern University.
Cantoría Dataset: DOI: 10.5281/zenodo.5878677, licensed under Creative Commons Attribution 4.0 International (CC-BY 4.0), provided by Helena Cuesta et al. at Universitat Pompeu Fabra.
Private Dataset: Supplied by Spoopy☆Ace/SpoopyAce with explicit permission.

For detailed licensing terms and acknowledgments, refer to the LICENSE.md and NOTICE.md files included in the ZIP archives.

License and Legal Notices

This model is released under the CreativeML Open RAIL-M License, which grants permissions for use, modification, and distribution while imposing use-based restrictions to ensure responsible AI practices. Key points include:

No warranties or guarantees are provided; use at your own risk.
Redistribution must include the license and notice files.
See LICENSE.md for the full terms and Attachment A for restricted uses.

The NOTICE.md file contains specific attributions to the training resources and contributors.
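
Because redistribution must include both the license and notice files, it can help to check a voicebank archive before sharing it. The following is a minimal sketch, not an official tool: the helper name `missing_required_files` is hypothetical, and it assumes the two files may sit anywhere inside the archive, so it matches on file name only.

```python
import zipfile
from pathlib import PurePosixPath

# File names that the redistribution terms above require.
REQUIRED_FILES = ("LICENSE.md", "NOTICE.md")


def missing_required_files(zip_path, required=REQUIRED_FILES):
    """Return the required file names that are absent from the ZIP archive."""
    with zipfile.ZipFile(zip_path) as archive:
        # Compare on the final path component so nested folders are accepted.
        present = {PurePosixPath(name).name for name in archive.namelist()}
    return [name for name in required if name not in present]
```

An empty list means both files were found; any returned name should be added to the archive before it is redistributed.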

Contributing and Support

This is a community-supported project. For feedback, issues, or contributions:

Open an issue on this Hugging Face page.

Thank you for using PC-DDSP-LoFiVocoder!