Audio Classification
Chinese
music

Intro

For the 99 recordings, silence is first removed, which is done based on the annotation, targeting the parts where there is no technique annotation. Then all recordings are uniformly segmented into fixed-length segments of 3 seconds. After segmentation, clips shorter than 3 seconds are zero padded. This padding approach, unlike circular padding, is adopted specifically for frame-level detection tasks to prevent the introduction of extraneous information. Regarding the dataset split, since the dataset consists of 99 recordings, we split it at the recording level. The data is partitioned into training, validation, and testing subsets in a 79:10:10 ratio, roughly 8:1:1.

Demo (inference code)

https://huggingface.co/spaces/ccmusic-database/Guzheng_Tech99

Usage

from huggingface_hub import snapshot_download
model_dir = snapshot_download("ccmusic-database/Guzheng_Tech99")

Maintenance

GIT_LFS_SKIP_SMUDGE=1 git clone [email protected]:ccmusic-database/Guzheng_Tech99
cd Guzheng_Tech99

Results

Backbone Mel CQT Chroma
ViT-B-16 0.705 0.518 0.508
Swin-T 0.849 0.783 0.766
VGG19 0.862 0.799 0.665
EfficientNet-V2-L 0.783 0.812 0.697
ConvNeXt-B 0.849 0.849 0.805
ResNet101 0.638 0.830 0.707
SqueezeNet1.1 0.831 0.814 0.780
Average 0.788 0.772 0.704

Dataset

https://huggingface.co/datasets/ccmusic-database/Guzheng_Tech99

Mirror

https://www.modelscope.cn/models/ccmusic-database/Guzheng_Tech99

Evaluation

https://github.com/monetjoe/ccmusic_eval/tree/tech99

Cite

@article{Zhou-2025,
  author  = {Monan Zhou and Shenyang Xu and Zhaorui Liu and Zhaowen Wang and Feng Yu and Wei Li and Baoqiang Han},
  title   = {CCMusic: An Open and Diverse Database for Chinese Music Information Retrieval Research},
  journal = {Transactions of the International Society for Music Information Retrieval},
  volume  = {8},
  number  = {1},
  pages   = {22--38},
  month   = {Mar},
  year    = {2025},
  url     = {https://doi.org/10.5334/tismir.194},
  doi     = {10.5334/tismir.194}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train ccmusic-database/Guzheng_Tech99

Space using ccmusic-database/Guzheng_Tech99 1

Collection including ccmusic-database/Guzheng_Tech99