yangwang825 commited on
Commit
8df1ce2
·
verified ·
1 Parent(s): 784ad1e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -21
README.md CHANGED
@@ -19,43 +19,43 @@ Audio classification:
19
 
20
  | Dataset | Split Method | Classes | Task | # Clips | Average Duration | Sampling Rate |
21
  | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
22
- | WMMS | TT | 32 | Multi-class | 1697 | 10.42 | 16000 |
23
- | MSWC (English) | TVT | 271 | Multi-class | 33726 | 0.99 | 16000 |
24
- | MSWC (Spanish) | TVT | 146 | Multi-class | 11759 | 0.99 | 16000 |
25
- | MSWC (Indian) | TVT | 14 | Multi-class | 739 | 0.99 | 16000 |
26
  | ESC50 | 5-fold | 50 | Multi-class | 2000 | 5.00 | 44100 |
27
  | UrbanSound8K | | 10 | Multi-class | | | |
28
  | AudioSet | | 527 | Multi-label | | | |
29
  | MagnaTagATune | | | Multi-label | | | |
30
  | Medley-solos-DB | | 8 | Multi-class | | | 44100 |
31
- | Pianos | TVT | 8 | Multi-class | 668 | 4.86 | 16000 |
32
- | FSD-Kaggle-2019 (curated) | TT | 80 | Multi-label | 9451 | 8.93 | 44100 |
33
- | GTZAN | TVT | 10 | Multi-class | 930 | 30.02 | 22050 |
34
- | Nsynth (instrument) | TVT | 11 | Multi-class | 305979 | 4.00 | 16000 |
35
- | Nsynth (pitch) | TVT | 112 | Multi-class | 305979 | 4.00 | 16000 |
36
- | CREMA-D | TVT | 6 | Multi-class | 7442 | 2.54 | 16000 |
37
  | IEMOCAP | 5-fold | 4 | Multi-class | 5531 | 4.52 | 16000 |
38
- | EmoDB | TT | 7 | Multi-class | 535 | 2.77 | 16000 |
39
  | EMOVO | 6-fold | 7 | Multi-class | 588 | 3.12 | 48000 |
40
- | IRMAS | TT | 11 | Multi-label | 9579 | 7.16 | 44100 |
41
  | RAVDESS | 5-fold | 8 | Multi-class | 2880 | 3.70 | 48000 |
42
- | TIMIT | TVT | 630 | Multi-class | 6300 | 3.07 | 16000 |
43
- | LibriSpeech | TT | 2484 | Multi-class | 21933 | 3.75 | 16000 |
44
 
45
  Automated audio captioning:
46
 
47
  | Dataset | Split Method | # Clips | Average Duration | Sampling Rate |
48
  | :---: | :---: | :---: | :---: | :---: |
49
- | Music4All | T | 109269 | 29.99 | 48000 |
50
- | Clotho (v1.0) | TT | 3938 | 22.43 | 44100 |
51
 
52
  Music, speech, and noise:
53
 
54
- | Dataset | # Clips | Average Duration | Sampling Rate |
55
- | :---: | :---: | :---: | :---: |
56
- | MUSAN | 2016 | 195.16 | 16000 |
57
- | RIR-Noise | 61260 | 1.54 | 16000 |
58
- | ARCA23K | | | |
59
 
60
  ## Contact Us
61
 
 
19
 
20
  | Dataset | Split Method | Classes | Task | # Clips | Average Duration | Sampling Rate |
21
  | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
22
+ | WMMS | train/test | 32 | Multi-class | 1697 | 10.42 | 16000 |
23
+ | MSWC (English) | train/validation/test | 271 | Multi-class | 33726 | 0.99 | 16000 |
24
+ | MSWC (Spanish) | train/validation/test | 146 | Multi-class | 11759 | 0.99 | 16000 |
25
+ | MSWC (Indian) | train/validation/test | 14 | Multi-class | 739 | 0.99 | 16000 |
26
  | ESC50 | 5-fold | 50 | Multi-class | 2000 | 5.00 | 44100 |
27
  | UrbanSound8K | | 10 | Multi-class | | | |
28
  | AudioSet | | 527 | Multi-label | | | |
29
  | MagnaTagATune | | | Multi-label | | | |
30
  | Medley-solos-DB | | 8 | Multi-class | | | 44100 |
31
+ | Pianos | train/validation/test | 8 | Multi-class | 668 | 4.86 | 16000 |
32
+ | FSD-Kaggle-2019 (curated) | train/test | 80 | Multi-label | 9451 | 8.93 | 44100 |
33
+ | GTZAN | train/validation/test | 10 | Multi-class | 930 | 30.02 | 22050 |
34
+ | Nsynth (instrument) | train/validation/test | 11 | Multi-class | 305979 | 4.00 | 16000 |
35
+ | Nsynth (pitch) | train/validation/test | 112 | Multi-class | 305979 | 4.00 | 16000 |
36
+ | CREMA-D | train/validation/test | 6 | Multi-class | 7442 | 2.54 | 16000 |
37
  | IEMOCAP | 5-fold | 4 | Multi-class | 5531 | 4.52 | 16000 |
38
+ | EmoDB | train/test | 7 | Multi-class | 535 | 2.77 | 16000 |
39
  | EMOVO | 6-fold | 7 | Multi-class | 588 | 3.12 | 48000 |
40
+ | IRMAS | train/test | 11 | Multi-label | 9579 | 7.16 | 44100 |
41
  | RAVDESS | 5-fold | 8 | Multi-class | 2880 | 3.70 | 48000 |
42
+ | TIMIT | train/validation/test | 630 | Multi-class | 6300 | 3.07 | 16000 |
43
+ | LibriSpeech | train/test | 2484 | Multi-class | 21933 | 3.75 | 16000 |
44
 
45
  Automated audio captioning:
46
 
47
  | Dataset | Split Method | # Clips | Average Duration | Sampling Rate |
48
  | :---: | :---: | :---: | :---: | :---: |
49
+ | Music4All | train | 109269 | 29.99 | 48000 |
50
+ | Clotho (v1.0) | train/test | 3938 | 22.43 | 44100 |
51
 
52
  Music, speech, and noise:
53
 
54
+ | Dataset | Split Method | # Clips | Average Duration | Sampling Rate |
55
+ | :---: | :---: | :---: | :---: | :---: |
56
+ | MUSAN | train | 2016 | 195.16 | 16000 |
57
+ | RIR-Noise | train | 61260 | 1.54 | 16000 |
58
+ | ARCA23K | | | | |
59
 
60
  ## Contact Us
61