Update README.md
README.md
CHANGED
@@ -10,7 +10,7 @@ base_model:

 - Place the WAV and JSON files in `dev_data`.

-To distinguish the recognition performance of each part, the file names for
+To distinguish the recognition performance of each part, the file names of the training audio for Part One need to be prefixed with fold1-d-, those for Part Two need to be prefixed with fold1-a-, fold1-b-, fold1-c-, and those for Part Three need to be prefixed with fold1-e-. If the training audio file names for Part One and Part Three do not have the prefixes fold1-d- and fold1-e-, you will need to add them yourself. For example, if the file name of the training audio for Part One is 5402400A, then add the prefix to make it fold1-d-5402400A. If the file name of the audio for Part Three is audio_0001405, then add the prefix to make it fold1-e-audio_0001405. The process for the development set is the same, except that fold1 should be changed to fold2.

 Download the pre-trained Sentence-BERT model and tokenizer from the following URL
 and Place the downloaded pre-trained model and tokenizer inside the `../../qwen2_audio_baseline/Bert_pretrain`
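
The prefixing rule in the added README line (Part One gets `fold1-d-`, Part Three gets `fold1-e-`, with `fold2` in place of `fold1` for the development set) could be applied with a small script. The following is only a sketch under stated assumptions: the function names `prefixed_name` and `add_prefixes`, the `part` and `split` labels, and the idea that the audio files are `.wav` files sitting directly in one directory are all hypothetical and not part of the repository. Part Two files are assumed to already carry their `a`, `b`, or `c` prefixes, and JSON files are left untouched.

```python
from pathlib import Path

# Prefix letters as stated in the README text. Part Two (a, b, c) is assumed
# to be prefixed already, so it is deliberately not handled here.
PART_LETTER = {"one": "d", "three": "e"}
# fold1 for training audio, fold2 for the development set (per the README).
FOLD = {"train": "fold1", "dev": "fold2"}


def prefixed_name(name: str, part: str, split: str) -> str:
    """Return `name` with the fold/part prefix added, unless already present."""
    prefix = f"{FOLD[split]}-{PART_LETTER[part]}-"
    return name if name.startswith(prefix) else prefix + name


def add_prefixes(directory: str, part: str, split: str, dry_run: bool = True):
    """Rename .wav files in `directory` and return a list of (old, new) names.

    With dry_run=True (the default) nothing is renamed; the planned changes
    are only reported so they can be checked first.
    """
    changes = []
    # sorted() materializes the listing before any file is renamed.
    for path in sorted(Path(directory).iterdir()):
        if path.suffix.lower() != ".wav":
            continue  # assumption: only WAV audio needs the prefix
        new_name = prefixed_name(path.name, part, split)
        if new_name != path.name:
            changes.append((path.name, new_name))
            if not dry_run:
                path.rename(path.with_name(new_name))
    return changes
```

Calling `add_prefixes("dev_data", "one", "dev")` would list the planned renames for Part One development audio without touching any file; passing `dry_run=False` performs them. Because names that already start with the expected prefix are skipped, running the script twice should not add the prefix a second time.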