	---
	license: apache-2.0
	tags:
	- automatic-speech-recognition
	- gary109/AI_Light_Dance
	- generated_from_trainer
	model-index:
	- name: ai-light-dance_singing2_ft_wav2vec2-large-xlsr-53-v1
	results: []
	---

	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	# ai-light-dance_singing2_ft_wav2vec2-large-xlsr-53-v1

	This model is a fine-tuned version of [gary109/ai-light-dance_singing2_ft_wav2vec2-large-xlsr-53](https://huggingface.co/gary109/ai-light-dance_singing2_ft_wav2vec2-large-xlsr-53) on the GARY109/AI_LIGHT_DANCE - ONSET-SINGING2 dataset.
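	It achieves the following results on the evaluation set. These figures match the epoch 19 row (step 2128) of the training results table, which has the lowest validation loss of the run:
	- Loss: 0.5760
	- WER (word error rate): 0.2905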
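For readers unfamiliar with the metric, word error rate is the word-level edit distance between a hypothesis and its reference, divided by the number of reference words. The following is a minimal sketch of that definition in plain Python. It is not the evaluation code used for this model, the function name `word_error_rate` is illustrative, and the example strings are made up.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Made-up example: one substitution and one deletion over four reference words.
print(word_error_rate("a b c d", "a x c"))
```

A reported corpus-level WER is usually the total number of errors divided by the total number of reference words across all utterances, which can differ from the mean of per-utterance scores.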

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 4e-05
	- train_batch_size: 10
	- eval_batch_size: 10
	- seed: 42
	- gradient_accumulation_steps: 16
	- total_train_batch_size: 160
	- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: linear
	- lr_scheduler_warmup_steps: 100
	- num_epochs: 40.0
	- mixed_precision_training: Native AMP
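Two of these values follow from the others, and a short sketch makes the arithmetic explicit. The total train batch size is the per-device batch size multiplied by the gradient accumulation steps (assuming a single device, which is consistent with 10 x 16 = 160), and a linear scheduler with warmup ramps the learning rate from zero to its peak before decaying it linearly to zero. The total of 4480 steps is taken from the last row of the training results table. The function names are illustrative, and the code only mirrors the usual linear-with-warmup rule rather than reproducing the training script.

```python
LEARNING_RATE = 4e-05
TRAIN_BATCH_SIZE = 10
GRAD_ACCUM_STEPS = 16
WARMUP_STEPS = 100
TOTAL_STEPS = 4480  # final step in the training results table (40 epochs x 112 steps)


def effective_batch_size(per_device: int, accum_steps: int, num_devices: int = 1) -> int:
    """Number of samples contributing to each optimizer update."""
    # num_devices=1 is an assumption; the card does not state the device count.
    return per_device * accum_steps * num_devices


def linear_lr(step: int,
              peak: float = LEARNING_RATE,
              warmup: int = WARMUP_STEPS,
              total: int = TOTAL_STEPS) -> float:
    """Learning rate at a given optimizer step under linear warmup and decay."""
    if step < warmup:
        return peak * step / max(1, warmup)
    return peak * max(0.0, (total - step) / max(1, total - warmup))


print(effective_batch_size(TRAIN_BATCH_SIZE, GRAD_ACCUM_STEPS))
for step in (0, 50, 100, 2240, 4480):
    print(step, linear_lr(step))
```

Under these assumptions the learning rate reaches its peak of 4e-05 at step 100, slightly before the end of the first epoch (step 112).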

	### Training results

	| Training Loss | Epoch | Step | Validation Loss | Wer    |
	|:-------------:|:-----:|:----:|:---------------:|:------:|
	| 1.6560        | 1.0   | 112  | 1.7625          | 0.9265 |
	| 1.3693        | 2.0   | 224  | 1.5135          | 0.9243 |
	| 1.2172        | 3.0   | 336  | 1.2657          | 0.8533 |
	| 1.0456        | 4.0   | 448  | 1.0893          | 0.7691 |
	| 0.9385        | 5.0   | 560  | 1.0110          | 0.7097 |
	| 0.8165        | 6.0   | 672  | 0.9243          | 0.6682 |
	| 0.7491        | 7.0   | 784  | 0.8948          | 0.6583 |
	| 0.6772        | 8.0   | 896  | 0.7894          | 0.6007 |
	| 0.6096        | 9.0   | 1008 | 0.7684          | 0.5663 |
	| 0.5714        | 10.0  | 1120 | 0.6978          | 0.4826 |
	| 0.5213        | 11.0  | 1232 | 0.8433          | 0.4927 |
	| 0.4624        | 12.0  | 1344 | 0.6695          | 0.4469 |
	| 0.4298        | 13.0  | 1456 | 0.6569          | 0.3868 |
	| 0.3939        | 14.0  | 1568 | 0.6633          | 0.3694 |
	| 0.3803        | 15.0  | 1680 | 0.6376          | 0.3920 |
	| 0.3415        | 16.0  | 1792 | 0.6463          | 0.3414 |
	| 0.3239        | 17.0  | 1904 | 0.5841          | 0.3197 |
	| 0.2946        | 18.0  | 2016 | 0.5948          | 0.3112 |
	| 0.2751        | 19.0  | 2128 | 0.5760          | 0.2905 |
	| 0.2834        | 20.0  | 2240 | 0.5884          | 0.2975 |
	| 0.2383        | 21.0  | 2352 | 0.5989          | 0.2775 |
	| 0.2265        | 22.0  | 2464 | 0.6151          | 0.2853 |
	| 0.2158        | 23.0  | 2576 | 0.5843          | 0.2670 |
	| 0.2015        | 24.0  | 2688 | 0.6621          | 0.2738 |
	| 0.2150        | 25.0  | 2800 | 0.6068          | 0.2652 |
	| 0.1859        | 26.0  | 2912 | 0.6136          | 0.2570 |
	| 0.1745        | 27.0  | 3024 | 0.6191          | 0.2624 |
	| 0.1611        | 28.0  | 3136 | 0.6364          | 0.2578 |
	| 0.1513        | 29.0  | 3248 | 0.6402          | 0.2535 |
	| 0.1720        | 30.0  | 3360 | 0.6330          | 0.2500 |
	| 0.1488        | 31.0  | 3472 | 0.6275          | 0.2521 |
	| 0.1371        | 32.0  | 3584 | 0.6539          | 0.2540 |
	| 0.1356        | 33.0  | 3696 | 0.6544          | 0.2491 |
	| 0.1319        | 34.0  | 3808 | 0.6545          | 0.2491 |
	| 0.1465        | 35.0  | 3920 | 0.6573          | 0.2495 |
	| 0.1300        | 36.0  | 4032 | 0.6594          | 0.2494 |
	| 0.1244        | 37.0  | 4144 | 0.6651          | 0.2476 |
	| 0.1228        | 38.0  | 4256 | 0.6754          | 0.2497 |
	| 0.1181        | 39.0  | 4368 | 0.6684          | 0.2468 |
	| 0.1338        | 40.0  | 4480 | 0.6713          | 0.2471 |
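The table shows validation loss and WER reaching their minima at different epochs, which matters when choosing a checkpoint. The sketch below copies the per-epoch validation loss and WER columns from the table and locates each minimum. The variable and function names are illustrative, and the step numbers assume 112 optimizer steps per epoch, as in the table.

```python
# Per-epoch values copied from the training results table (epochs 1 to 40).
VAL_LOSS = [
    1.7625, 1.5135, 1.2657, 1.0893, 1.0110, 0.9243, 0.8948, 0.7894, 0.7684, 0.6978,
    0.8433, 0.6695, 0.6569, 0.6633, 0.6376, 0.6463, 0.5841, 0.5948, 0.5760, 0.5884,
    0.5989, 0.6151, 0.5843, 0.6621, 0.6068, 0.6136, 0.6191, 0.6364, 0.6402, 0.6330,
    0.6275, 0.6539, 0.6544, 0.6545, 0.6573, 0.6594, 0.6651, 0.6754, 0.6684, 0.6713,
]
WER = [
    0.9265, 0.9243, 0.8533, 0.7691, 0.7097, 0.6682, 0.6583, 0.6007, 0.5663, 0.4826,
    0.4927, 0.4469, 0.3868, 0.3694, 0.3920, 0.3414, 0.3197, 0.3112, 0.2905, 0.2975,
    0.2775, 0.2853, 0.2670, 0.2738, 0.2652, 0.2570, 0.2624, 0.2578, 0.2535, 0.2500,
    0.2521, 0.2540, 0.2491, 0.2491, 0.2495, 0.2494, 0.2476, 0.2497, 0.2468, 0.2471,
]
STEPS_PER_EPOCH = 112


def best_epoch(values):
    """Return the 1-based epoch with the smallest value (first one on ties)."""
    index = min(range(len(values)), key=lambda i: values[i])
    return index + 1


loss_epoch = best_epoch(VAL_LOSS)
wer_epoch = best_epoch(WER)
print(f"lowest validation loss: epoch {loss_epoch}, step {loss_epoch * STEPS_PER_EPOCH}")
print(f"lowest WER: epoch {wer_epoch}, step {wer_epoch * STEPS_PER_EPOCH}")
```

Validation loss is lowest at epoch 19 and drifts upward afterwards while training loss keeps falling, whereas WER keeps improving until epoch 39 (0.2468). Which checkpoint is preferable depends on whether loss or WER is the selection criterion.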


	### Framework versions

	- Transformers 4.21.0.dev0
	- PyTorch 1.9.1+cu102
	- Datasets 2.3.3.dev0
	- Tokenizers 0.12.1
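The card does not include a usage example. As a hedged sketch, the checkpoint could probably be loaded through the Transformers `pipeline` API for automatic speech recognition. This assumes the repository ships the processor and tokenizer files alongside the weights, that a reasonably recent Transformers release is installed, and that `ffmpeg` is available for decoding audio files. The helper name `transcribe` and the file name `sample.wav` are hypothetical.

```python
MODEL_ID = "gary109/ai-light-dance_singing2_ft_wav2vec2-large-xlsr-53-v1"


def transcribe(audio_path: str) -> str:
    """Run the fine-tuned checkpoint on one audio file and return the decoded text."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    # Downloads the checkpoint on first use; requires network access.
    asr = pipeline("automatic-speech-recognition", model=MODEL_ID)
    return asr(audio_path)["text"]


# Hypothetical usage, with sample.wav standing in for a local recording:
#     print(transcribe("sample.wav"))
```

Because the model was fine-tuned on the ONSET-SINGING2 dataset, the decoded output follows that dataset's label vocabulary, which the card does not describe.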