Adding the info about the batch size and correcting a mistake in the section "training data"
#2
by
carlosdanielhernandezmena
- opened
README.md
CHANGED
@@ -111,7 +111,7 @@ print(output[0].text)
|
|
111 |
|
112 |
### Training data
|
113 |
|
114 |
-
The specific
|
115 |
|
116 |
### Training procedure
|
117 |
|
@@ -124,6 +124,7 @@ This model is the result of finetuning the model ["parakeet-rnnt-1.1b"](https://
|
|
124 |
* learning rate: 2e-4
|
125 |
* devices=4
|
126 |
* num_nodes=8
|
|
|
127 |
* accelerator=accelerator
|
128 |
* strategy="ddp"
|
129 |
* max_epochs=50
|
|
|
111 |
|
112 |
### Training data
|
113 |
|
114 |
+
The specific datasets used to create the model are the ["cv17_es_other_automatically_verified"](https://huggingface.co/datasets/projecte-aina/cv17_es_other_automatically_verified) (784 hours and 50 minutes) in combination with around 485 hours of Spanish data taken from the split called "validated" of [Mozilla Common Voice 17.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_17_0)
|
115 |
|
116 |
### Training procedure
|
117 |
|
|
|
124 |
* learning rate: 2e-4
|
125 |
* devices=4
|
126 |
* num_nodes=8
|
127 |
+
* batch_size=8
|
128 |
* accelerator=accelerator
|
129 |
* strategy="ddp"
|
130 |
* max_epochs=50
|