Add the batch size and correct a mistake in the "Training data" section

#2
Files changed (1)
  1. README.md +2 -1
README.md CHANGED
@@ -111,7 +111,7 @@ print(output[0].text)

  ### Training data

- The specific dataset used to create the model is called ["cv17_es_other_automatically_verified"](https://huggingface.co/datasets/projecte-aina/cv17_es_other_automatically_verified).
+ The specific datasets used to create the model are the ["cv17_es_other_automatically_verified"](https://huggingface.co/datasets/projecte-aina/cv17_es_other_automatically_verified) (784 hours and 50 minutes) in combination with around 485 hours of Spanish data taken from the split called "validated" of [Mozilla Common Voice 17.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_17_0)

  ### Training procedure

@@ -124,6 +124,7 @@ This model is the result of finetuning the model ["parakeet-rnnt-1.1b"](https://
  * learning rate: 2e-4
  * devices=4
  * num_nodes=8
+ * batch_size=8
  * accelerator=accelerator
  * strategy="ddp"
  * max_epochs=50
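
A note on how the new `batch_size=8` entry relates to the other hyperparameters: with `strategy="ddp"`, each process typically consumes its own batch, so the number of samples per optimizer step is the per-device batch size times the total number of devices. The sketch below works that out. It assumes `batch_size` is a per-device value and that no gradient accumulation is used; the README states neither, and the helper name `effective_batch_size` is hypothetical.

```python
def effective_batch_size(
    batch_size: int,
    devices: int,
    num_nodes: int,
    accumulate_grad_batches: int = 1,  # assumption: not given in the README
) -> int:
    """Samples contributing to one optimizer step under data-parallel training."""
    world_size = devices * num_nodes  # total number of DDP processes
    return batch_size * world_size * accumulate_grad_batches


# Values taken from the "Training procedure" list in the diff.
global_batch = effective_batch_size(batch_size=8, devices=4, num_nodes=8)
print(global_batch)  # 256
```

If `batch_size` is instead meant as a global value, the per-device figure would be `8 / 32`, which is not an integer, so the per-device reading seems the more plausible one.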