felerminoali commited on
Commit
eef61be
·
verified ·
1 Parent(s): 1078974

Model save

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
- base_model: bigscience/mt0-base
5
  tags:
6
  - generated_from_trainer
7
  model-index:
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  # mt5_pt_vmw
16
 
17
- This model is a fine-tuned version of [bigscience/mt0-base](https://huggingface.co/bigscience/mt0-base) on an unknown dataset.
18
 
19
  ## Model description
20
 
@@ -34,8 +34,8 @@ More information needed
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 5e-05
37
- - train_batch_size: 8
38
- - eval_batch_size: 8
39
  - seed: 42
40
  - distributed_type: multi-GPU
41
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
+ base_model: google/byt5-base
5
  tags:
6
  - generated_from_trainer
7
  model-index:
 
14
 
15
  # mt5_pt_vmw
16
 
17
+ This model is a fine-tuned version of [google/byt5-base](https://huggingface.co/google/byt5-base) on an unknown dataset.
18
 
19
  ## Model description
20
 
 
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 5e-05
37
+ - train_batch_size: 10
38
+ - eval_batch_size: 10
39
  - seed: 42
40
  - distributed_type: multi-GPU
41
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments