Jbautistas
/

checkpoints

Automatic Speech Recognition

Generated from Trainer

8-bit precision

Model card Files Files and versions

Jbautistas commited on 10 days ago

Commit

f89ca32

·

verified ·

1 Parent(s): 0690774

End of training

Files changed (3) hide show

README.md +13 -7
model.safetensors +2 -2
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -6,6 +6,8 @@ license: apache-2.0
 base_model: openai/whisper-large-v3
 tags:
 - generated_from_trainer
 model-index:
 - name: Whisper large LoRA Merged Es - Jbautistas
   results: []
@@ -18,7 +20,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6809
 ## Model description
@@ -41,19 +44,22 @@ The following hyperparameters were used during training:
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 30
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss |
-|:-------------:|:-------:|:----:|:---------------:|
-| 0.294         | 6.8966  | 200  | 0.5184          |
-| 0.1686        | 13.7931 | 400  | 0.6167          |
-| 0.1306        | 20.6897 | 600  | 0.6527          |
-| 0.1143        | 27.5862 | 800  | 0.6809          |
 ### Framework versions

 base_model: openai/whisper-large-v3
 tags:
 - generated_from_trainer
+metrics:
+- wer
 model-index:
 - name: Whisper large LoRA Merged Es - Jbautistas
   results: []
 This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7153
+- Wer: 58.9379
 ## Model description
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 32
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 500
 - num_epochs: 30
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss | Wer     |
+|:-------------:|:-------:|:----:|:---------------:|:-------:|
+| 1.4943        | 6.2759  | 50   | 1.1849          | 58.2648 |
+| 1.1692        | 12.5517 | 100  | 1.0537          | 56.2453 |
+| 0.9958        | 18.8276 | 150  | 0.8947          | 54.8990 |
+| 0.7308        | 25.0    | 200  | 0.7153          | 58.9379 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:734bffb12c25b8ab44708e1e609175de30d7a9534d6a2cbf09aa349e924a8296
-size 1773492440

 version https://git-lfs.github.com/spec/v1
+oid sha256:8e0d45115d84043722ccc82be41e1aef7ddb20d56253af49d622abc7417ae91c
+size 1773489880

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4033888a576f3eee17d7c24bdd78a83b77ccc818cbffd2e67a63d01b504b6425
 size 5496

 version https://git-lfs.github.com/spec/v1
+oid sha256:943ac7bc7fc365f2500a10c2faaa689a01076ff704329da2ae31ffd48da1b409
 size 5496