gglabs/stt-turbo-multilingual-test

Files changed (11)

README.md +14 -24
adapter_config.json +2 -2
adapter_model.safetensors +1 -1
runs/Aug16_05-02-49_ubuntu/events.out.tfevents.1755288169.ubuntu.863233.0 +3 -0
runs/Aug16_05-03-55_ubuntu/events.out.tfevents.1755288235.ubuntu.871478.0 +3 -0
runs/Aug16_05-05-24_ubuntu/events.out.tfevents.1755288325.ubuntu.880121.0 +3 -0
runs/Aug16_05-08-11_ubuntu/events.out.tfevents.1755288491.ubuntu.897503.0 +3 -0
runs/Aug16_05-10-49_ubuntu/events.out.tfevents.1755288649.ubuntu.912455.0 +3 -0
runs/Aug16_05-13-02_ubuntu/events.out.tfevents.1755288783.ubuntu.926962.0 +3 -0
runs/Aug16_05-15-33_ubuntu/events.out.tfevents.1755288934.ubuntu.941520.0 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -9,19 +9,19 @@ tags:
 metrics:
 - wer
 model-index:
-- name: Whisper Turbo Multilingual (ko, ja, zh, en)
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Turbo Multilingual (ko, ja, zh, en)
 This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) on the custom_multilingual dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3860
-- Wer: 15.9354
 ## Model description
@@ -40,32 +40,22 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
-- train_batch_size: 16
-- eval_batch_size: 32
 - seed: 42
-- gradient_accumulation_steps: 4
-- total_train_batch_size: 64
-- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 200
-- training_steps: 5000
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss | Wer     |
-|:-------------:|:-------:|:----:|:---------------:|:-------:|
-| 0.3244        | 1.0971  | 500  | 0.3557          | 19.6171 |
-| 0.2701        | 2.1942  | 1000 | 0.3490          | 19.5934 |
-| 0.224         | 3.2913  | 1500 | 0.3503          | 17.5891 |
-| 0.2132        | 4.3884  | 2000 | 0.3518          | 16.9210 |
-| 0.1865        | 5.4855  | 2500 | 0.3571          | 16.2908 |
-| 0.1661        | 6.5826  | 3000 | 0.3652          | 16.0491 |
-| 0.1467        | 7.6796  | 3500 | 0.3692          | 16.4471 |
-| 0.136         | 8.7767  | 4000 | 0.3762          | 15.9496 |
-| 0.1229        | 9.8738  | 4500 | 0.3816          | 15.8453 |
-| 0.1146        | 10.9709 | 5000 | 0.3860          | 15.9354 |
 ### Framework versions

 metrics:
 - wer
 model-index:
+- name: Whisper Turbo Multilingual
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Whisper Turbo Multilingual
 This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) on the custom_multilingual dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4791
+- Wer: 18.75
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.001
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 5
+- training_steps: 20
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer    |
+|:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.3929        | 1.0   | 10   | 0.7320          | 21.875 |
+| 0.076         | 2.0   | 20   | 0.4791          | 18.75  |
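
The `Wer` column above appears to report word error rate as a percentage. The card does not say how the metric was computed (text normalization, tokenization), so the following is only a minimal sketch of the standard definition, word-level edit distance divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent: word-level edit distance / reference length * 100."""
    ref = reference.split()  # assumption: whitespace tokenization
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of four reference words.
    print(word_error_rate("the cat sat down", "the cat sit down"))
```

Whitespace splitting does not suit languages written without spaces between words, such as Japanese or Chinese; a character-level variant is a common alternative there, but which one this repository used is not stated.
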
 ### Framework versions
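
The hyperparameters in this diff combine `lr_scheduler_type: linear` with a warmup (5 warmup steps out of 20 on the new side, 200 out of 5000 on the old side). As a rough illustration of what such a schedule does, here is a pure-Python sketch of a linear warmup followed by linear decay to zero. The function `linear_schedule_lr` is hypothetical and written for this note; it follows the usual shape of a linear-with-warmup schedule and is not taken from the training script, which is not part of this commit.

```python
def linear_schedule_lr(step: int, peak_lr: float, warmup_steps: int, total_steps: int) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Ramp linearly from 0 up to peak_lr over the warmup steps.
        return peak_lr * step / max(1, warmup_steps)
    # Decay linearly from peak_lr at the end of warmup to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # New-side settings from the card: learning_rate 0.001, 5 warmup steps, 20 training steps.
    for step in (0, 5, 10, 20):
        print(step, linear_schedule_lr(step, 0.001, 5, 20))
```

With only 20 steps, a quarter of the run is spent in warmup, whereas the old configuration spent 200 of 5000 steps (4%) there.
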

adapter_config.json CHANGED

@@ -28,9 +28,9 @@
   "revision": null,
   "target_modules": [
     "q_proj",
-    "k_proj",
     "v_proj",
-    "o_proj"
   ],
   "task_type": null,
   "trainable_token_indices": null,

   "revision": null,
   "target_modules": [
     "q_proj",
+    "o_proj",
     "v_proj",
+    "k_proj"
   ],
   "task_type": null,
   "trainable_token_indices": null,
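
The `target_modules` hunk above lists the same four projection names on both sides, only in a different order. A quick way to confirm that such a change is a pure reordering is to compare the two lists as sets. The sketch below copies both lists from the diff; the helper name `same_modules` is made up for illustration.

```python
# target_modules as shown on the old and new side of the diff
old_targets = ["q_proj", "k_proj", "v_proj", "o_proj"]
new_targets = ["q_proj", "o_proj", "v_proj", "k_proj"]


def same_modules(a: list, b: list) -> bool:
    """True when both lists name exactly the same modules, ignoring order."""
    return len(a) == len(b) and set(a) == set(b)


if __name__ == "__main__":
    print("order changed:", old_targets != new_targets)
    print("same modules:", same_modules(old_targets, new_targets))
```

Since the names are matched against module names rather than applied in sequence, the reordering is unlikely to affect which layers receive adapters, though that depends on the adapter library version in use.
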

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1ea6247b8e16801be0648d2b73027fcf6408ab2ea750ce6d71f4920d972aae13
 size 78677456

 version https://git-lfs.github.com/spec/v1
+oid sha256:227bca4c5b5125ea6832cd4d58e2ad75d97cd3d3e161a850a9c45a7de5ab59e8
 size 78677456
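
The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the object ID, and the size in bytes. A minimal sketch of reading such a pointer in Python is shown below. The function `parse_lfs_pointer` is a hypothetical helper, and it assumes the simple three-line layout seen in this diff rather than handling every case in the LFS specification.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer of the form shown in the diff into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


if __name__ == "__main__":
    # New-side pointer for adapter_model.safetensors, copied from the diff.
    pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:227bca4c5b5125ea6832cd4d58e2ad75d97cd3d3e161a850a9c45a7de5ab59e8
size 78677456
"""
    print(parse_lfs_pointer(pointer))
```
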

runs/Aug16_05-02-49_ubuntu/events.out.tfevents.1755288169.ubuntu.863233.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:306154d2fea16bf6399d26b63a0f9aee1d1cc02f5ea03fc24361f2b39accb135
+size 6394

runs/Aug16_05-03-55_ubuntu/events.out.tfevents.1755288235.ubuntu.871478.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4f2eec87cae71b9e2f2356a076752b79709d4edd4805f5f2df24ea6e13b7daee
+size 6394

runs/Aug16_05-05-24_ubuntu/events.out.tfevents.1755288325.ubuntu.880121.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ab62afcb1338e4a7bd611e5433bb088f9b845bd142bb1bc6cce363ffdfb883de
+size 6394

runs/Aug16_05-08-11_ubuntu/events.out.tfevents.1755288491.ubuntu.897503.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6d8bfd385acde7be3672eec940b415d0bd600302281e35c595431c07cf50befa
+size 6394

runs/Aug16_05-10-49_ubuntu/events.out.tfevents.1755288649.ubuntu.912455.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:53e91fc2b9204c26c1341b65eac43774fcf514048409aa74db0671c2fd0ade18
+size 6394

runs/Aug16_05-13-02_ubuntu/events.out.tfevents.1755288783.ubuntu.926962.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:252a902efa2861b602dcaefd0162e74be0d468264b1b578dedd8c5bc2826e997
+size 6394

runs/Aug16_05-15-33_ubuntu/events.out.tfevents.1755288934.ubuntu.941520.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5e5b1ae2124631722ad8973e2e63f14135fa135ee58ebf6e42f14a445fb732f7
+size 9436

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a95a43474a228ffdff8870d11d6af2f6d2aa8fe281c83c3850492a6b5eb71798
 size 5496

 version https://git-lfs.github.com/spec/v1
+oid sha256:90e55d6a2722c603fe80e1892a5c0ad681bfeff6114717f4da5ec6d3cf336ce2
 size 5496
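
For `training_args.bin`, the size stays at 5496 bytes on both sides while the `oid` changes, so size alone would not reveal that the file was replaced. A downloaded file can be checked against its pointer by comparing both the byte length and the SHA-256 digest. The sketch below uses only `hashlib` from the standard library; `matches_pointer` is a hypothetical helper, and the sample bytes stand in for real file contents.

```python
import hashlib


def matches_pointer(data: bytes, expected_sha256: str, expected_size: int) -> bool:
    """True when `data` has the size and SHA-256 digest recorded in an LFS pointer."""
    if len(data) != expected_size:
        return False
    return hashlib.sha256(data).hexdigest() == expected_sha256


if __name__ == "__main__":
    # Stand-in payload; in practice `data` would be the bytes of the downloaded file.
    data = b"example payload"
    digest = hashlib.sha256(data).hexdigest()
    print(matches_pointer(data, digest, len(data)))    # same size, same digest
    print(matches_pointer(data, "0" * 64, len(data)))  # same size, different digest
```
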