nrshoudi commited on
Commit
187f9f2
·
1 Parent(s): e06f3cd

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.2209
19
 
20
  ## Model description
21
 
@@ -35,8 +35,8 @@ More information needed
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 0.001
38
- - train_batch_size: 6
39
- - eval_batch_size: 6
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
@@ -48,16 +48,16 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
- | 0.0671 | 1.0 | 546 | 0.2502 |
52
- | 0.0768 | 2.0 | 1092 | 0.2005 |
53
- | 0.0426 | 3.0 | 1638 | 0.2127 |
54
- | 0.04 | 4.0 | 2184 | 0.2028 |
55
- | 0.0241 | 5.0 | 2730 | 0.1957 |
56
- | 0.0168 | 6.0 | 3276 | 0.1985 |
57
- | 0.0154 | 7.0 | 3822 | 0.1980 |
58
- | 0.0053 | 8.0 | 4368 | 0.2109 |
59
- | 0.0058 | 9.0 | 4914 | 0.2133 |
60
- | 0.0014 | 10.0 | 5460 | 0.2209 |
61
 
62
 
63
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.2165
19
 
20
  ## Model description
21
 
 
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 0.001
38
+ - train_batch_size: 8
39
+ - eval_batch_size: 8
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
 
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
+ | 0.0978 | 1.0 | 205 | 0.2293 |
52
+ | 0.0401 | 2.0 | 410 | 0.1967 |
53
+ | 0.0265 | 3.0 | 615 | 0.1954 |
54
+ | 0.0293 | 4.0 | 820 | 0.2346 |
55
+ | 0.0122 | 5.0 | 1025 | 0.2172 |
56
+ | 0.0117 | 6.0 | 1230 | 0.2048 |
57
+ | 0.0057 | 7.0 | 1435 | 0.2086 |
58
+ | 0.0034 | 8.0 | 1640 | 0.2014 |
59
+ | 0.0011 | 9.0 | 1845 | 0.2135 |
60
+ | 0.0003 | 10.0 | 2050 | 0.2165 |
61
 
62
 
63
  ### Framework versions
adapter_config.json CHANGED
@@ -11,16 +11,19 @@
11
  "init_lora_weights": true,
12
  "layers_pattern": null,
13
  "layers_to_transform": null,
 
14
  "lora_alpha": 64,
15
  "lora_dropout": 0.05,
 
 
16
  "modules_to_save": null,
17
  "peft_type": "LORA",
18
  "r": 32,
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
- "v_proj",
23
- "q_proj"
24
  ],
25
  "task_type": null
26
  }
 
11
  "init_lora_weights": true,
12
  "layers_pattern": null,
13
  "layers_to_transform": null,
14
+ "loftq_config": {},
15
  "lora_alpha": 64,
16
  "lora_dropout": 0.05,
17
+ "megatron_config": null,
18
+ "megatron_core": "megatron.core",
19
  "modules_to_save": null,
20
  "peft_type": "LORA",
21
  "r": 32,
22
  "rank_pattern": {},
23
  "revision": null,
24
  "target_modules": [
25
+ "q_proj",
26
+ "v_proj"
27
  ],
28
  "task_type": null
29
  }
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1fa67dcb6bd2c2413414654b0e89f817ce22b50d1900feb1ff7570457005480d
3
  size 37789960
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:81d06348c7641884794583d94ff502faabb1326e8775a8739cc66567b5eb7c25
3
  size 37789960
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cb53f704c853cd4ac52ab941e94c8254843b3b9ffea227af502ac9457a966591
3
  size 4792
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f7ee48ee4e3653fcb2371e6a3f574ad6c06474c1864132b11761d69e867373c9
3
  size 4792