marcus2000 committed on
Commit d1920eb
1 Parent(s): 6428a1c

Saiga_timelist_task30steps

README.md CHANGED
@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 # Saiga_timelist_task30steps
 
-This model is a fine-tuned version of [TheBloke/Llama-2-7B-fp16](https://huggingface.co/TheBloke/Llama-2-7B-fp16) on the None dataset.
+This model is a fine-tuned version of [TheBloke/Llama-2-7B-fp16](https://huggingface.co/TheBloke/Llama-2-7B-fp16) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.7980
+- Loss: 2.0384
 
 ## Model description
 
@@ -34,7 +34,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 5e-06
+- learning_rate: 0.0003
 - train_batch_size: 2
 - eval_batch_size: 8
 - seed: 42
@@ -48,21 +48,21 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.5376 | 0.05 | 2 | 1.7979 |
-| 1.6041 | 0.1 | 4 | 1.7978 |
-| 1.6858 | 0.15 | 6 | 1.7979 |
-| 1.7722 | 0.21 | 8 | 1.7979 |
-| 1.6524 | 0.26 | 10 | 1.7979 |
-| 1.6374 | 0.31 | 12 | 1.7979 |
-| 1.7269 | 0.36 | 14 | 1.7979 |
-| 1.7028 | 0.41 | 16 | 1.7978 |
-| 1.6422 | 0.46 | 18 | 1.7980 |
-| 1.666 | 0.52 | 20 | 1.7979 |
-| 1.5679 | 0.57 | 22 | 1.7977 |
-| 1.7614 | 0.62 | 24 | 1.7978 |
-| 1.7053 | 0.67 | 26 | 1.7978 |
-| 1.7438 | 0.72 | 28 | 1.7981 |
-| 1.8348 | 0.77 | 30 | 1.7980 |
+| 2.2298 | 0.37 | 2 | 2.2027 |
+| 2.0986 | 0.74 | 4 | 2.1505 |
+| 2.0278 | 1.11 | 6 | 2.1167 |
+| 1.9954 | 1.48 | 8 | 2.0915 |
+| 1.9696 | 1.85 | 10 | 2.0753 |
+| 1.8978 | 2.22 | 12 | 2.0648 |
+| 1.9246 | 2.59 | 14 | 2.0564 |
+| 1.9361 | 2.96 | 16 | 2.0506 |
+| 1.895 | 3.33 | 18 | 2.0470 |
+| 1.8525 | 3.7 | 20 | 2.0442 |
+| 1.8912 | 4.07 | 22 | 2.0419 |
+| 1.8689 | 4.44 | 24 | 2.0400 |
+| 1.9054 | 4.81 | 26 | 2.0390 |
+| 1.8537 | 5.19 | 28 | 2.0384 |
+| 1.8501 | 5.56 | 30 | 2.0384 |
 
 
 ### Framework versions
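
Reviewer note on the README.md hunks: the short sketch below is one way to summarize the two result tables. The validation losses and epoch values are copied from the diff; the variable names and the `total_drop` helper are illustrative assumptions and are not part of this repository.

```python
# Validation losses copied from the README tables in this diff (steps 2..30).
old_val = [1.7979, 1.7978, 1.7979, 1.7979, 1.7979, 1.7979, 1.7979, 1.7978,
           1.7980, 1.7979, 1.7977, 1.7978, 1.7978, 1.7981, 1.7980]
new_val = [2.2027, 2.1505, 2.1167, 2.0915, 2.0753, 2.0648, 2.0564, 2.0506,
           2.0470, 2.0442, 2.0419, 2.0400, 2.0390, 2.0384, 2.0384]


def total_drop(losses):
    """Return first-minus-last validation loss (positive means improvement)."""
    return round(losses[0] - losses[-1], 4)


old_drop = total_drop(old_val)  # run with learning_rate 5e-06
new_drop = total_drop(new_val)  # run with learning_rate 0.0003

# Approximate optimizer steps per epoch, from the last table row of each run
# (step 30 at epoch 0.77 before the change, epoch 5.56 after it).
old_steps_per_epoch = 30 / 0.77
new_steps_per_epoch = 30 / 5.56

print(old_drop, new_drop)
print(round(old_steps_per_epoch, 1), round(new_steps_per_epoch, 1))
```

The earlier run's validation loss is essentially flat across all 30 steps, while the new run decreases steadily. The steps-per-epoch figures differ by a large factor, which suggests the two runs may not have used the same data or batching, so the final losses (1.7980 and 2.0384) are probably not directly comparable.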
adapter_config.json CHANGED
@@ -21,9 +21,9 @@
   "revision": null,
   "target_modules": [
     "o_proj",
-    "k_proj",
+    "q_proj",
     "v_proj",
-    "q_proj"
+    "k_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:706300dab15685f077ec2a32aa52b000cf1e4f9281cbc0a124774769968310b6
+oid sha256:905f7fab7e97ee8ddf5c406f8ff2257c6890904c582dac61689d1d8e77fb8ad5
 size 33589040
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6d2f624a4fd88edd6b5b092f7fb0daca5979f8bec9c46bdcdc7bd6a997f4d72e
+oid sha256:ac6721a4fca0bd686609a49d3b62193266d9905bcff2db6a2087bd7c0a8b5e46
 size 4920
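
Reviewer note on the adapter_config.json hunk: the change there only swaps the positions of `"k_proj"` and `"q_proj"` in `target_modules`. Assuming the list is consumed as an unordered collection of module names (my understanding of how PEFT treats it, though nothing in this diff states that), the sketch below checks that both sides name the same modules. The variable names are illustrative.

```python
# target_modules lists copied from the two sides of the adapter_config.json hunk.
old_modules = ["o_proj", "k_proj", "v_proj", "q_proj"]
new_modules = ["o_proj", "q_proj", "v_proj", "k_proj"]

# Compare as sets so that ordering differences are ignored.
same_modules = set(old_modules) == set(new_modules)
print(same_modules)
```

Under that assumption the config change is cosmetic, and the substantive differences in this commit are the retrained weights in adapter_model.safetensors, the updated training_args.bin, and the README metrics.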