thiomajid commited on
Commit
e54980e
·
verified ·
1 Parent(s): 12485e0

Completed epoch 4

Browse files
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 4.7790
18
 
19
  ## Model description
20
 
@@ -49,17 +49,17 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:------:|:----:|:---------------:|
52
- | 26.9686 | 0.32 | 100 | 5.6473 |
53
- | 20.7043 | 0.64 | 200 | 5.1898 |
54
- | 19.3176 | 0.96 | 300 | 4.9661 |
55
- | 18.0682 | 1.2784 | 400 | 4.8781 |
56
- | 17.5367 | 1.5984 | 500 | 4.8027 |
57
- | 17.3323 | 1.9184 | 600 | 4.7790 |
58
 
59
 
60
  ### Framework versions
61
 
62
- - Transformers 4.47.1
63
  - Pytorch 2.5.1+cu124
64
  - Datasets 3.2.0
65
  - Tokenizers 0.21.0
 
14
 
15
  This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 4.6578
18
 
19
  ## Model description
20
 
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:------:|:----:|:---------------:|
52
+ | 17.5854 | 0.32 | 100 | 4.8255 |
53
+ | 16.9758 | 0.64 | 200 | 4.7656 |
54
+ | 16.4608 | 0.96 | 300 | 4.6877 |
55
+ | 14.761 | 1.2784 | 400 | 4.7266 |
56
+ | 14.5481 | 1.5984 | 500 | 4.6778 |
57
+ | 14.7485 | 1.9184 | 600 | 4.6578 |
58
 
59
 
60
  ### Framework versions
61
 
62
+ - Transformers 4.48.2
63
  - Pytorch 2.5.1+cu124
64
  - Datasets 3.2.0
65
  - Tokenizers 0.21.0
config.json CHANGED
@@ -5,7 +5,7 @@
5
  "model_type": "xlstm",
6
  "pad_token_id": 151643,
7
  "torch_dtype": "float32",
8
- "transformers_version": "4.47.1",
9
  "xlstm_cfg": {
10
  "_block_map": "1,0,1,0,1,0,1,0,1,0,1,0,1,0",
11
  "add_embedding_dropout": false,
 
5
  "model_type": "xlstm",
6
  "pad_token_id": 151643,
7
  "torch_dtype": "float32",
8
+ "transformers_version": "4.48.2",
9
  "xlstm_cfg": {
10
  "_block_map": "1,0,1,0,1,0,1,0,1,0,1,0,1,0",
11
  "add_embedding_dropout": false,
events.out.tfevents.1738959286.ba07c9a9650d.3222.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c4c95a5f4c0f93bc9847f6fab06dc322a5c85c3e96f8a613beab4d2805964fc7
3
+ size 1035358
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9765ccdb2e84cb04be86e9949bbf8200ba92af8a437e8ae588f63bc236ffec5c
3
  size 2661593144
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:68cf9fe198fbb61b277bf4c504e9e5256b61c256c3efc82aa7e0e8b60d20453a
3
  size 2661593144