nikniksen/tmjgpt-ft_v2

Files changed (5) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.6001
 ## Model description
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.0965        | 1.0   | 1    | 4.2308          |
-| 1.0965        | 2.0   | 2    | 4.1805          |
-| 1.0598        | 3.0   | 3    | 4.0628          |
-| 0.9855        | 4.0   | 4    | 3.9432          |
-| 0.9225        | 5.0   | 5    | 3.8365          |
-| 0.8697        | 6.0   | 6    | 3.7503          |
-| 0.834         | 7.0   | 7    | 3.6862          |
-| 0.8089        | 8.0   | 8    | 3.6415          |
-| 0.7915        | 9.0   | 9    | 3.6136          |
-| 0.7801        | 10.0  | 10   | 3.6001          |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.3234
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.9423        | 1.0   | 1    | 3.9858          |
+| 1.9404        | 2.0   | 2    | 3.9314          |
+| 1.974         | 3.0   | 3    | 3.8068          |
+| 1.8247        | 4.0   | 4    | 3.6860          |
+| 1.7804        | 5.0   | 5    | 3.5791          |
+| 1.6929        | 6.0   | 6    | 3.4900          |
+| 1.6617        | 7.0   | 7    | 3.4212          |
+| 1.5972        | 8.0   | 8    | 3.3715          |
+| 1.5936        | 9.0   | 9    | 3.3391          |
+| 1.5556        | 10.0  | 10   | 3.3234          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "alpha_pattern": {},
   "auto_mapping": null,
-  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,

 {
   "alpha_pattern": {},
   "auto_mapping": null,
+  "base_model_name_or_path": null,
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bc124ea76613c67eb146acb768344d64be4559265893317200bd111b234afe2c
-size 8397056

 version https://git-lfs.github.com/spec/v1
+oid sha256:73b46bd249ad6aa13afe65bed1fb6e4136a1fe20a96912644200fc842f816fa5
+size 8398144

runs/Mar16_21-17-20_2ce4a61b58cb/events.out.tfevents.1710623845.2ce4a61b58cb.459.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:3d12ea1853e33e53d010633074180b54849a09953b19c2ea444d9bbdc238d1ae
+size 10283

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60d3a37ed0fccd6452738c94666b0ccc8781ccf6e6210802d203fa0c0090caf6
 size 4856

 version https://git-lfs.github.com/spec/v1
+oid sha256:1575bd5a612d0247c56162721f5b47b655da2ddc4c136e3b52bcd2c34ce4ac79
 size 4856