areddyyt/printinx-alpha

Files changed (4) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.1078
 ## Model description
@@ -51,16 +51,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.2525        | 0.97  | 27   | 4.8706          |
-| 3.3404        | 1.98  | 55   | 4.4729          |
-| 2.5966        | 2.99  | 83   | 4.3761          |
-| 2.5016        | 4.0   | 111  | 4.3981          |
-| 2.3125        | 4.97  | 138  | 4.3562          |
-| 2.0914        | 5.98  | 166  | 4.5522          |
-| 2.0154        | 6.99  | 194  | 4.6678          |
-| 1.9274        | 8.0   | 222  | 4.8387          |
-| 1.8977        | 8.97  | 249  | 5.0202          |
-| 1.7132        | 9.73  | 270  | 5.1078          |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.1872
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 6.0824        | 0.67  | 1    | 3.2835          |
+| 2.933         | 2.0   | 3    | 3.2615          |
+| 5.8899        | 2.67  | 4    | 3.2449          |
+| 2.8651        | 4.0   | 6    | 3.2135          |
+| 5.4133        | 4.67  | 7    | 3.2023          |
+| 2.8622        | 6.0   | 9    | 3.1931          |
+| 4.0023        | 6.67  | 10   | 3.1872          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "alpha_pattern": {},
   "auto_mapping": null,
-  "base_model_name_or_path": null,
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,

 {
   "alpha_pattern": {},
   "auto_mapping": null,
+  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3bccc7f536b1788b3bed4a7112ee2b90d05d78d418d960b9dc2392408ad46c33
-size 8398144

 version https://git-lfs.github.com/spec/v1
+oid sha256:5f063bede8aabd4cc179c7ccdbe828275b2553ae9a0a6f628df851f711c21614
+size 8397056

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4fcb37c27149a211bd216a3863d634d87870f5deef90dab27f30ea3c4276e47e
 size 4856

 version https://git-lfs.github.com/spec/v1
+oid sha256:4d8c20c8ee46037739db363d10788e15ae659269d80b536f45f71ae03bb1a380
 size 4856