hussenmi/fungpt-ft

Files changed (3)

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7846
+- Loss: 3.1043
 ## Model description
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.1532        | 0.91  | 8    | 2.2250          |
-| 1.6022        | 1.94  | 17   | 1.1908          |
-| 0.9936        | 2.97  | 26   | 0.9022          |
-| 0.8545        | 4.0   | 35   | 0.8480          |
-| 0.9068        | 4.91  | 43   | 0.8219          |
-| 0.77          | 5.94  | 52   | 0.8036          |
-| 0.7446        | 6.97  | 61   | 0.7887          |
-| 0.7274        | 8.0   | 70   | 0.7854          |
-| 0.8093        | 8.91  | 78   | 0.7847          |
-| 0.4664        | 9.14  | 80   | 0.7846          |
+| 5.2519        | 1.0   | 15   | 3.8834          |
+| 3.6084        | 2.0   | 30   | 3.4631          |
+| 3.1807        | 3.0   | 45   | 3.1895          |
+| 2.921         | 4.0   | 60   | 3.1170          |
+| 2.8287        | 5.0   | 75   | 3.0935          |
+| 2.7328        | 6.0   | 90   | 3.0807          |
+| 2.6611        | 7.0   | 105  | 3.0841          |
+| 2.6403        | 8.0   | 120  | 3.0969          |
+| 2.5925        | 9.0   | 135  | 3.1010          |
+| 2.55          | 10.0  | 150  | 3.1043          |
 ### Framework versions
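
In the added training-results rows, the validation loss does not fall monotonically. A minimal sketch for finding where it bottoms out, with the added rows copied in by hand (the names `ROWS` and `best_row` are illustrative and not part of the repository):

```python
# Rows copied from the added (+) lines of the training-results table:
# (training_loss, epoch, step, validation_loss)
ROWS = [
    (5.2519, 1.0, 15, 3.8834),
    (3.6084, 2.0, 30, 3.4631),
    (3.1807, 3.0, 45, 3.1895),
    (2.921, 4.0, 60, 3.1170),
    (2.8287, 5.0, 75, 3.0935),
    (2.7328, 6.0, 90, 3.0807),
    (2.6611, 7.0, 105, 3.0841),
    (2.6403, 8.0, 120, 3.0969),
    (2.5925, 9.0, 135, 3.1010),
    (2.55, 10.0, 150, 3.1043),
]


def best_row(rows):
    """Return the row with the lowest validation loss (last tuple field)."""
    return min(rows, key=lambda row: row[3])


best = best_row(ROWS)
print(f"lowest validation loss {best[3]} at epoch {best[1]} (step {best[2]})")
# prints: lowest validation loss 3.0807 at epoch 6.0 (step 90)
```

With these numbers, validation loss rises slightly after epoch 6 while training loss keeps falling, so the reported final loss of 3.1043 is not the lowest value seen during the run.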

runs/Mar12_22-09-50_0dce101895ae/events.out.tfevents.1710281393.0dce101895ae.1033.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:3cdeb3a2caf2d72d5c58ec9bbba95beba9f0463234b8fa3eff1d328b6451fe99
+size 10307

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5866dd457f8178e91317497b0e5b4aa9f8f418c993d8d5abb795b01d0cb78ca8
+oid sha256:54a8982d626e2457d1e0d0fad76fc1673cfbad668e0a09ff83bbfb276f217201
 size 4856
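
The Git LFS pointers in this diff record only a SHA-256 digest and a byte size, so the size can stay at 4856 while the content changes. A minimal sketch for checking downloaded bytes against a pointer, assuming the pointer text is available as a string (the helper names `parse_pointer` and `matches_pointer` are hypothetical, not part of Git LFS or this repository):

```python
import hashlib


def parse_pointer(text):
    """Parse Git LFS pointer text into a dict of its space-separated key/value lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(data, pointer_text):
    """Return True if `data` has the size and SHA-256 digest named in the pointer."""
    fields = parse_pointer(pointer_text)
    algorithm, _, digest = fields["oid"].partition(":")
    if algorithm != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algorithm}")
    same_size = len(data) == int(fields["size"])
    same_hash = hashlib.sha256(data).hexdigest() == digest
    return same_size and same_hash
```

To use it, read the real file in binary mode and pass its bytes together with the pointer text, for example the three-line pointer shown for `training_args.bin`.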