llama3_helpful_rm_full

Files changed (6) hide show

README.md CHANGED Viewed

@@ -4,6 +4,8 @@ license: llama3.1
 base_model: meta-llama/Llama-3.1-8B-Instruct
 tags:
 - generated_from_trainer
 model-index:
 - name: llama3_helpful_rm_full
   results: []
@@ -15,6 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 # llama3_helpful_rm_full
 This model is a fine-tuned version of [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) on an unknown dataset.
 ## Model description
@@ -49,6 +54,10 @@ The following hyperparameters were used during training:
 ### Training results
 ### Framework versions

 base_model: meta-llama/Llama-3.1-8B-Instruct
 tags:
 - generated_from_trainer
+metrics:
+- accuracy
 model-index:
 - name: llama3_helpful_rm_full
   results: []
 # llama3_helpful_rm_full
 This model is a fine-tuned version of [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1843
+- Accuracy: 0.926
 ## Model description
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Accuracy |
+|:-------------:|:------:|:----:|:---------------:|:--------:|
+| 0.1996        | 0.4320 | 50   | 0.2091          | 0.91     |
+| 0.1942        | 0.8639 | 100  | 0.1843          | 0.926    |
 ### Framework versions

model-00001-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:87a4e3024a296ead899b177b283e26d58955afdf5c98181f918f78a35234755d
 size 4976706864

 version https://git-lfs.github.com/spec/v1
+oid sha256:1f2029f18b633aab2a7e6f4d63decc7ef1ccb98bb55530e2467e280c1eb2dbc7
 size 4976706864

model-00002-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c109be1159bc1c7676ed2f8a2de01d8b9306cda11472cf256030a2082f051b9a
 size 4999802720

 version https://git-lfs.github.com/spec/v1
+oid sha256:e84c96fb1f6a5d415c8dd572c572c0675641ae800af7a4fc49cf39277c3bfb81
 size 4999802720

model-00003-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c88a792a61cb77c7ccd2954f19c97fca1d425a5477fe3892d56e082321d69374
 size 4915916176

 version https://git-lfs.github.com/spec/v1
+oid sha256:829bc0adb09dffdc0007524907678a48efb5e997535ae0e3e9b83a8615b33894
 size 4915916176

model-00004-of-00004.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ef1f4d4403f3a2b6c4f6e2a1174c8d071885c92c11128325926eca54cecb5e73
 size 117473824

 version https://git-lfs.github.com/spec/v1
+oid sha256:51909bcb07a87b298b291c9ca503fb60c2986013428b8ac50747c387f6fcf3e5
 size 117473824

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d575925a3facda7e9d5b723976d17eceef19401bd8be1c7689a33008749d67e1
 size 7032

 version https://git-lfs.github.com/spec/v1
+oid sha256:d5cce04cff6a313a3b37022dad25432bc9b18fb55335e89b938cda54d46670ac
 size 7032