Update README.md
Browse files
README.md
CHANGED
@@ -20,12 +20,12 @@ This document presents the evaluation results of `DeepSeek-R1-Distill-Llama-70B`
|
|
20 |
|
21 |
## 📊 Evaluation Summary
|
22 |
|
23 |
-
| **Metric** | **Value** | **Description** |
|
24 |
-
|
25 |
-
| **Accuracy (acc,none)** | `21.2%` | Raw accuracy - percentage of correct answers. |
|
26 |
-
| **Standard Error (acc_stderr,none)** | `1.19%` | Uncertainty in the accuracy estimate. |
|
27 |
-
| **Normalized Accuracy (acc_norm,none)** | `25.4%` | Accuracy after dataset-specific normalization. |
|
28 |
-
| **Standard Error (acc_norm_stderr,none)** | `1.27%` | Uncertainty for normalized accuracy. |
|
29 |
|
30 |
📌 **Interpretation:**
|
31 |
- The model correctly answered **21.2% of the questions**.
|
|
|
20 |
|
21 |
## 📊 Evaluation Summary
|
22 |
|
23 |
+
| **Metric** | **Value** | **Description** | **8bit-version* |
|
24 |
+
|----------------------|-----------|-----------------|-----------|
|
25 |
+
| **Accuracy (acc,none)** | `21.2%` | Raw accuracy - percentage of correct answers. | `21.2%` |
|
26 |
+
| **Standard Error (acc_stderr,none)** | `1.19%` | Uncertainty in the accuracy estimate. | `1.2%` |
|
27 |
+
| **Normalized Accuracy (acc_norm,none)** | `25.4%` | Accuracy after dataset-specific normalization. | `25.2%` |
|
28 |
+
| **Standard Error (acc_norm_stderr,none)** | `1.27%` | Uncertainty for normalized accuracy. | `1.3%` |
|
29 |
|
30 |
📌 **Interpretation:**
|
31 |
- The model correctly answered **21.2% of the questions**.
|