rwmasood commited on
Commit
a87aa26
·
verified ·
1 Parent(s): 98a1b61

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -20,12 +20,12 @@ This document presents the evaluation results of `DeepSeek-R1-Distill-Llama-70B`
20
 
21
  ## 📊 Evaluation Summary
22
 
23
- | **Metric** | **Value** | **Description** |
24
- |----------------------|-----------|-----------------|
25
- | **Accuracy (acc,none)** | `21.2%` | Raw accuracy - percentage of correct answers. |
26
- | **Standard Error (acc_stderr,none)** | `1.19%` | Uncertainty in the accuracy estimate. |
27
- | **Normalized Accuracy (acc_norm,none)** | `25.4%` | Accuracy after dataset-specific normalization. |
28
- | **Standard Error (acc_norm_stderr,none)** | `1.27%` | Uncertainty for normalized accuracy. |
29
 
30
  📌 **Interpretation:**
31
  - The model correctly answered **21.2% of the questions**.
 
20
 
21
  ## 📊 Evaluation Summary
22
 
23
+ | **Metric** | **Value** | **Description** | **8bit-version* |
24
+ |----------------------|-----------|-----------------|-----------|
25
+ | **Accuracy (acc,none)** | `21.2%` | Raw accuracy - percentage of correct answers. | `21.2%` |
26
+ | **Standard Error (acc_stderr,none)** | `1.19%` | Uncertainty in the accuracy estimate. | `1.2%` |
27
+ | **Normalized Accuracy (acc_norm,none)** | `25.4%` | Accuracy after dataset-specific normalization. | `25.2%` |
28
+ | **Standard Error (acc_norm_stderr,none)** | `1.27%` | Uncertainty for normalized accuracy. | `1.3%` |
29
 
30
  📌 **Interpretation:**
31
  - The model correctly answered **21.2% of the questions**.