WizardLMTeam
/

WizardLM-70B-V1.0

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Adding Evaluation Results

#19

by leaderboard-pr-bot - opened Nov 17, 2023

base: refs/heads/main

←

from: refs/pr/19

Discussion Files changed

Files changed (1) hide show

README.md +14 -0

README.md CHANGED Viewed

@@ -107,3 +107,17 @@ Despite this, we have still worked hard to obtain opening the weights of the mod
 Our researchers have no authority to publicly release them without authorization.
 Thank you for your understanding.

 Our researchers have no authority to publicly release them without authorization.
 Thank you for your understanding.
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_WizardLM__WizardLM-70B-V1.0)
+| Metric                | Value                     |
+|-----------------------|---------------------------|
+| Avg.                  | 57.17   |
+| ARC (25-shot)         | 65.44          |
+| HellaSwag (10-shot)   | 84.41    |
+| MMLU (5-shot)         | 64.05         |
+| TruthfulQA (0-shot)   | 54.81   |
+| Winogrande (5-shot)   | 80.82   |
+| GSM8K (5-shot)        | 17.97        |
+| DROP (3-shot)         | 32.71         |