Update README.md
Browse files
README.md
CHANGED
|
@@ -57,7 +57,7 @@ Our average performance for BigBench-Hard: 0.488
|
|
| 57 |
Average for AGIEval: 0.447
|
| 58 |
|
| 59 |
In the Orca paper, they measured their score relative to Vicuna on these evals.
|
| 60 |
-
We have done the same and have found our score averages to **~103%** of the total
|
| 61 |
|
| 62 |
So we are surpassing Orca performance with <20% of the dataset size and <1/10th the training budget!
|
| 63 |
|
|
|
|
| 57 |
Average for AGIEval: 0.447
|
| 58 |
|
| 59 |
In the Orca paper, they measured their score relative to Vicuna on these evals.
|
| 60 |
+
We have done the same and have found our score averages to **~103%** of the total performance that was shown in the Orca paper, using the same evaluation methods as outlined in the paper.
|
| 61 |
|
| 62 |
So we are surpassing Orca performance with <20% of the dataset size and <1/10th the training budget!
|
| 63 |
|