Update README.md
README.md
CHANGED
@@ -1,3 +1,7 @@
+---
+base_model:
+- meta-llama/Llama-4-Scout-17B-16E-Instruct
+---
 ## More details and evals coming soon...
 
 ## Sanity check - GSM8k eval
@@ -14,4 +18,4 @@
 |Tasks|Version| Filter |n-shot| Metric | |Value | |Stderr|
 |-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
 |gsm8k| 3|flexible-extract| 5|exact_match|↑ |0.9219|± |0.0074|
-| | |strict-match | 5|exact_match|↑ |0.9075|± |0.0080|
+| | |strict-match | 5|exact_match|↑ |0.9075|± |0.0080|
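
The Stderr column in the table can be cross-checked against the reported accuracies. The sketch below is not part of the README or of the evaluation tooling. It assumes the eval ran on the full GSM8K test split (1,319 problems) and that the standard error is that of a mean of 0/1 outcomes computed with the sample (n - 1) variance. The helper name `binomial_stderr` is hypothetical.

```python
import math

# Assumption: the eval covered the full GSM8K test split.
GSM8K_TEST_SIZE = 1319


def binomial_stderr(accuracy: float, n: int) -> float:
    """Standard error of the mean of n 0/1 outcomes, using the sample (n - 1) variance."""
    return math.sqrt(accuracy * (1.0 - accuracy) / (n - 1))


# (accuracy, reported stderr) pairs copied from the table in the diff.
reported = {
    "flexible-extract": (0.9219, 0.0074),
    "strict-match": (0.9075, 0.0080),
}

for name, (acc, stderr) in reported.items():
    estimate = binomial_stderr(acc, GSM8K_TEST_SIZE)
    print(f"{name}: reported={stderr:.4f} estimated={estimate:.4f}")
```

Under these assumptions both estimates round to the reported values at four decimal places, which suggests the table rows are internally consistent.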