Update README.md
README.md
CHANGED
@@ -199,8 +199,8 @@ We select a wide variety of benchmarks that evaluate general reasoning, knowledg
 | --- | --- | --- | --- | --- | --- | --- | --- |
 | IFEval | 79.13 | 81.42 | 75.48 | 74.93 | 75.85 | 51.61 | 52.64 |
 | koIFEval | 66.58 | 54.65 | 43.30 | 36.07 | 48.55 | 26.12 | 34.22 |
-| MT-Bench |
-| KO-MT-Bench | 6.
+| MT-Bench | 7.00 | 8.15 | 7.81 | 6.32 | 7.86 | 6.76 | 6.84 |
+| KO-MT-Bench | 6.27 | 8.13 | 7.01 | 4.27 | 6.31 | 2.89 | 4.07 |
 | LogicKor | 8.14 | 9.25 | 8.33 | 6.45 | 7.99 | 1.85 | 4.76
 
 