juyoung-trl committed on
Commit 953e0cb · verified · 1 Parent(s): 7d799a7

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -199,8 +199,8 @@ We select a wide variety of benchmarks that evaluate general reasoning, knowledg
  | --- | --- | --- | --- | --- | --- | --- | --- |
  | IFEval | 79.13 | 81.42 | 75.48 | 74.93 | 75.85 | 51.61 | 52.64 |
  | koIFEval | 66.58 | 54.65 | 43.30 | 36.07 | 48.55 | 26.12 | 34.22 |
- | MT-Bench | 6.53 | 6.75 | - | 6.32 | 7.86 | 6.76 | 6.84 |
- | KO-MT-Bench | 6.21 | 6.70 | - | 4.27 | 6.47 | 5.57 | 4.59 |
+ | MT-Bench | 7.00 | 8.15 | 7.81 | 6.32 | 7.86 | 6.76 | 6.84 |
+ | KO-MT-Bench | 6.27 | 8.13 | 7.01 | 4.27 | 6.31 | 2.89 | 4.07 |
  | LogicKor | 8.14 | 9.25 | 8.33 | 6.45 | 7.99 | 1.85 | 4.76
 
 