Commit 1814bcf (verified) by juyoung-trl · 1 parent: ae1d6de

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -202,8 +202,8 @@ We select a wide variety of benchmarks that evaluate general reasoning, knowledg
  | --- | --- | --- | --- | --- | --- | --- | --- |
  | IFEval | 79.13 | 81.42 | 75.48 | 74.93 | 75.85 | 51.61 | 52.64 |
  | koIFEval | 66.58 | 54.65 | 43.30 | 36.07 | 48.55 | 26.12 | 34.22 |
- | MT-Bench | 6.53 | 6.75 | - | 6.32 | 7.86 | 6.76 | 6.84 |
- | KO-MT-Bench | 6.21 | 6.70 | - | 4.27 | 6.47 | 5.57 | 4.59 |
+ | MT-Bench | 7.00 | 8.15 | 7.81 | 6.32 | 7.86 | 6.76 | 6.84 |
+ | KO-MT-Bench | 6.27 | 8.13 | 7.01 | 4.27 | 6.31 | 2.89 | 4.07 |
  | LogicKor | 8.14 | 9.25 | 8.33 | 6.45 | 7.99 | 1.85 | 4.76
  
  