Update README.md
Browse files
README.md
CHANGED
@@ -80,11 +80,11 @@ utilizing Llama3.1-8B as the policy model. The top two performances are marked i
|
|
80 |
| **Quality of Individual Unit Tests** | | | | |
|
81 |
| Llama3.1-8B | 60.02 | 44.97 | 13.66 | 46.13 |
|
82 |
| Llama3.1-70B | **73.65** | **70.15** | **11.10** | **34.51** |
|
83 |
-
| *
|
84 |
| **Quality of Multiple Unit Tests** | | | | |
|
85 |
| Llama3.1-8B | 74.21 | 74.35 | 20.44 | 30.55 |
|
86 |
| Llama3.1-70B | <u>78.30</u> | <u>78.76</u> | <u>17.19</u> | <u>25.97</u> |
|
87 |
-
| *
|
88 |
|
89 |
# Prompt Format
|
90 |
|
|
|
80 |
| **Quality of Individual Unit Tests** | | | | |
|
81 |
| Llama3.1-8B | 60.02 | 44.97 | 13.66 | 46.13 |
|
82 |
| Llama3.1-70B | **73.65** | **70.15** | **11.10** | **34.51** |
|
83 |
+
| *CodeRM-8B (Ours)* | <u>69.64</u> | <u>63.63</u> | <u>11.17</u> | <u>38.55</u> |
|
84 |
| **Quality of Multiple Unit Tests** | | | | |
|
85 |
| Llama3.1-8B | 74.21 | 74.35 | 20.44 | 30.55 |
|
86 |
| Llama3.1-70B | <u>78.30</u> | <u>78.76</u> | <u>17.19</u> | <u>25.97</u> |
|
87 |
+
| *CodeRM-8B (Ours)* | **80.46** | **81.27** | **16.48** | **22.71** |
|
88 |
|
89 |
# Prompt Format
|
90 |
|