Update README.md
Browse files
README.md
CHANGED
@@ -45,13 +45,13 @@ The code and data for reproducing our results can be found in our [github repo](
|
|
45 |
|
46 |
| **Models** | **AIME24 avg@32** | **AIME25 avg@32** | **Minerva Math avg@4** | **Olympiad Bench avg@4** | **AMC23 avg@8** |
|
47 |
| --- | --- | --- | --- | --- | --- |
|
48 |
-
|
|
49 |
-
|
|
50 |
-
|
|
51 |
| **`POLARIS-7B-Preview`** | **72.6** | **52.6** | 40.2 | **65.4** | **89.0** |
|
52 |
-
|
|
53 |
-
|
|
54 |
-
|
|
55 |
| **`POLARIS-4B-Preview`** | **81.2** | **79.4** | **44.0** | **69.1** | **94.8** |
|
56 |
|
57 |
## Acknowledgements
|
|
|
45 |
|
46 |
| **Models** | **AIME24 avg@32** | **AIME25 avg@32** | **Minerva Math avg@4** | **Olympiad Bench avg@4** | **AMC23 avg@8** |
|
47 |
| --- | --- | --- | --- | --- | --- |
|
48 |
+
| Deepseek-R1-Distill-Qwen-7B | 55.0 | 39.7 | 36.7 | 56.8 | 81.9 |
|
49 |
+
| AReal-boba-RL-7B | 61.9 | 48.3 | 39.5 | 61.9 | 86.4 |
|
50 |
+
| Skywork-OR1-7B-Math | 69.8 | 52.3 | **40.8** | 63.2 | 85.3 |
|
51 |
| **`POLARIS-7B-Preview`** | **72.6** | **52.6** | 40.2 | **65.4** | **89.0** |
|
52 |
+
| Deepseek-R1-Distill-Qwen-32B | 72.6 | 54.9 | 42.1 | 59.4 | 84.3 |
|
53 |
+
| qwen3-32B | 81.4 | 72.9 | 44.2 | 66.7 | 92.4 |
|
54 |
+
| qwen3-4B | 73.8 | 65.6 | 43.6 | 62.2 | 87.2 |
|
55 |
| **`POLARIS-4B-Preview`** | **81.2** | **79.4** | **44.0** | **69.1** | **94.8** |
|
56 |
|
57 |
## Acknowledgements
|