Update README.md
Browse files
README.md
CHANGED
@@ -22,7 +22,7 @@ This model is ready for commercial use.
|
|
22 |
|
23 |
OpenMath-Nemotron models achieve state-of-the-art results on popular mathematical benchmarks. We present metrics as pass@1 (maj@64) where pass@1
|
24 |
is an average accuracy across 64 generations and maj@64 is the result of majority voting.
|
25 |
-
Please see our [paper](
|
26 |
|
27 |
| Model | AIME24 | AIME25 | HMMT-24-25 | HLE-Math |
|
28 |
|-------------------------------|-----------------|-------|-------|-------------|
|
@@ -106,16 +106,14 @@ Please note that these models have not been instruction tuned on general data an
|
|
106 |
|
107 |
If you find our work useful, please consider citing us!
|
108 |
|
109 |
-
|
110 |
-
|
111 |
-
|
112 |
-
|
113 |
-
|
114 |
-
|
115 |
-
year = {2024},
|
116 |
-
journal = {arXiv preprint arXiv:2410.01560}
|
117 |
}
|
118 |
-
```
|
119 |
|
120 |
## Additional information
|
121 |
|
|
|
22 |
|
23 |
OpenMath-Nemotron models achieve state-of-the-art results on popular mathematical benchmarks. We present metrics as pass@1 (maj@64) where pass@1
|
24 |
is an average accuracy across 64 generations and maj@64 is the result of majority voting.
|
25 |
+
Please see our [paper](https://github.com/NVIDIA/NeMo-Skills/blob/main/recipes/openmathreasoning.pdf) for more details on the evaluation setup.
|
26 |
|
27 |
| Model | AIME24 | AIME25 | HMMT-24-25 | HLE-Math |
|
28 |
|-------------------------------|-----------------|-------|-------|-------------|
|
|
|
106 |
|
107 |
If you find our work useful, please consider citing us!
|
108 |
|
109 |
+
```bibtex
|
110 |
+
@article{moshkov2025aimo2,
|
111 |
+
title = {AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset},
|
112 |
+
author = {Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
|
113 |
+
year = {2025},
|
114 |
+
journal = {arXiv preprint arXiv:TBD}
|
|
|
|
|
115 |
}
|
116 |
+
```
|
117 |
|
118 |
## Additional information
|
119 |
|