English
nvidia
math

Add paper link and pipeline tag

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +8 -4
README.md CHANGED
@@ -1,20 +1,24 @@
1
  ---
2
- license: llama3.1
3
  base_model:
4
  - meta-llama/Llama-3.1-70B
5
  datasets:
6
  - nvidia/OpenMathInstruct-2
7
  language:
8
  - en
 
9
  tags:
10
  - nvidia
11
  - math
 
 
12
  ---
13
 
14
  # OpenMath2-Llama3.1-70B-nemo
15
 
16
  [NeMo](https://github.com/NVIDIA/NeMo) checkpoint for [OpenMath2-Llama3.1-70B](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B) which is obtained by finetuning [Llama3.1-70B-Base](https://huggingface.co/meta-llama/Llama-3.1-70B) with [OpenMathInstruct-2](https://huggingface.co/datasets/nvidia/OpenMathInstruct-2).
17
 
 
 
18
  The model outperforms [Llama3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) on [MATH](https://github.com/hendrycks/math) by 3.9%.
19
 
20
 
@@ -22,10 +26,10 @@ The model outperforms [Llama3.1-70B-Instruct](https://huggingface.co/meta-llama/
22
  | Model | GSM8K | MATH | AMC 2023 | AIME 2024 | Omni-MATH |
23
  |:---|:---:|:---:|:---:|:---:|:---:|
24
  | Llama3.1-8B-Instruct | 84.5 | 51.9 | 9/40 | 2/30 | 12.7 |
25
- | OpenMath2-Llama3.1-8B ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B-nemo) \| [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)) | 91.7 | 67.8 | 16/40 | 3/30 | 22.0 |
26
  | + majority@256 | 94.1 | 76.1 | 23/40 | 3/30 | 24.6 |
27
  | Llama3.1-70B-Instruct | 95.8 | 67.9 | 19/40 | 6/30 | 19.0 |
28
- | **OpenMath2-Llama3.1-70B** ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B-nemo) \| [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B)) | 94.9 | 71.9 | 20/40 | 4/30 | 23.1 |
29
  | + majority@256 | 96.0 | 79.6 | 24/40 | 6/30 | 27.6 |
30
 
31
  The pipeline we used to produce the data and models is fully open-sourced!
@@ -65,4 +69,4 @@ If you find our work useful, please consider citing us!
65
 
66
  ## Terms of use
67
 
68
- By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)
 
1
  ---
 
2
  base_model:
3
  - meta-llama/Llama-3.1-70B
4
  datasets:
5
  - nvidia/OpenMathInstruct-2
6
  language:
7
  - en
8
+ license: llama3.1
9
  tags:
10
  - nvidia
11
  - math
12
+ pipeline_tag: text-generation
13
+ library_name: nemo
14
  ---
15
 
16
  # OpenMath2-Llama3.1-70B-nemo
17
 
18
  [NeMo](https://github.com/NVIDIA/NeMo) checkpoint for [OpenMath2-Llama3.1-70B](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B) which is obtained by finetuning [Llama3.1-70B-Base](https://huggingface.co/meta-llama/Llama-3.1-70B) with [OpenMathInstruct-2](https://huggingface.co/datasets/nvidia/OpenMathInstruct-2).
19
 
20
+ This model is presented in the paper [OpenCodeReasoning: Advancing Data Distillation for Competitive Coding](https://huggingface.co/papers/2504.01943).
21
+
22
  The model outperforms [Llama3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) on [MATH](https://github.com/hendrycks/math) by 3.9%.
23
 
24
 
 
26
  | Model | GSM8K | MATH | AMC 2023 | AIME 2024 | Omni-MATH |
27
  |:---|:---:|:---:|:---:|:---:|:---:|
28
  | Llama3.1-8B-Instruct | 84.5 | 51.9 | 9/40 | 2/30 | 12.7 |
29
+ | OpenMath2-Llama3.1-8B ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B-nemo) | [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)) | 91.7 | 67.8 | 16/40 | 3/30 | 22.0 |
30
  | + majority@256 | 94.1 | 76.1 | 23/40 | 3/30 | 24.6 |
31
  | Llama3.1-70B-Instruct | 95.8 | 67.9 | 19/40 | 6/30 | 19.0 |
32
+ | **OpenMath2-Llama3.1-70B** ([nemo](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B-nemo) | [HF](https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B)) | 94.9 | 71.9 | 20/40 | 4/30 | 23.1 |
33
  | + majority@256 | 96.0 | 79.6 | 24/40 | 6/30 | 27.6 |
34
 
35
  The pipeline we used to produce the data and models is fully open-sourced!
 
69
 
70
  ## Terms of use
71
 
72
+ By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)