HuggingFaceTB
/

finemath-ablation-finemath-infimath-3plus

Model card Files Files and versions Community

loubnabnl HF staff commited on 7 days ago

Commit

c0feb87

•

1 Parent(s): 13c1c42

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ base_model:
 ## Model summary
 This model is part of the 📐 [FineMath](https://huggingface.co/datasets/HuggingFaceTB/finemath) ablations, we continue pretraining [Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) base on different math datasets for 60B tokens.
-The model has 3.21B parameters and 4096 context length. It was trained on **160B tokens** using a mix of 40% [FineWeb-Edu](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu) and 30% FineMath-3+ and 30% InfiWebMath-3+ from the  📐 [FineMath](https://huggingface.co/datasets/HuggingFaceTB/finemath) dataset.
 - **License**: Apache-2
 - **Languages**: English

 ## Model summary
 This model is part of the 📐 [FineMath](https://huggingface.co/datasets/HuggingFaceTB/finemath) ablations, we continue pretraining [Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) base on different math datasets for 60B tokens.
+The model has 3.21B parameters and 4096 context length. It was trained on **60B tokens** using a mix of 50% FineMath-3+ and 50% InfiWebMath-3+ from the  📐 [FineMath](https://huggingface.co/datasets/HuggingFaceTB/finemath) dataset.
 - **License**: Apache-2
 - **Languages**: English