Update README.md
README.md (CHANGED)
@@ -159,11 +159,11 @@ Core model results for OLMo 2 32B are found below.
 
 ## Model Details
 
-###
+### Training
 | | **OLMo 2 32B** | **OLMo 2 13B** | **OLMo 2 7B** |
 |-------------------|------------|------------|------------|
 | Pretraining Stage 1 | 6 trillion tokens<br>(1.5 epoch) | 5 trillion tokens<br>(1.2 epochs) | 4 trillion tokens<br>(1 epoch) |
-| Pretraining Stage 2 | 100B tokens (
+| Pretraining Stage 2 | 100B tokens (3 runs)<br>300B tokens (1 run)<br>*merged* | 100B tokens (3 runs)<br>300B tokens (1 run)<br>*merged* | 50B tokens (3 runs)<br>*merged* |
 | Post-training | SFT + DPO + PPO<br>([preference mix](https://huggingface.co/datasets/allenai/olmo-2-32b-pref-mix-v1)) | SFT + DPO + PPO<br>([preference mix](https://huggingface.co/datasets/allenai/olmo-2-1124-13b-preference-mix)) | SFT + DPO + PPO<br>([preference mix](https://huggingface.co/datasets/allenai/olmo-2-1124-7b-preference-mix)) |
 
 #### Stage 1: Initial Pretraining
@@ -171,7 +171,7 @@ Core model results for OLMo 2 32B are found below.
 - Coverage: 95%+ of total pretraining budget
 - 32B Model: ~1.5 epoch
 
-#### Stage 2:
+#### Stage 2: Mid-training
 - Dataset: Dolmino-Mix-1124
 - Two training mixes:
   - 100B tokens
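
A note on reading the Stage 1 row: each cell pairs the total tokens seen with the approximate number of passes over the corpus, so tokens divided by epochs recovers the underlying corpus size. A quick check in Python, using only the numbers from the table (the dictionary below is illustrative, not part of the repo):

```python
# Sanity check of the Stage 1 cells: tokens seen / epochs ≈ corpus size.
stage1 = {
    "OLMo 2 32B": (6e12, 1.5),   # 6 trillion tokens, ~1.5 epochs
    "OLMo 2 13B": (5e12, 1.2),   # 5 trillion tokens, ~1.2 epochs
    "OLMo 2 7B":  (4e12, 1.0),   # 4 trillion tokens, ~1 epoch
}
for model, (tokens, epochs) in stage1.items():
    print(f"{model}: ~{tokens / epochs / 1e12:.1f}T-token Stage 1 corpus")
```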
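The Stage 2 row marks the per-run checkpoints as *merged*, meaning the separate mid-training runs are combined into a single model. Below is a minimal sketch of one common way to do this, element-wise weight averaging ("model souping"); the checkpoint file names are placeholders, and the actual OLMo merging tooling may differ:

```python
# Illustrative only: average several mid-training checkpoints into one "merged" model.
# File names are placeholders; this is not the project's own merging pipeline.
import torch

def average_checkpoints(paths):
    """Return the element-wise mean of the state dicts stored at `paths`."""
    merged = None
    for path in paths:
        state = torch.load(path, map_location="cpu")
        if merged is None:
            merged = {k: v.clone().float() for k, v in state.items()}
        else:
            for k, v in state.items():
                merged[k] += v.float()
    return {k: v / len(paths) for k, v in merged.items()}

# e.g. the 32B recipe above: three 100B-token runs plus one 300B-token run
merged_state = average_checkpoints([
    "stage2_100B_run1.pt",
    "stage2_100B_run2.pt",
    "stage2_100B_run3.pt",
    "stage2_300B_run1.pt",
])
torch.save(merged_state, "stage2_merged.pt")
```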