Which checkpoint (or soup) corresponds to the final base model?

by Proyag - opened May 21

May 21

The model card for OLMo-2-0425-1B says there was no model merging, and there was only 1 run on 50B tokens. But looking at the checkpoints, there are checkpoints stage2-ingredient1-step23852-tokens51B, stage2-ingredient2-step23852-tokens51B, and stage2-ingredient3-step23852-tokens51B. How do these correspond to the final released model? I think it's probably a model soup of the three and the model card is misleading, but just wanted to check. Thanks!

amanrangapur

Ai2 org May 21

Hey @Proyag , great question, You’re right that the checkpoints mention multiple ingredients, but to clarify: there was no model merging or model soup involved in the final release. The OLMo-2-0425-1B model was trained in a single run to 50B tokens. We tried different recipes across ingredients 2 and 3, but these were independent exploratory runs, not merged into the final model. The released final checkpoint corresponds to ingredient 1, selected based on performance.

amanrangapur changed discussion status to closed May 21

Proyag

May 21

Ah ok, thanks for your response!

amanrangapur

Ai2 org May 28

Hey @Proyag , the released final checkpoint corresponds to ingredient 3, not ingredient 1. I am sorry for the confusion and mistake in previous response. I will add it to readme.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment