OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_31_bs4M_seq8k_20B


No model card


Collection including OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_31_bs4M_seq8k_20B

Mid-training Analysis Checkpoints (Llama-3.2-3B)

What makes a base language model suitable for RL? Through controlled experiments, we identify key factors, then leverage them to scale up mid-training. • 10 items • Updated Jul 7