metadata
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
This repository contains the OctoThinker model presented in OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling.
Project page: https://huggingface.co/OctoThinker