Text Generation
Safetensors
English
llama

Add model card

#1
by nielsr HF Staff - opened

This PR adds a model card for the OctoThinker model, linking it to the paper OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling and the project page. It also adds the relevant pipeline and library tags, as well as the license.

Cannot merge
This branch has merge conflicts in the following files:
  • README.md

Sign up or log in to comment