Add model card
#1, opened by nielsr (HF Staff)
This PR adds a model card for the model presented in the paper "OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling".
The PR also adds the appropriate tags for license, library, and pipeline.