OctoThinker
/

OctoThinker-8B-Short-Base

Text Generation

Model card Files Files and versions

Add model card

#1

by nielsr HF Staff - opened Jun 29, 2025

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

This PR adds a model card for the model presented in OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling paper.
The PR also adds the appropriate tags for license, library, and pipeline.

Add model carde833e09f

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Cannot merge

This branch has merge conflicts in the following files:

README.md

· Sign up or log in to comment