NextTerm-440M Checkpoints

Transformers-compatible checkpoints from the OEIS NextTerm-440M run.

These checkpoints use a Qwen3-style causal LM architecture with a 16-token OEIS digit vocabulary. They were converted from the training checkpoints by remapping the custom interleaved RoPE basis into the Hugging Face / Qwen split-half RoPE basis, so they can be loaded directly with AutoModelForCausalLM.

Checkpoints

Folder Tokens trained Notes
checkpoints/final_latest 13,999,999,995 Final checkpoint; recommended default
checkpoints/best_val 9,500,200,875 Best validation-loss checkpoint
checkpoints/checkpoint_tokens_012000258345 12,000,258,345 Historical checkpoint
checkpoints/checkpoint_tokens_012500265837 12,500,265,837 Historical checkpoint
checkpoints/checkpoint_tokens_013000266889 13,000,266,889 Historical checkpoint
checkpoints/checkpoint_tokens_013500289737 13,500,289,737 Historical checkpoint

OEIS Vocab

The model is token-ID based; no text tokenizer is included.

Token ID Meaning
0-9 decimal digits
10 negative sign
11 term separator
12 BOS
13 EOS
14 PAD
15 reserved

For next-term generation, stop on any of [11, 13, 14].

Loading

import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "N8Programs/NextTerm-440M-Checkpoints",
    subfolder="checkpoints/final_latest",
    dtype=torch.bfloat16,
    device_map="auto",
)

Example input IDs for the prefix 1, 2, 3, ...:

input_ids = torch.tensor([[12, 1, 11, 2, 11, 3, 11]], device=model.device)
out = model.generate(
    input_ids,
    max_new_tokens=192,
    do_sample=False,
    eos_token_id=[11, 13, 14],
    pad_token_id=14,
)

Evaluation Notes

OEIS Eval Neo excludes exact packed-sequence overlaps with the training data and uses max_new_tokens=192, which is sufficient for every answer in that eval set.

Known OEIS Eval Neo results:

Checkpoint Accuracy
final_latest 6545 / 19034 = 34.39%
best_val 6477 / 19034 = 34.03%

Each checkpoint folder includes an oeis_checkpoint_meta.json file with training tokens, source checkpoint path, and conversion details.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support