Article 1 Emergent Semantics Beyond Token Embeddings: A GPT-like Transformer Learns with Frozen 16‑D Binary Token-ID Embeddings (n_embed=16)
Language Models Without a Trainable Input Embedding Table
This collection is provided for reproducibility of the paper's main claim.
Bochkov/llm-fix-min-baseline-learned-input-table-model-classic Text Generation • 0.5B • Updated 1 day ago • 20
Bochkov/llm-fix-min-fixed-minimal-binary-code Text Generation • 0.5B • Updated 1 day ago • 18
Bochkov/llm-fix-min-affine-recoded-minimal-code-table-free Text Generation • 0.5B • Updated 1 day ago • 13
Emergent Semantics Beyond Token Embeddings
Paper: 2507.04886 (TMLR, Oct 2025), 'Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations'.
Bochkov/emergent-semantics-model-uni-glyph-335m Text Generation • Updated Jan 7 • 8
Bochkov/emergent-semantics-model-unfrozen-335m Text Generation • Updated Jan 7 • 6
Bochkov/emergent-semantics-model-16-bit-269m Text Generation • Updated Jan 7 • 11 • 1
Bochkov/emergent-semantics-model-64-bit-272m Text Generation • Updated Jan 7 • 4
Bochkov/growing-transformers-model-frozen-16-bit-baseline-monolyth-181m Text Generation • Updated Jan 9 • 33
Bochkov/growing-transformers-model-unfrozen-baseline-monolyth-247m Text Generation • Updated Jan 9 • 5
Bochkov/growing-transformers-model-frozen-unicode-baseline-monolyth-247m Text Generation • Updated Jan 9 • 1