Position as Probability: Self-Supervised Transformers that Think Past Their Training for Length Extrapolation Paper • 2506.00920 • Published Jun 1, 2025 • 1