Running Featured 28 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 28 Who needs 1T parameters? Olympiad proofs with a 4B model
view article Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output 10 days ago • 19