Running Featured 41 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 41 Who needs 1T parameters? Olympiad proofs with a 4B model
view article Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output 15 days ago • 20