Where should test-time compute go? Surprisal-guided selection in verifiable environments (11 days ago)
Frontier Security Agents Don't Lack Detection. They Lack Restraint (26 days ago)