feat: reward verifier alignment, notebook hardening, model name fix cdc237b CreativeEngineer Claude Opus 4.6 committed on Mar 8
refactor: align p1 runtime contract and baseline reporting 6deaccc CreativeEngineer committed on Mar 8