fix: add run_episode wrapper, use .2f score format, update test for strict bounds 11aa990 hellinferno commited on Apr 10
fix: correct inference log format, align openenv.yaml task IDs, harden Dockerfile 852b5ea hellinferno Claude Sonnet 4.6 commited on Apr 10