chore: codebase hygiene pass β untrack weights, migrate to logging, tidy comments 17bc537 Lee93whut Lee93whut commited on 1 day ago
docs: clean up R3/R4 record and consolidate technical narrative 92423f0 Lee93whut Lee93whut commited on 1 day ago
refactor(model): update architecture docs and set dueling as default algorithm 34ad2cc Lee93whut commited on 1 day ago
fix: eliminate infinite-loop risk in maze start/goal sampling 10926f0 Lee93whut commited on 1 day ago
docs(round4): finalize R4 Double DQN results β 78% Holdout, Grid-SPL clarification b14b412 Lee93whut commited on 1 day ago
docs(round4): complete experiment record β A1/A2/A3 full EVAL data and conclusions a91b194 Lee93whut commited on 2 days ago
style(train): remove forward-reference quotes from type hints (Python 3.10+) 274376b Lee93whut commited on 2 days ago
fix(train): use terminated-only mask for TD bootstrap (Gymnasium v0.26) 670449d Lee93whut commited on 2 days ago
fix(train): guarantee BFS-connected start/goal, bounded retry with fallback 92a3812 Lee93whut commited on 2 days ago