interview / docs

Commit History

docs: drop R4-continued retrospective paragraph - it reintroduced the wrong shaping=0.5 value
8eeeb67

Lee93whut committed on

docs: polish pass - README wording, experiment_log structure + honest retrospective, LaTeX micro-style
e1ecae1

Lee93whut committed on

chore: codebase hygiene pass - untrack weights, migrate to logging, tidy comments
17bc537

Lee93whut committed on

docs: clean up R3/R4 record and consolidate technical narrative
92423f0

Lee93whut committed on

docs: finalize R4 documentation - Dueling 84% Holdout, full ablation record
acbd4c5

Lee93whut committed on

feat(round4): four-algorithm ablation - Dueling best at 84% Holdout
44cfe4c

Lee93whut committed on

docs(round4): finalize R4 Double DQN results - 78% Holdout, Grid-SPL clarification
b14b412

Lee93whut committed on

docs(round4): complete experiment record - A1/A2/A3 full EVAL data and conclusions
a91b194

Lee93whut committed on

feat(round4): upgrade obs 3->4 channels (visited_map) + EVAL-based checkpoint
062d629

Lee93whut committed on

feat(round3): buffer=80k + target_freq=1500 + shaping=0.5 → 74% holdout, SPL=0.735
c1b9ba8

Lee93whut committed on

feat(round2): extended training, Double DQN 64% holdout, SPL=0.633
ff1b1b8

Lee93whut committed on

feat(round1): baseline DQN variants - Vanilla/Double/Dueling/Double+Dueling
bf17b0c

Lee93whut committed on