Spaces:

lil58
/

interview

Running

App Files Files Community

interview / docs

Commit History

docs: drop R4-续 retrospective paragraph — it reintroduced the wrong shaping=0.5 value

8eeeb67

Running

Lee93whut Lee93whut commited on 1 day ago

docs: polish pass — README wording, experiment_log structure + honest retrospective, LaTeX micro-style

e1ecae1

Lee93whut Lee93whut commited on 1 day ago

chore: codebase hygiene pass — untrack weights, migrate to logging, tidy comments

17bc537

Lee93whut Lee93whut commited on 1 day ago

docs: clean up R3/R4 record and consolidate technical narrative

92423f0

Lee93whut Lee93whut commited on 1 day ago

docs: finalize R4 documentation — Dueling 84% Holdout, full ablation record

acbd4c5

Lee93whut commited on 1 day ago

feat(round4): four-algorithm ablation — Dueling best at 84% Holdout

44cfe4c

Lee93whut commited on 1 day ago

docs(round4): finalize R4 Double DQN results — 78% Holdout, Grid-SPL clarification

b14b412

Lee93whut commited on 1 day ago

docs(round4): complete experiment record — A1/A2/A3 full EVAL data and conclusions

a91b194

Lee93whut commited on 2 days ago

feat(round4): upgrade obs 3->4 channels (visited_map) + EVAL-based checkpoint

062d629

Lee93whut commited on 2 days ago

feat(round3): buffer=80k + target_freq=1500 + shaping=0.5 → 74% holdout, SPL=0.735

c1b9ba8

Lee93whut commited on 2 days ago

feat(round2): extended training, Double DQN 64% holdout, SPL=0.633

ff1b1b8

Lee93whut commited on 2 days ago

feat(round1): baseline DQN variants — Vanilla/Double/Dueling/Double+Dueling

bf17b0c

Lee93whut commited on 2 days ago