add opd-32b-v33-s150-gptq-w4a16 + dflash phaseL int4mlp-gptq draft 39037e6 verified ycchen committed 8 days ago
add dflash-32b-draft-v2test-phaseL-int4mlp (int4-MLP draft) f7dd3b2 verified ycchen committed 8 days ago
add opd-32b-v33-s200-gptq-w4a16 (sink-on + long-ctx + fp8 KV scale) 39aa530 verified ycchen committed 8 days ago
card: add dflash-32b-draft-v2test-phaseL (phase-2 long-ctx final, job 140680); mark phase-1 as warm-up ef8af61 verified ycchen committed 8 days ago
card: fix opd-32b-deploy provenance (v33/job135076/step_200, not the V32 158-collapse) + add opd-32b-v33-s150 37c0b61 verified ycchen committed 8 days ago