Add runtime_config.json with optimal spec decode settings 39e7656 verified darkmaniac7 commited on Mar 22
Update README with honest 3x averaged benchmark numbers 2d9453c verified darkmaniac7 commited on Mar 22
Add config_opencl.json for OpenCL draft backend support 5531dd9 verified darkmaniac7 commited on Mar 22
v3: KL-distilled 0.6B from Qwen3-8B (10K samples, KL=0.339, +41% uplift on SM8850) 1f8c190 verified darkmaniac7 commited on Mar 22
docs: update README — abliterated draft with benchmarks 51a33fe verified darkmaniac7 commited on Mar 18
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance af3c044 verified darkmaniac7 commited on Mar 18
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance baa57ff verified darkmaniac7 commited on Mar 18
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance b9d2cac verified darkmaniac7 commited on Mar 18
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance 470043d verified darkmaniac7 commited on Mar 18
feat: switch to abliterated draft (Huihui-Qwen3-0.6B-abliterated-v2) for better acceptance 33d29cb verified darkmaniac7 commited on Mar 18