inference-optimization/ctest-Qwen3.6-27B-speculator-dataset Viewer • Updated 26 days ago • 5.61k • 48 • 1
ROCmFP4 MTP · Strix Halo Collection Self-speculative MTP quants in custom ROCmFP4 4-bit for AMD Strix Halo (gfx1151). Needs the charlie12345/rocmfp4-llama fork. • 5 items • Updated 9 days ago • 2