7900XTX 24GB vram, can finally fit Q6K+MTP with Qwen 3.6 27B at 131k context (opens in new tab)
Tests on Qwen 3.6 27B show why TurboQuant is overrated but saved by TCQ, q5 deserves more attention, and symmetric q8 might be a waste of VRAM.
Read the original article