LLM Quantization
Qwen3.6 + MTP: Calculated context size is smaller when I use `--spec-draft-type-* q4_0`. is this normal? · ggml-org llama.cpp · Discussion #24102
🧠Local llm Content type: Discussion Content type: CodeNo more posts from akapaka's subscribed feeds.