Quantization
Qwen3.6 + MTP: Calculated context size is smaller when I use `--spec-draft-type-* q4_0`. is this normal? · ggml-org llama.cpp · Discussion #24102
🤖AI Content type: Discussion Content type: CodeMorphoQuant: Modality-Aware Quantization for Omni-modal Large Language Models
👁️Computer Vision Content type: AcademicLLMCodec: Adapting Video Codecs for Efficient Weight Compression of Large Language Models
💬LLMs Content type: AcademicTempoVLA: Learning Speed-Controllable Vision-Language-Action Policies
🎮Reinforcement Learning Content type: AcademicNo more posts from jhcha.oyo's subscribed feeds.