Quantization
Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency
💻Local LLMs Content type: News Content type: BlogOn Low-Bit Quantization Errors in Speaker Verification: Diagnostic and Mitigation
💻Local LLMs Content type: AcademicMorphoQuant: Modality-Aware Quantization for Omni-modal Large Language Models
💻Local LLMs Content type: AcademicTempoVLA: Learning Speed-Controllable Vision-Language-Action Policies
🎙️Whisper Content type: AcademicNo more posts from matmat's subscribed feeds.