Mlx-optiq: per-layer mixed-precision LLM quantization for Apple Silicon (opens in new tab) 聽馃LLMs 聽Content type: Video 聽Content type: Discussion 聽Content type: Tutorial
Quantize, fine-tune and serve LLMs locally on Apple Silicon (M1 to M5). MLX-native, no PyTorch, no cloud. On PyPI.
Read the original article