Run LLMs locally on your Mac (M1 to M5): mixed-precision quantization, LoRA fine-tuning, and KV-cache serving, all MLX-native. No PyTorch, no discrete GPU, no cloud.