RT by @awnihannun: Today we're shipping our biggest MLX-VLM release yet: v0.6.0 (opens in new tab)
Today we're shipping our biggest MLX-VLM release yet: v0.6.0 ...and we are raising πΈ This one's about turning your Apple devices into real local agent machines. From your desk to your pocket. What's new: β‘ Speculative decoding everywhere β Gemma 4 EAGLE3 + DFlash, Qwen MTP, DeepSeek V4 MTP. Faster tokens, less waiting. π€ Agent-ready server β native Anthropic /v1/messages API, stateful /v1/responses, tool calls, Codex context budgets. Plug Claude Code & Codex straight into local models. ποΈ New...
Read the original article