Back to article

@adlrocha Weekly Newsletter

Towards local plug-and-play AI (opens in new tab)

Covers 6 stories including antirez/ds4: DeepSeek 4 Flash local inference engine for MetalDiscussed on Substack

Covers 6 related stories

antirez/ds4: DeepSeek 4 Flash local inference engine for Metal

Discussed on Hacker News and r/LocalLLaMA

huggingface.co·

Qwen 3.6 27B is out

Discussed on Hacker News and r/LocalLLaMA

Accelerating Gemma 4: faster inference with multi-token prediction drafters

Discussed on Hacker News

huggingface.co·

Alibaba open-sources Qwen3.6-35B-A3B, a 35B MoE model with 3B active parameters

Discussed on Hacker News, r/LocalLLaMA, and r/artificial

Dao-AILab/flash-attention

EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test