🧠 Local LLMs - flipperjibber · Scour

Trainable Smooth-Rotation Transforms with Learned Channel Scales for LLM Quantization

🤖LLMs Academic

Local AI agents on Arduino UNO Q

🤖Agents Blog

blog.arduino.cc

Running LLM Inference on Kubernetes: What It Actually Takes

📝NLP Blog

fairwinds.com

andreyvgavrilov/food_database: AI agent to evaluate recipe nutrition

🤖Agents Code

github.com · r/mcp

LM Link launches on iPhone, bringing local AI model access to iOS devices

alternativeto.net

Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!

Less-relevant results

Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change

🔥PyTorch News Blog

andreaborio.substack.com · Substack

Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB

🤖Qwen Blog

ziraph.com · Hacker News

LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization

🤖LLMs Academic

China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (4 minute read)

🟣Claude News

How to Run Gemma 4 12B Locally - The Best AI For Consumer Laptops

🧠OpenAI Video

TFLite Edge Model Quantizer Snippet

itsevilduck.gumroad.com · DEV

fix(memory-core): filter stale recall entries in REM harness preview · openclaw/openclaw@92418fc

👨‍💻AI Coding Code

A system programmer’s guide to LLM inference

📝NLP Blog

blog.xiangpeng.systems · Hacker News

WWDC 2026: Foundation Models (& Anarlog)

skushagra.com

LM Studio now lets you use your iPhone to talk to local models on your Mac

9to5mac.com · r/apple

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

🔬Deep Learning

aarushgupta.io · Lobsters, Hacker News

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

local-llm.utop.workers.dev · Hacker News

Information Bottleneck Meets Quantization: Finite Rate Analysis and Optimal Designs

🤖LLMs Academic

"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY

🧠OpenAI News Blog

braddelong.substack.com · Substack
