🧠 Local AI - c13e · Scour

Why and How to Run Local Models in Zed 🖥️Virtual Machines

I added a second GPU just for local AI workloads, and it cost less than upgrading my main one 🤖AI Engineering

xda-developers.com·3d

I built Mofakir: A native, local AI desktop assistant for Linux that actually interacts with your system ☎️OTP

github.com·5h·r/linux

tvall43/Qwen3.5-14B-A3B-Claude-4.6-Opus-Reasoning-Distilled-reap-gguf at main 💧Elixir

huggingface.co·17h·r/LocalLLaMA

Ollama Cheat Sheet: Local LLMs, Models, API & Integration (2026) 🐫OCaml

meshworld.in·2d·DEV

GPU Memory Math for LLMs: Formula That Tells You What Fits on Your GPU 🛡️Memory Safety

theahmadosman.substack.com·7h·Substack, r/LocalLLaMA

LM Studio 💧Elixir

flathub.org·6d

I tried 4 LLM speedup techniques on CPU. Three made it slower. ⚙️Systems Programming

deemwar-products.github.io·9h·Hacker News

Building a Controllable Inference Platform on Kubernetes with AI Runway 🤖AI Engineering

techcommunity.microsoft.com·2d

AMD says its $4K Ryzen AI Halo workstation practically pays for itself 🖥️Virtual Machines

theregister.com·4h

Build real-time voice applications with Amazon SageMaker AI and vLLM 🤖AI Engineering

aws.amazon.com·11h

Local LLMs are ready for real work ⚗️BEAM Ecosystem

thelurkreport.beehiiv.com·2d·r/LocalLLaMA

Ollama vs vLLM vs llama.cpp: Which Wins for Your Use Case ⚗️BEAM Ecosystem

tildalice.io·5d

LLM Inference 🤖AI Engineering

iop.systems·1h

Self-Hosted AI for Telegram/WhatsApp/Discord via Ollama, Zero Cloud ☎️OTP

crustaidocs.netlify.app·1d·Hacker News

Qwen’s MTP test puts local AI back in startup math 🤖AI Engineering

startupfortune.com·5d

What can a local model do for you in early May 2026? 🤖AI Engineering

manichord.com·2d·Hacker News

Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks 🤖AI Engineering

news.ycombinator.com·1d·Hacker News, r/LocalLLaMA

Can You Run LLMs Locally Without a GPU? I Tested 8 Models on Linux 🐫OCaml

itsfoss.com·5d·Hacker News

Benchmarking llama.cpp's brand-new MTP support on Strix Halo ⚙️Systems Programming

calebcoffie.com·2d·Hacker News
