🧠 Local llm - akapaka · Scour

local AI agents for Cursor with pre-tuned marketplace/commu

🔌Model Context Protocol

locaible.com··Hacker News

Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss

🧠LLM Inference News

Here's a llama.cpp CLI Command builder.

🧠LLM Inference

llamabuilding.com··r/LocalLLaMA

LM Link launches on iPhone, bringing local AI model access to iOS devices

🧠LLM Inference

alternativeto.net·

Purpose-built local AI agents

🤖Qwen Blog

samihonkonen.com··Hacker News

KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.

🧠LLM Inference Code

github.com··Hacker News

DeskDash - a free Windows tool to easily manage your GGUF files

⚡LLM Quantization

gerry7.itch.io··r/LocalLLaMA

"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY

📊Prometheus News Blog

braddelong.substack.com··Substack

Self-hosted remote access for Ollama without complicated setup

🏠Self-Hosting

oab.arc-i.co.uk··r/selfhosted

Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent

🧠LLM Inference Blog

dnhkng.github.io·

How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)

🕸️WebAssembly

buy.polar.sh··DEV

When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖

🏠Self-Hosting

techjarves/Portable-AI-USB: A 100% offline, fully portable, zero-trace AI (Ollama + Llama 3 + AnythingLLM) that runs natively from a USB drive on Windows and Mac.

🤖Qwen Code

Alduin 4B, an uncensored Vision LLm just released.

🧠LLM Inference

huggingface.co··r/StableDiffusion

LM Studio veröffentlicht LM Link: Lokale Mac-Modelle per iPhone steuern

⚡LLM Quantization

stadt-bremerhaven.de·

Clairvoyant: Predictive SJF Scheduling to Mitigate Head-of-Line Blocking in Serial LLM Backends

🧠LLM Inference Academic

RakuOS fixes the one thing that annoys me most about immutable Linux distros

🔄ArgoCD News

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

🏠Self-Hosting

codehamr.com··r/SideProject

fix(memory-core): filter stale recall entries in REM harness preview · openclaw/openclaw@92418fc

📝SQLite WAL Code

Large companies can add a local LLM filter layer to considerably reducing their AI costs

🧠LLM Inference

umrashrf.github.io··Hacker News

Sign up or log in to see more results

Log in to enable infinite scrolling