Local LLM

local AI agents for Cursor with pre-tuned marketplace/commu

 🔌Model Context Protocol
locaible.com··Hacker News

Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss

 🧠LLM Inference  Content type: News
digg.com·

Here's a llama.cpp CLI Command builder.

 🧠LLM Inference

LM Link launches on iPhone, bringing local AI model access to iOS devices

 🧠LLM Inference
alternativeto.net·

Purpose-built local AI agents

 🤖Qwen  Content type: Blog

KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.

 🧠LLM Inference  Content type: Code
github.com··Hacker News

DeskDash - a free Windows tool to easily manage your GGUF files

 LLM Quantization

"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY

 📊Prometheus  Content type: News  Content type: Blog

Self-hosted remote access for Ollama without complicated setup

 🏠Self-Hosting

Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent

 🧠LLM Inference  Content type: Blog
dnhkng.github.io·

How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)

 🕸️WebAssembly
buy.polar.sh··DEV

When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖

 🏠Self-Hosting
tldr.tech·

techjarves/Portable-AI-USB: A 100% offline, fully portable, zero-trace AI (Ollama + Llama 3 + AnythingLLM) that runs natively from a USB drive on Windows and Mac.

 🤖Qwen  Content type: Code
github.com·

Alduin 4B, an uncensored vision LLM, just released.

 🧠LLM Inference

LM Studio releases LM Link: control local Mac models from an iPhone

 LLM Quantization
stadt-bremerhaven.de·

Clairvoyant: Predictive SJF Scheduling to Mitigate Head-of-Line Blocking in Serial LLM Backends

 🧠LLM Inference  Content type: Academic
arxiv.org·

RakuOS fixes the one thing that annoys me most about immutable Linux distros

 🔄ArgoCD  Content type: News
zdnet.com·

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

 🏠Self-Hosting

fix(memory-core): filter stale recall entries in REM harness preview · openclaw/openclaw@92418fc

 📝SQLite WAL  Content type: Code
github.com·

Large companies can add a local LLM filter layer to considerably reduce their AI costs

 🧠LLM Inference