Local AI

Feeds to Scour
SubscribedAll
Scoured 369 posts in 7.0 ms

Token4Token — pay-per-token inference on Gnosis + Swarm

 ☁️Cloud Infrastructure

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

 ⚙️LLM Fine-tuning

"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY

 💻AI Coding  Content type: News  Content type: Blog

Making Local LLM Go Brrr

 ✍️Prompt Engineering

lightmetal: GPU LLM Inference From a Single Java 25 JAR

 💾ARM  Content type: Blog
adambien.blog·

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

 ⚙️AI Automation  Content type: Code
github.com··DEV

LM Studio now lets you use your iPhone to talk to local models on your Mac

 Wearables
9to5mac.com··r/apple

Integrate on-device AI models into your app using Core AI - WWDC26 - Videos

 🌐Open Source

Purpose-built local AI agents

 🤖AI Agents  Content type: Blog

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 💾ARM  Content type: News  Content type: Blog
blog.google··Hacker News

NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet

 🔌APIs

Large companies can add a local LLM filter layer to considerably reducing their AI costs

 ⚙️LLM Fine-tuning

Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent

 🍓Raspberry Pi  Content type: Blog
dnhkng.github.io·

Quality Is Not a Safety Proxy Under Quantization

 🛡️AI Safety  Content type: Academic
arxiv.org·

When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖

 🤖AI Agents
tldr.tech·

Apple rebuilt its on-device AI stack at WWDC 2026

 💾ARM  Content type: Blog
ziraph.com··Hacker News

WWDC 2026: Foundation Models (& Anarlog)

 💾ARM
skushagra.com·

Running LLM Inference on Kubernetes: What It Actually Takes

 ☁️Cloud Infrastructure  Content type: Blog
fairwinds.com·

What's in the Box? A Field Guide to AI Models

 ⚙️LLM Fine-tuning  Content type: Blog
iankduncan.com·

Show HN: Ext-Infer

 🪟Windows

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help