Scour
🏠 Local AI
Specific
local-first AI, on-device AI, private AI, ollama, llama.cpp
Scoured 369 posts in 7.0 ms
Token4Token — pay-per-token inference on Gnosis + Swarm
☁️ Cloud Infrastructure · t4t.eth.link · 1 day ago · Hacker News
GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
⚙️ LLM Fine-tuning · vettedconsumer.com · 4 days ago · Hacker News
"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY
💻 AI Coding · Content type: News, Blog · braddelong.substack.com · 2 days ago · Substack
Making Local LLM Go Brrr
✍️ Prompt Engineering · seanpedersen.github.io · 6 days ago
lightmetal: GPU LLM Inference From a Single Java 25 JAR
💾 ARM · Content type: Blog · adambien.blog · 1 day ago
Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
⚙️ AI Automation · Content type: Code · github.com · 9 hours ago · DEV
LM Studio now lets you use your iPhone to talk to local models on your Mac
⌚ Wearables · 9to5mac.com · 6 days ago · r/apple
Integrate on-device AI models into your app using Core AI - WWDC26 - Videos
🌐 Open Source · developer.apple.com · 2 days ago · Hacker News
Purpose-built local AI agents
🤖 AI Agents · Content type: Blog · samihonkonen.com · 2 days ago · Hacker News
Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency
💾 ARM · Content type: News, Blog · blog.google · 5 days ago · Hacker News
NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
🔌 APIs · huggingface.co · 2 days ago · Hacker News
Large companies can add a local LLM filter layer to considerably reducing their AI costs
⚙️ LLM Fine-tuning · umrashrf.github.io · 5 days ago · Hacker News
Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
🍓 Raspberry Pi · Content type: Blog · dnhkng.github.io · 1 day ago
Quality Is Not a Safety Proxy Under Quantization
🛡️ AI Safety · Content type: Academic · arxiv.org · 17 hours ago
When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖
🤖 AI Agents · tldr.tech · 5 days ago
Apple rebuilt its on-device AI stack at WWDC 2026
💾 ARM · Content type: Blog · ziraph.com · 1 day ago · Hacker News
WWDC 2026: Foundation Models (& Anarlog)
💾 ARM · skushagra.com · 1 day ago
Running LLM Inference on Kubernetes: What It Actually Takes
☁️ Cloud Infrastructure · Content type: Blog · fairwinds.com · 5 days ago
What's in the Box? A Field Guide to AI Models
⚙️ LLM Fine-tuning · Content type: Blog · iankduncan.com · 1 day ago
Show HN: Ext-Infer
🪟 Windows · infer.displace.tech · 3 days ago · Hacker News