Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Local LLMs
🏠 Local LLMs
Specific
local models, offline LLM, llama, on-device inference
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
132
posts in
13.9
ms
Show HN: Run
Llama.cpp
In-Process from Java with Project Panama FFM
🤗
Open Source AI
deemwar-products.github.io
·
5d
5 days ago
·
Hacker News
Actions for Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
Qwen
3.6 27B AutoRound
GGUF
, need your feedback
🧠
LLMs
huggingface.co
·
1d
1 day ago
·
r/LocalLLaMA
Actions for Qwen 3.6 27B AutoRound GGUF, need your feedback
What
Ollama
Reveals About
Local
AI, Agents, and Open
Models
🤗
Open Source AI
Content type:
Blog
odsc.medium.com
·
1h
1 hour ago
Actions for What Ollama Reveals About Local AI, Agents, and Open Models
defai-digital/ax-engine: Apple Silicon
LLM
runtime supporting Gemma 4 and
Qwen
3.6 MTP
modes
🤗
Open Source AI
Content type:
Code
github.com
·
22h
22 hours ago
·
Hacker News
Actions for defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
Integrate
on-device
AI
models
into your app using Core AI - WWDC26 - Videos
🎵
Vibe Coding
developer.apple.com
·
2d
2 days ago
·
Hacker News
Actions for Integrate on-device AI models into your app using Core AI - WWDC26 - Videos
GGUF
vs GPTQ vs AWQ: The Plain-English Guide to
LLM
Quantization
(and Which One to Pick)
🧠
LLMs
vettedconsumer.com
·
4d
4 days ago
·
Hacker News
Actions for GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
Fixing a stuck
Ollama
runner and building a GPU watchdog
🤗
Open Source AI
patrickmccanna.net
·
2d
2 days ago
·
Hacker News
Actions for Fixing a stuck Ollama runner and building a GPU watchdog
Running
Qwen
35B MoE at 450k Context on a Single 32GB GPU
🟢
NVIDIA
local-llm.utop.workers.dev
·
3d
3 days ago
·
Hacker News
Actions for Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
Train
Models
Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
🟢
NVIDIA
Content type:
News
Content type:
Blog
developer.nvidia.com
·
2d
2 days ago
Actions for Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
Less-relevant results
On-device
AI is a margin decision
🤗
Open Source AI
Content type:
Blog
ziraph.com
·
5h
5 hours ago
·
Hacker News
Actions for On-device AI is a margin decision
Gemma 4 QAT
models
: Optimizing model compression for mobile and laptop efficiency
🤗
Open Source AI
Content type:
News
Content type:
Blog
blog.google
·
5d
5 days ago
·
Hacker News
Actions for Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency
Running Two
LLMs
on a Mini PC Sounds Great Until the Benchmarks Arrive
🤗
Open Source AI
hackernoon.com
·
1d
1 day ago
Actions for Running Two LLMs on a Mini PC Sounds Great Until the Benchmarks Arrive
MoQ
GGUFs
and GSQ: Low-Bit
GGUFs
Are About to Get Much Better
🧠
LLMs
Content type:
News
Content type:
Blog
kaitchup.substack.com
·
5d
5 days ago
·
r/LocalLLaMA
Actions for MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better
Re-quantizing
a
local
LLM
14x faster by skipping the tensors that didn't change
🤗
Open Source AI
Content type:
News
Content type:
Blog
andreaborio.substack.com
·
10h
10 hours ago
·
Substack
Actions for Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change
A system programmer’s guide to
LLM
inference
🤗
Open Source AI
Content type:
Blog
blog.xiangpeng.systems
·
2d
2 days ago
·
Hacker News
Actions for A system programmer’s guide to LLM inference
Run (your largest)
local
models
from your iPhone
🧠
LLMs
Content type:
Blog
lmstudio.ai
·
6d
6 days ago
·
Hacker News
,
r/LocalLLaMA
Actions for Run (your largest) local models from your iPhone
martidu4/honey-ai: 🍯 All-in-one AI honeypot powered by
local
LLMs
. SSH, HTTP, FTP, Telnet, SMTP, MySQL, Redis, Git, VNC, RDP — with canary tokens, tarpits, GZIP bombs, and threat intel reporting.
🤗
Open Source AI
Content type:
Code
github.com
·
8h
8 hours ago
·
Hacker News
Actions for martidu4/honey-ai: 🍯 All-in-one AI honeypot powered by local LLMs. SSH, HTTP, FTP, Telnet, SMTP, MySQL, Redis, Git, VNC, RDP — with canary tokens, tarpits, GZIP bombs, and threat intel reporting.
Purpose-built
local
AI agents
✍️
Prompt Engineering
Content type:
Blog
samihonkonen.com
·
2d
2 days ago
·
Hacker News
Actions for Purpose-built local AI agents
DeskDash - a free Windows tool to easily manage your
GGUF
files
💻
Code Generation
gerry7.itch.io
·
3d
3 days ago
·
r/LocalLLaMA
Actions for DeskDash - a free Windows tool to easily manage your GGUF files
Aspen: Own your intelligence
🤗
Open Source AI
Content type:
Discussion
Content type:
Tutorial
runonaspen.com
·
1d
1 day ago
·
Hacker News
Actions for Aspen: Own your intelligence
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help