Local LLM Deployment
Specific
Model Optimization, GPU Acceleration, Inference, Privacy
Scoured 87 posts in 31.4 ms
flama.dev · 1 day ago
Serving Large Language Models with a Minimalist Python CLI
Covers 2 stories, including uv
Discussed on Hacker News
vettedconsumer.com · 6 days ago
GLM-5.2: The Most Powerful Open Model yet and the Brutal Reality of Running It
Covers 6 stories, including "zai-org/GLM-5.2 is here!"
Covered by notes.dsebastien.net
Discussed on Hacker News
docs.mistral.ai · 9 hours ago
Mistral AI Cookbooks
Covered by 3 sources, including Mistral AI, VentureBeat
GitHub · 2 days ago
Show HN: Loqi, a "local-first" translation tool using Ollama/llama.cpp
Covers Ollama
Discussed on Hacker News
gHacks · 2 days ago
Valve Confirms Steam Machine Launches June 30 at $1,049 to $1,349 With Random Reservation Queue
Covered by kite.kagi.com
everything.one · 2 days ago
Everything*: An interactive voyage through all orders of magnitude
Covers Powers of Ten (1977) [video]
Discussed on Hacker News
Hacker News · 19 hours ago
We trained a real-time world model for $2k with Minecraft mod revenue
Discussed on Hacker News
GitHub · 2 days ago
DeepSeek V4 Flash optimized framework and model variants for DGX Spark
Covers Nvidia RTX Spark
Discussed on Hacker News
sipp.sh · 19 hours ago
Run small local LLMs in browser 3x faster
Discussed on Hacker News
Hugging Face · 1 day ago
Build real agentic apps using CUGA: two dozen working examples on a lightweight harness
Covered by tldr.tech
jeena.net · 6 days ago
AI coding: loop engineering a translator
Discussed on Hacker News
PC Gamer · 2 days ago
Steam Machine review
Covered by 7 sources, including Kotaku, GamingOnLinux
Discussed on Hacker News
autonomy-landing-page.vercel.app · 5 days ago
Show HN: Autonomy – Self-Harness/Self-Directed AI Agent Core Under Development
Discussed on Hacker News
GitHub · 5 days ago
How do I set the right llama.cpp parameters?
Covers JSON Schema
Covered by DEV Community, Alex Ewerlöf Notes
Discussed on r/LocalLLaMA
Aftermath · 2 days ago
The Steam Machine Is An Iconoclastic Computer Born In Unforgiving Times
Covers 4 stories, including Exclusive
Covered by 5 sources, including Kotaku, The Verge
GitHub · 4 days ago
Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
Discussed on Hacker News
lector.dev · 5 days ago
Show HN: Evaluating Local LLMs as language translators for my app
Discussed on Hacker News
GitHub · 2 days ago
Open-source security auditors for Supabase, Strapi, Hasura and Ollama
Discussed on Hacker News
teachmecoolstuff.com · 6 days ago
Fine Tuning a Tiny Local LLM to Categorize Questions
Discussed on Hacker News (twice)
Martin Alderson · 3 days ago
Expert-aware quantisation: near-Q4 quality at near-Q2 size?
Discussed on Hacker News