Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🏠 Local LLM Deployment
Model Optimization, GPU Acceleration, Inference, Privacy
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
8190
posts in
102.4
ms
llama.cpp
guide - Running LLMs
locally
, on any hardware, from scratch
blog.steelph0enix.dev
·
12h
🖥️
Self-hosted apps
Zero-Latency
Local AI:
Tuning
Your Linux Kernel for LLM Inference 🐧🧠
dev.to
·
2d
·
Discuss:
DEV
🖥️
Self-hosted apps
Import AI 444: LLM
societies
; Huawei makes kernels with AI;
ChipBench
jack-clark.net
·
2h
🖥️
Self-hosted apps
LlamaLib
: A cross-platform C++/C# library for local LLMs based on
llama.cpp
github.com
·
2d
·
Discuss:
Hacker News
🗃️
SQLite
Optimized
LLM Inference
Engines
rishirajacharya.com
·
5d
🗃️
SQLite
Domain
Knowledge Is the New
Syntax
blog.melashri.net
·
5h
·
Discuss:
Hacker News
🖥️
Self-hosted apps
Main
Content ||
Math
∩ Programming
jeremykun.com
·
17h
🗃️
SQLite
Show HN:
Molinar
– Open-source alternative to ai.com (
AGPL-3.0
)
business.molinar.ai
·
8h
·
Discuss:
Hacker News
🖥️
Self-hosted apps
SDFP
: Speculative Decoding with
FIT-Pruned
Models for Training-Free and Plug-and-Play LLM Acceleration
arxiv.org
·
3d
🗃️
SQLite
From Prediction to
Compilation
: A Manifesto for
Intrinsically
Reliable AI
news.ycombinator.com
·
1d
·
Discuss:
Hacker News
🗃️
SQLite
How I
squeezed
a
BERT
sentiment analyzer into 1GB RAM on a $5 VPS
mohammedeabdelaziz.github.io
·
2d
·
Discuss:
Hacker News
🗃️
SQLite
Import AI 444: LLM
societies
; Huawei makes kernels with AI;
ChipBench
importai.substack.com
·
2h
·
Discuss:
Substack
🖥️
Self-hosted apps
Roll with Advantage:
Hacking
Lenovo
Vantage
mkiesel.ch
·
4h
·
Discuss:
Hacker News
🖥️
Self-hosted apps
ML-LIB
: Machine Learning Library Proposed For The Linux Kernel
phoronix.com
·
2d
·
Discuss:
Hacker News
🖥️
Self-hosted apps
OpenClaw
: I gave an AI my credit card and let it
loose
on Amazon
codedojo.com
·
47m
·
Discuss:
Hacker News
🖥️
Self-hosted apps
How AI coding makes
developers
56% faster and 19%
slower
thenewstack.io
·
4h
🖥️
Self-hosted apps
Hitting
1,000
tokens
per second on a single RTX 5090
blog.alpindale.net
·
17h
·
Discuss:
Hacker News
🖥
Home Lab Setup
Concurrent
vs.
Parallel
Execution in LLM API Calls: From an AI Engineer’s Perspective
pub.towardsai.net
·
10h
🖥️
Self-hosted apps
hanig/engram
: Personal knowledge graph and automation system
github.com
·
1d
🖥️
Self-hosted apps
Allium
is an LLM-native language for
sharpening
intent alongside implementation
juxt.github.io
·
5h
·
Discuss:
Hacker News
🗃️
SQLite
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help