Local LLMs
Specific
local models, offline LLM, llama, on-device inference
Scoured 131 posts in 11.7 ms
NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
🌐 Web Dev · huggingface.co · 2 days ago · Hacker News
Less-relevant results
Here's a llama.cpp CLI Command builder.
💻 Code Generation · llamabuilding.com · 1 day ago · r/LocalLLaMA
Konversio: Open-source agentic customer support for digital sovereignty
🤗 Open Source AI · konversio.org · 6 days ago · Hacker News
local AI agents for Cursor with pre-tuned marketplace/commu
🤗 Open Source AI · locaible.com · 8 hours ago · Hacker News
Youssof Altoukhi (@Youssofal_)
🧠 LLMs · xcancel.com · 3 days ago · r/LocalLLaMA
1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
🟢 NVIDIA · smolhub.com · 2 days ago · r/LocalLLaMA
Show HN: Audit any AI/data pairing with Veritrooper
🤗 Open Source AI · veritrooper.com · 5 days ago · Hacker News
Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB
🤗 Open Source AI · Content type: Blog · ziraph.com · 5 days ago · Hacker News
KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.
🤗 Open Source AI · Content type: Code · github.com · 6 hours ago · Hacker News
The May 2026 AI Toolkit: Stop Picking One Model
🤖 AI Coding · Content type: Blog · colinritman.medium.com · 5 days ago
Token4Token — pay-per-token inference on Gnosis + Swarm
🤗 Open Source AI · t4t.eth.link · 1 day ago · Hacker News
Large companies can add a local LLM filter layer to considerably reducing their AI costs
🤗 Open Source AI · umrashrf.github.io · 5 days ago · Hacker News
Show HN: Ext-Infer
🤗 Open Source AI · infer.displace.tech · 3 days ago · Hacker News
Evaluating bigaspv2-5, a Flow Matching Alternative to SDXL
🕵️ Agentic AI · hackernoon.com · 11 hours ago
Omnifs: APIs and data sources as files you can ls, cat, grep, and pipe
⚙️ DevOps · omnifs.dev · 1 day ago · Hacker News
Humans and LLMs share a mental disorder: Fugue Lock
✍️ Prompt Engineering · vwwwv.org · 1 day ago · Hacker News
LLM AI Chatbots are letting me down every single day
🤗 Open Source AI · umrashrf.github.io · 5 days ago · Hacker News
Ask HN: Is it feasible to run a model on device for complete privacy?
🤗 Open Source AI · Content type: Discussion · news.ycombinator.com · 3 days ago · Hacker News
Launch HN: General Instinct (YC P26) – Frontier models on edge devices
🤗 Open Source AI · Content type: Discussion · news.ycombinator.com · 5 days ago · Hacker News
Arconia for Spring Boot: DevEx, Observability, Multitenancy, GenAI, Cloud Native
☁️ Cloud Computing · Content type: Code · arconia.io · 2 days ago · Hacker News