Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI
🤖 AI
Broad
local llms
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
275
posts in
6.7
ms
Building & Benchmarking:
LLMs
on a 16GB Jetson Orin NX for Hermes Agent
🐧
unix
Content type:
Blog
dnhkng.github.io
·
2d
2 days ago
Actions for Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
Large companies can add a
local
LLM
filter layer to considerably reducing their
AI
costs
🧱
data structures
umrashrf.github.io
·
5d
5 days ago
·
Hacker News
Actions for Large companies can add a local LLM filter layer to considerably reducing their AI costs
Purpose-built
local
AI
agents
🧩
lisp
Content type:
Blog
samihonkonen.com
·
2d
2 days ago
·
Hacker News
Actions for Purpose-built local AI agents
[AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo
🕸️
graphs
Content type:
News
latent.space
·
14h
14 hours ago
Actions for [AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo
When
AI
builds itself 👷,
AI
is not a line item 📝,
local
LLMs
for agentic coding 🤖
🧱
data structures
tldr.tech
·
6d
6 days ago
Actions for When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖
Here's a
llama.cpp
CLI Command builder.
🐧
unix
llamabuilding.com
·
2d
2 days ago
·
r/LocalLLaMA
Actions for Here's a llama.cpp CLI Command builder.
I Processed 2.4 Billion Tokens Across 52
AI
Models for $0.52. Here's the Full Breakdown.
🧱
data structures
saintlex.sbs
·
14h
14 hours ago
·
DEV
Actions for I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.
Google DeepMind releases
Gemma
4 QAT, but Unsloth developer Daniel Han warns naive
llama.cpp
conversions suffer accuracy loss
🧱
data structures
Content type:
News
digg.com
·
6d
6 days ago
Actions for Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss
AMD's Lemonade SDK For
Local
AI
Adds NVIDIA CUDA Support
🧱
data structures
phoronix.com
·
1d
1 day ago
·
r/artificial
Actions for AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
Tokiyo Ooto & Orhythmoが新作アルバム『Play Nursery Rhymes and Children's Songs (童謡と子供の歌を唄う)』をEM Recordsよりリリース | AVE | CORNER PRINTING
🐧
unix
ave-cornerprinting.com
·
8h
8 hours ago
Actions for Tokiyo Ooto & Orhythmoが新作アルバム『Play Nursery Rhymes and Children's Songs (童謡と子供の歌を唄う)』をEM Recordsよりリリース | AVE | CORNER PRINTING
A system programmer’s guide to
LLM
inference
🧱
data structures
Content type:
Blog
blog.xiangpeng.systems
·
3d
3 days ago
·
Hacker News
Actions for A system programmer’s guide to LLM inference
Re-quantizing
a
local
LLM
14x faster by skipping the tensors that didn't change
🧱
data structures
Content type:
News
Content type:
Blog
andreaborio.substack.com
·
1d
1 day ago
·
Substack
Actions for Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change
Running
LLM
Inference
on Kubernetes: What It Actually Takes
🧱
data structures
Content type:
Blog
fairwinds.com
·
6d
6 days ago
Actions for Running LLM Inference on Kubernetes: What It Actually Takes
Why Spring Teams Don’t Need a Second Runtime for
AI
Agents
🧱
data structures
foojay.io
·
22h
22 hours ago
Actions for Why Spring Teams Don’t Need a Second Runtime for AI Agents
Local
AI
agents on Arduino UNO Q
🧱
data structures
Content type:
Blog
blog.arduino.cc
·
2d
2 days ago
Actions for Local AI agents on Arduino UNO Q
MoQ
GGUFs
and GSQ: Low-Bit
GGUFs
Are About to Get Much Better
🧱
data structures
Content type:
News
Content type:
Blog
kaitchup.substack.com
·
5d
5 days ago
·
r/LocalLLaMA
Actions for MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better
Cyber Triage 3.18: New
AI
+ Cloud Automation Capabilities
🧱
data structures
Content type:
Blog
Content type:
Tutorial
cybertriage.com
·
17h
17 hours ago
Actions for Cyber Triage 3.18: New AI + Cloud Automation Capabilities
The latest
Gemma
4 models use a training trick to slash their on-device memory footprint
🧱
data structures
androidauthority.com
·
5d
5 days ago
Actions for The latest Gemma 4 models use a training trick to slash their on-device memory footprint
How I benchmarked a 100%
local
RAG pipeline to 9/9 (zero API keys)
🧱
data structures
buy.polar.sh
·
2d
2 days ago
·
DEV
Actions for How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
Qwen 3.6 27B AutoRound
GGUF
, need your feedback
🧱
data structures
huggingface.co
·
1d
1 day ago
·
r/LocalLLaMA
Actions for Qwen 3.6 27B AutoRound GGUF, need your feedback
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help