🧠 Local LLM
Scoured 228 posts in 8.2 ms
local AI agents for Cursor with pre-tuned marketplace/commu
🔌 Model Context Protocol · locaible.com · 17 hours ago · Hacker News
Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss
🧠 LLM Inference · Content type: News · digg.com · 5 days ago
Here's a llama.cpp CLI Command builder.
🧠 LLM Inference · llamabuilding.com · 2 days ago · r/LocalLLaMA
LM Link launches on iPhone, bringing local AI model access to iOS devices
🧠 LLM Inference · alternativeto.net · 5 days ago
Purpose-built local AI agents
🤖 Qwen · Content type: Blog · samihonkonen.com · 2 days ago · Hacker News
KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.
🧠 LLM Inference · Content type: Code · github.com · 15 hours ago · Hacker News
DeskDash - a free Windows tool to easily manage your GGUF files
⚡ LLM Quantization · gerry7.itch.io · 3 days ago · r/LocalLLaMA
"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY
📊 Prometheus · Content type: News, Blog · braddelong.substack.com · 2 days ago · Substack
Self-hosted remote access for Ollama without complicated setup
🏠 Self-Hosting · oab.arc-i.co.uk · 3 days ago · r/selfhosted
Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
🧠 LLM Inference · Content type: Blog · dnhkng.github.io · 2 days ago
How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
🕸️ WebAssembly · buy.polar.sh · 2 days ago · DEV
When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖
🏠 Self-Hosting · tldr.tech · 6 days ago
techjarves/Portable-AI-USB: A 100% offline, fully portable, zero-trace AI (Ollama + Llama 3 + AnythingLLM) that runs natively from a USB drive on Windows and Mac.
🤖 Qwen · Content type: Code · github.com · 2 days ago
Alduin 4B, an uncensored Vision LLm just released.
🧠 LLM Inference · huggingface.co · 8 hours ago · r/StableDiffusion
LM Studio releases LM Link: control local Mac models from an iPhone
⚡ LLM Quantization · stadt-bremerhaven.de · 6 days ago
Clairvoyant: Predictive SJF Scheduling to Mitigate Head-of-Line Blocking in Serial LLM Backends
🧠 LLM Inference · Content type: Academic · arxiv.org · 3 days ago
RakuOS fixes the one thing that annoys me most about immutable Linux distros
🔄 ArgoCD · Content type: News · zdnet.com · 1 day ago
Would a prepaid pass for a coding agent solve a real need or is it just my itch?
🏠 Self-Hosting · codehamr.com · 5 days ago · r/SideProject
fix(memory-core): filter stale recall entries in REM harness preview · openclaw/openclaw@92418fc
📝 SQLite WAL · Content type: Code · github.com · 1 day ago
Large companies can add a local LLM filter layer to considerably reduce their AI costs
🧠 LLM Inference · umrashrf.github.io · 5 days ago · Hacker News