Scour
🦙 llama.cpp
Specific
llama.cpp, local LLM, GGUF, CPU inference
Scoured 163 posts in 11.2 ms
Why and How to Run Local Models in Zed
🤖
LLM Inference
zed.dev
·
2d
Less-relevant results
Why I Invested ₹5 Lakhs in an M5 Max (64GB) Instead of Real Estate: An Architect’s Bet on On-Device AI and Global Freedom
🤖
LLM Inference
whatsapp.com
·
23h
·
DEV
ROCm 7 on Strix Halo: Benchmarking the New Toolbox Images
🤖
LLM Inference
sleepingrobots.com
·
4d
Qwen 3.7 Preview
🤖
LLM Inference
news.ycombinator.com
·
2d
·
Hacker News
Surprising things I learned putting together a Home Brain
🤖
LLM Inference
bitworking.org
·
3d
·
Hacker News
tokenspeed — feel LLM tokens-per-second
🤖
LLM Inference
mikeveerman.github.io
·
1h
Lab notebook: Edit completion #1
⚙️
Zig
randomhacks.net
·
4d
My Zerto Docs MCP Server: Ask Claude (or Copilot, or Cursor) Real Questions
💾
SQLite
jpaul.me
·
15h
Can You Run LLMs Locally Without a GPU? I Tested 8 Models on Linux
🤖
LLM Inference
itsfoss.com
·
5d
·
Hacker News
Self-Hosted AI for Telegram/WhatsApp/Discord via Ollama, Zero Cloud
🤖
LLM Inference
crustaidocs.netlify.app
·
1d
·
Hacker News
The Ultimate LLM Fine-Tuning Guide
🤖
LLM Inference
promptinjection.net
·
3d
·
Hacker News
I built Mofakir: A native, local AI desktop assistant for Linux that actually interacts with your system
🤖
LLM Inference
github.com
·
6h
·
r/linux
Ollama on Mac: Setup and Optimization Guide (2026)
🧠
Memory Allocators
insiderllm.com
·
4d
Capturing ideas with voice, local LLMs, and obsidian
⚙️
Zig
aidenredmondd.substack.com
·
2d
·
Substack
VectraYX-Nano: A 42M-Parameter Spanish Cybersecurity Language Model with Curriculum Learning and Native Tool Use
🤖
LLM Inference
arxiv.org
·
6d
AMD just dropped a compact AI workstation that makes discrete GPUs look outdated for running LLMs
🤖
LLM Inference
xda-developers.com
·
5h
Driving DeepSeek V4 Flash on your own Mac
🧠
Memory Allocators
pi.audreyt.org
·
3d
Forensics First. AI Second.
🤖
AI
brettshavers.com
·
2d
froggeric/Qwen3.6-27B-MTP-GGUF
🤖
LLM Inference
huggingface.co
·
3d
·
DEV
michelangeloromerochisco/ternative: Inference engine for ternary-weight LLMs with runtime LoRA - the llama.cpp of BitNet models
🤖
LLM Inference
github.com
·
1d
·
Hacker News