Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🤖 Local LLMs
Specific
Ollama, LLaMA, Mistral, On-device AI
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
200083
posts in
23.2
ms
Building Private LLMs Locally with
Ollama
and
LangChain
: A Production Engineer’s Playbook
💾
Local-first Software
medium.com
·
3d
Local LLMs are ready for real work
🥶
Cold Start Problem
thelurkreport.beehiiv.com
·
22h
·
r/LocalLLaMA
My local LLM can call Claude when it's
stuck
, and it changed everything about my local-first
setup
🥶
Cold Start Problem
xda-developers.com
·
1d
LLMForge
: Multi-Backend Hardware-Aware Neural Architecture Search with
Infinite-Head
Attention for Edge Language Models
🧠
Deep Learning
arxiv.org
·
3h
RedToasty/llama.cpp
_
qts
: Fixing --split-mode tensor, with different KV cache quantization types.
🔗
RAG
github.com
·
1d
·
r/LocalLLaMA
I built a Mac app for private meeting/interview AI without
adding
a
bot
to calls
💬
Prompt Engineering
extrabrain.app
·
12h
·
r/SideProject
Exploring
LLMs Speed
Benchmarks
🔢
Kolmogorov Complexity
mlops.community
·
5d
Find bugs in YOUR code using
OpenCode
,
Llama.cpp
and Qwen3.6
🥶
Cold Start Problem
wtarreau.blogspot.com
·
1d
·
Lobsters
,
wtarreau.blogspot.com
Pokemon
NPCs
Powered by Local LLMs
📖
Interactive Fiction
owenmc.dev
·
1d
·
Hacker News
Ollama vs
vLLM
vs
llama.cpp
: Which Wins for Your Use Case
💾
Local-first Software
tildalice.io
·
3d
Mistral's Open TTS, Anthropic's Activation Translator, and Matt
Pocock
's Skills Repo:
Tokenizer
#28
🎭
Anthropic Claude
newsletter.artofsaience.com
·
1d
LM
Studio
🔗
RAG
flathub.org
·
4d
https://
www.together.ai/blog/llama-2-7b-32k
🔗
RAG
together.ai
·
5d
Running a Local LLM on a 12-year-old
Raspberry
Pi
1
💾
Local-first Software
blog.adafruit.com
·
6d
An AI assistant
whose
memory
belongs
to you.
💬
Prompt Engineering
keel-labs.org
·
3d
Google’s
Gemma
4 +
Ollama
: Run the Most Powerful Free AI Model Locally in 2026 (No GPU Required)
🧠
Deep Learning
medium.com
·
3d
Self-hosted AI assistant architecture in Node.js — Telegram, WhatsApp, Discord and
Slack
powered by local
Ollama
💾
Local-first Software
documentcrustai.netlify.app
·
6d
·
r/selfhosted
I built my own
Googlebook
with a
Raspberry
Pi, local LLMs, and old hardware
💾
Local-first Software
xda-developers.com
·
1d
See through local AI
lies
with
Irish
eyes
🧠
LLM Reasoning
theregister.com
·
5d
Llama.cpp
b9180
:
MTP
support landed
💾
Local-first Software
github.com
·
1d
·
Hacker News
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help