Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Local AI
🏠 Local AI
Specific
local-first AI, on-device AI, private AI, ollama, llama.cpp
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
164
posts in
6.0
ms
Open-LLM-VTuber
Review: Offline
AI
Companion with Live2D
💾
ARM
Content type:
Blog
dev.to
·
2d
2 days ago
·
DEV
Actions for Open-LLM-VTuber Review: Offline AI Companion with Live2D
The latest Gemma 4 models use a training trick to slash their
on-device
memory footprint
⚡
AI Hardware
androidauthority.com
·
4d
4 days ago
Actions for The latest Gemma 4 models use a training trick to slash their on-device memory footprint
Neo-X7/Neo-AI
: A fully offline
AI
assistant powered by
Ollama
. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
⚙️
AI Automation
Content type:
Code
github.com
·
6h
6 hours ago
·
DEV
Actions for Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
I stopped fighting
LM
Studio
's model UI and switched to
Ollama
— setup took minutes instead of hours
⚙️
LLM Fine-tuning
makeuseof.com
·
2d
2 days ago
Actions for I stopped fighting LM Studio's model UI and switched to Ollama — setup took minutes instead of hours
Get this Alexa+ premium feature for free in Home Assistant
📡
Zigbee
howtogeek.com
·
7h
7 hours ago
Actions for Get this Alexa+ premium feature for free in Home Assistant
Running
Ollama
on a 15W CPU sounded ridiculous until I got it working with decent results
🖥️
Self-Hosting
xda-developers.com
·
6d
6 days ago
Actions for Running Ollama on a 15W CPU sounded ridiculous until I got it working with decent results
How I benchmarked a 100%
local
RAG pipeline to 9/9 (zero API keys)
🔍
RAG
buy.polar.sh
·
2d
2 days ago
·
DEV
Actions for How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
⚡
AI Hardware
Content type:
Blog
towardsai.net
·
2d
2 days ago
Actions for Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
Less-relevant results
How to Fine-Tune an SLM for Emotion Recognition
⚙️
LLM Fine-tuning
towardsdatascience.com
·
5d
5 days ago
Actions for How to Fine-Tune an SLM for Emotion Recognition
Lenovo ThinkStation PGX review: I may have just found the best mini workstation for OpenClaw, and it’s not a Mac mini
💻
Laptops
techradar.com
·
1d
1 day ago
Actions for Lenovo ThinkStation PGX review: I may have just found the best mini workstation for OpenClaw, and it’s not a Mac mini
The smartest ChatGPT users are putting
local
AI
in front of it — here's why
✍️
Prompt Engineering
tomsguide.com
·
4d
4 days ago
Actions for The smartest ChatGPT users are putting local AI in front of it — here's why
I switched from
LM
Studio
to
llama.cpp
, and I'm never going back to a bloated wrapper
🌐
Open Source
howtogeek.com
·
2d
2 days ago
Actions for I switched from LM Studio to llama.cpp, and I'm never going back to a bloated wrapper
ComfyUI NVFP4 in 2026: 3 Faster Image Generation on RTX 50-Series (and the Right Format for RTX 40-Series)
🏠
Home Assistant
Content type:
Blog
dev.to
·
11h
11 hours ago
·
DEV
Actions for ComfyUI NVFP4 in 2026: 3 Faster Image Generation on RTX 50-Series (and the Right Format for RTX 40-Series)
I got a Crush on this new Terminal-based
AI
coding tool
🔌
APIs
xda-developers.com
·
22h
22 hours ago
Actions for I got a Crush on this new Terminal-based AI coding tool
Google’s latest
on-device
AI
model is custom-made for your laptop
🤖
Android
androidauthority.com
·
6d
6 days ago
Actions for Google’s latest on-device AI model is custom-made for your laptop
My First $5,000 Month Writing About
AI
Engineering on Medium
🤖
AI Agents
Content type:
Blog
towardsai.net
·
2d
2 days ago
Actions for My First $5,000 Month Writing About AI Engineering on Medium
shoo99/paper-rag: A
private
,
fully-local
RAG over your own PDFs: BGE-M3 + embedded Qdrant + a
local
LLM
via Ollama. ~150 lines, nothing leaves your machine.
🔍
RAG
Content type:
Code
github.com
·
4d
4 days ago
·
DEV
Actions for shoo99/paper-rag: A private, fully-local RAG over your own PDFs: BGE-M3 + embedded Qdrant + a local LLM via Ollama. ~150 lines, nothing leaves your machine.
This cloud workspace gives your laptop the GPU it never had
🤖
AI Agents
makeuseof.com
·
1d
1 day ago
Actions for This cloud workspace gives your laptop the GPU it never had
I used ChatGPT and Gemini side-by-side for a month on Android, and only one behaved like a senior
AI
tool
🤖
Android
androidpolice.com
·
3d
3 days ago
Actions for I used ChatGPT and Gemini side-by-side for a month on Android, and only one behaved like a senior AI tool
Doubling Qwen3.6-27B on One RTX 3090:
ollama
llama.cpp
+ MTP, Lever by Lever (35.7 80.2 tok/s)
⚡
AI Hardware
Content type:
Blog
dev.to
·
1d
1 day ago
·
DEV
Actions for Doubling Qwen3.6-27B on One RTX 3090: ollama llama.cpp + MTP, Lever by Lever (35.7 80.2 tok/s)
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help