Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Open Source LLMs
🔓 Open Source LLMs
Specific
open source LLM, Llama, Mistral, Ollama, local models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
234
posts in
7.7
ms
Gemma
4 QAT
models
: Optimizing model compression for mobile and laptop efficiency
🧠
LLMs
Content type:
News
Content type:
Blog
blog.google
·
4d
4 days ago
·
Hacker News
Actions for Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency
Doubling
Qwen3.6-27B
on One RTX 3090:
ollama
llama.cpp
+ MTP, Lever by Lever (35.7 80.2 tok/s)
🛠️
AI Tooling
Content type:
Blog
dev.to
·
1d
1 day ago
·
DEV
Actions for Doubling Qwen3.6-27B on One RTX 3090: ollama llama.cpp + MTP, Lever by Lever (35.7 80.2 tok/s)
Neo-X7/Neo-AI: A fully offline AI assistant powered by
Ollama
. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
🛠️
AI Tooling
Content type:
Code
github.com
·
1h
1 hour ago
·
DEV
Actions for Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
Integrate on-device AI
models
into your app using Core AI - WWDC26 - Videos
💡
AI
developer.apple.com
·
2d
2 days ago
·
Hacker News
Actions for Integrate on-device AI models into your app using Core AI - WWDC26 - Videos
Ask HN: Is it feasible to run a
model
on device for complete privacy?
🧠
LLMs
Content type:
Discussion
news.ycombinator.com
·
3d
3 days ago
·
Hacker News
Actions for Ask HN: Is it feasible to run a model on device for complete privacy?
Token4Token — pay-per-token inference on Gnosis + Swarm
🛠️
AI Tooling
t4t.eth.link
·
1d
1 day ago
·
Hacker News
Actions for Token4Token — pay-per-token inference on Gnosis + Swarm
Show HN: Audit any AI/data pairing with Veritrooper
🧠
LLMs
veritrooper.com
·
4d
4 days ago
·
Hacker News
Actions for Show HN: Audit any AI/data pairing with Veritrooper
Less-relevant results
Node.js Annual Releases, Terraform 1.15,
Gemma
4 Multimodal
✨
Vibe Coding
Content type:
Discussion
thedevsignal.com
·
1d
1 day ago
·
DEV
Actions for Node.js Annual Releases, Terraform 1.15, Gemma 4 Multimodal
No Cloud, No Cost: Build an Offline Visual AI Agent with
Gemma
4
🛠️
AI Tooling
Content type:
Blog
dev.to
·
11h
11 hours ago
·
DEV
Actions for No Cloud, No Cost: Build an Offline Visual AI Agent with Gemma 4
Introducing the Google Colab CLI
⚙️
Workflow Automation
Content type:
Blog
developers.googleblog.com
·
5d
5 days ago
Actions for Introducing the Google Colab CLI
local
AI agents for Cursor with pre-tuned marketplace/commu
🛠️
AI Tooling
locaible.com
·
54m
54 minutes ago
·
Hacker News
Actions for local AI agents for Cursor with pre-tuned marketplace/commu
Claude Now Writes 80% of Its Own Code — Anthropic's Self-Improvement Milestone Arrives Faster Than Expected
🔶
Claude
the-agent-report.com
·
1d
1 day ago
·
DEV
Actions for Claude Now Writes 80% of Its Own Code — Anthropic's Self-Improvement Milestone Arrives Faster Than Expected
Job Searcher
💡
AI
Content type:
Blog
huggingface.co
·
3d
3 days ago
Actions for Job Searcher
How I benchmarked a 100%
local
RAG pipeline to 9/9 (zero API keys)
📚
RAG
buy.polar.sh
·
1d
1 day ago
·
DEV
Actions for How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
Large companies can add a
local
LLM
filter layer to considerably reducing their AI costs
🧠
LLMs
umrashrf.github.io
·
4d
4 days ago
·
Hacker News
Actions for Large companies can add a local LLM filter layer to considerably reducing their AI costs
Project Log #2: The AI Phone Agent Has a Repo
🛠️
AI Tooling
Content type:
Blog
dev.to
·
23h
23 hours ago
·
DEV
Actions for Project Log #2: The AI Phone Agent Has a Repo
Introducing
Gemma
4 12B: a unified, encoder-free multimodal
model
💡
AI
Content type:
Blog
blog.google
·
6d
6 days ago
·
DEV
,
Hacker News
,
r/LocalLLaMA
Actions for Introducing Gemma 4 12B: a unified, encoder-free multimodal model
Purpose-built
local
AI agents
✍️
Prompt Engineering
Content type:
Blog
samihonkonen.com
·
1d
1 day ago
·
Hacker News
Actions for Purpose-built local AI agents
fix(memory-core): filter stale recall entries in REM harness preview ·
openclaw/openclaw
@92418fc
🛠️
AI Tooling
Content type:
Code
github.com
·
8h
8 hours ago
Actions for fix(memory-core): filter stale recall entries in REM harness preview · openclaw/openclaw@92418fc
ComfyUI NVFP4 in 2026: 3 Faster Image Generation on RTX 50-Series (and the Right Format for RTX 40-Series)
💡
AI
Content type:
Blog
dev.to
·
7h
7 hours ago
·
DEV
Actions for ComfyUI NVFP4 in 2026: 3 Faster Image Generation on RTX 50-Series (and the Right Format for RTX 40-Series)
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help