🦙 Ollama
Local LLM Server, Model Management, API Server, Inference Engine
Scoured 209 posts in 37.0 ms
💬 Prompt Engineering
huggingface.co · 5 days ago
Cheapest way to run GLM 5.x locally that's not a unified memory system?
Discussed on r/LocalLLaMA
💻 CLI Tools
unsloth.ai · 2 days ago
GLM-5.2 – How to Run Locally
Covers 2 stories, including: GitHub here. You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Covered by news.smol.ai
Discussed on Hacker News
🎨 Chroma
DEV Community · 1 hour ago
Build a Local RAG Chatbot in 30 Minutes with .NET 8, Ollama, and React
Covers Ollama
Discussed on DEV
🎨 Chroma
GitHub · 1 day ago
Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
Discussed on Hacker News
🔌 APIs
hackernoon.com · 5 days ago
Local LLMs Need More Than OpenAI-Compatible Endpoints
🏠 Home Automation
How-To Geek · 1 day ago
I built a bedside AI assistant that reads me the news without touching the cloud
💬 Prompt Engineering
XDA · 4 days ago
I replaced my entire browser extension stack with one local LLM, and I'm not going back
💬 Prompt Engineering
pypi.org · 6 days ago
Show HN: Subagent-fleet – AI coding subagents across local Ollama machines
Discussed on Hacker News
💬 Prompt Engineering
huggingface.co · 20 hours ago
bwen-14b - benthecarman model trained on benthecarman
💬 Prompt Engineering
jervi-writes.netlify.app · 6 days ago
The Free Claude Code - Run Claude Code with Any Model
Discussed on DEV
💬 Prompt Engineering
threadreaderapp.com · 3 days ago
Gemma 4 12B QAT (dense) achieves 1000+ tokens/sec prefill on 8GB VRAM with 120k context
📝 NLP
GitHub · 2 days ago
How I Architected a Multi-Provider Fallback for Local RAG
Discussed on DEV
💬 Prompt Engineering
Towards Data Science · 5 days ago
Run a Local LLM with OpenClaw on Your Mac Mini
🎛️ Microcontrollers
Arduino Blog · 4 days ago
Running local LLMs on the Arduino® UNO™ Q board: a practical guide
💬 Prompt Engineering
DEV Community · 1 day ago
Building Cost-Effective AI Workflows: Open Source + Paid Tools Done Right
Covers Building AI-powered applications in Laravel
Discussed on DEV
🏠 Self-hosting
GitHub · 4 hours ago
Show HN: Saar Agentic Orchestration Platform
Covers 3 stories, including: Open Router- A unified interface for LLMs
Covered by DEV Community
Discussed on Hacker News
💬 Prompt Engineering
XDA · 5 days ago
Local AI is more accessible than ever, but with one major GPU-sized caveat
⚡ Hardware Acceleration
GitHub · 1 day ago
Running a 35B MoE model on a 2017 AMD RX 580 8GB via Vulkan (no ROCm/CUDA)
Discussed on Hacker News
💬 Prompt Engineering
huggingface.co · 4 days ago
yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF
Covers: GitHub here. You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Covered by The Miners
🔬 Deep Learning
GitHub · 1 day ago
open-source Jarvis project
Discussed on r/LLM