🦙 Ollama
Specific
Local LLM Server, Model Management, API Server, Inference Engine
Scoured 207 posts in 33.9 ms
💬 Prompt Engineering · huggingface.co · 5 days ago
Cheapest way to run GLM 5.x locally that's not a unified memory system?
Discussed on: r/LocalLLaMA
💻 CLI Tools · unsloth.ai · 2 days ago
GLM-5.2 – How to Run Locally
Covers 2 stories, including: GitHub here. You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Covered by: news.smol.ai
Discussed on: Hacker News
🏠 Home Automation · How-To Geek · 19 hours ago
I built a bedside AI assistant that reads me the news without touching the cloud
🎨 Chroma · GitHub · 1 day ago
Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
Discussed on: Hacker News
🔌 APIs · hackernoon.com · 5 days ago
Local LLMs Need More Than OpenAI-Compatible Endpoints
💬 Prompt Engineering · DEV Community · 1 day ago
Building Cost-Effective AI Workflows: Open Source + Paid Tools Done Right
Covers: Building AI-powered applications in Laravel
Discussed on: DEV
💬 Prompt Engineering · XDA · 4 days ago
I replaced my entire browser extension stack with one local LLM, and I'm not going back
💬 Prompt Engineering · pypi.org · 6 days ago
Show HN: Subagent-fleet – AI coding subagents across local Ollama machines
Discussed on: Hacker News
💬 Prompt Engineering · huggingface.co · 16 hours ago
bwen-14b - benthecarman model trained on benthecarman
💬 Prompt Engineering · jervi-writes.netlify.app · 6 days ago
The Free Claude Code - Run Claude Code with Any Model
Discussed on: DEV
💬 Prompt Engineering · threadreaderapp.com · 3 days ago
Gemma 4 12B QAT (dense) achieves 1000+ tokens/sec prefill on 8GB VRAM with 120k context
📝 NLP · GitHub · 2 days ago
How I Architected a Multi-Provider Fallback for Local RAG
Discussed on: DEV
💬 Prompt Engineering · Towards Data Science · 5 days ago
Run a Local LLM with OpenClaw on Your Mac Mini
🎛️ Microcontrollers · Arduino Blog · 3 days ago
Running local LLMs on the Arduino® UNO™ Q board: a practical guide
⚡ Hardware Acceleration · GitHub · 1 day ago
Running a 35B MoE model on a 2017 AMD RX 580 8GB via Vulkan (no ROCm/CUDA)
Discussed on: Hacker News
💬 Prompt Engineering · XDA · 4 days ago
Local AI is more accessible than ever, but with one major GPU-sized caveat
🧪 Testing · DEV Community · 6 days ago
Grammarly costs $12/mo — a local LLM does it for free (Chrome + Ollama)
Covers: Ollama
Discussed on: DEV
🔬 Deep Learning · GitHub · 1 day ago
open-source Jarvis project
Discussed on: r/LLM
💬 Prompt Engineering · huggingface.co · 3 days ago
yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF
Covers: GitHub here. You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Covered by: The Miners
💬 Prompt Engineering · GitHub · 2 days ago
How do I set the right llama.cpp parameters?
Covers: JSON Schema
Covered by: DEV Community, Alex Ewerlöf Notes
Discussed on: r/LocalLLaMA