Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🏠 Local LLM Deployment
GitHub
·
1d
1 day ago
Running a 35B MoE
model
on a 2017 AMD RX 580 8GB via Vulkan (no
ROCm/CUDA
)
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Running a 35B MoE model on a 2017 AMD RX 580 8GB via Vulkan (no ROCm/CUDA)
unsloth.ai
·
2d
2 days ago
GLM-5.2 – How to Run
Locally
Covers
2 stories
See all stories this covers
including
GitHub here . You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Covered by
news.smol.ai
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for GLM-5.2 – How to Run Locally
huggingface.co
·
4d
4 days ago
Cheapest way to run GLM 5.x
locally
that's not a unified memory system?
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Cheapest way to run GLM 5.x locally that's not a unified memory system?
pypi.org
·
6d
6 days ago
Show HN: Subagent-fleet –
AI
coding subagents across
local
Ollama
machines
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Subagent-fleet – AI coding subagents across local Ollama machines
Vicki Boykis
·
6d
6 days ago
Running
local
models
is good now
Covers
9 stories
See all stories this covers
including
Pi.dev: There are many coding agents, but this one is mine
Covered by
10 sources
See all sources covering this story
including
Simon Willison's Newsletter
,
lemmy.ml
Discussed on
Hacker News
,
Hacker News
, and
Lobsters
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Running local models is good now
hackernoon.com
·
3d
3 days ago
Quantization
Is Quietly Eating the
AI
Hardware Business. Where Next?
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Quantization Is Quietly Eating the AI Hardware Business. Where Next?
konxios.com
·
2d
2 days ago
Show HN: Konxios a
local
first
AI
OS that connects
LM
Studio, Ollama and cloud
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Konxios a local first AI OS that connects LM Studio, Ollama and cloud
vettedconsumer.com
·
6d
6 days ago
The KV Cache, Explained: Why Long Context Eats Your
VRAM
(and How to Fit More)
Covers
2 stories
See all stories this covers
including
Efficient Memory Management for Large Language Model Serving with PagedAttention
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The KV Cache, Explained: Why Long Context Eats Your VRAM (and How to Fit More)
XDA
·
3d
3 days ago
I tested Google's new Gemma 4 12B on my 8GB
GPU
, and now I don't want to go back to smaller
models
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for I tested Google's new Gemma 4 12B on my 8GB GPU, and now I don't want to go back to smaller models
devashish.me
·
5d
5 days ago
Two Qwen3
models
on one DGX Spark: the residency math
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Two Qwen3 models on one DGX Spark: the residency math
lymegrove.com
·
1d
1 day ago
Jobflo – A
local-first
job tracker built with SwiftUI
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Jobflo – A local-first job tracker built with SwiftUI
zarldev.github.io
·
4d
4 days ago
Show HN: Zkit – Go libraries for building agents, not a framework
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Zkit – Go libraries for building agents, not a framework
docs.everruns.com
·
4d
4 days ago
Feature reach agent harness in Rust
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Feature reach agent harness in Rust
GitHub
·
6d
6 days ago
ahwurm/localharness
:
Model-agnostic
agent harness for local LLMs — configure agents in YAML and run them on your own hardware (vLLM,
Ollama
, LM Studio, llama.cpp).
Covers
uv
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for ahwurm/localharness: Model-agnostic agent harness for local LLMs — configure agents in YAML and run them on your own hardware (vLLM, Ollama, LM Studio, llama.cpp).
speechmark.co
·
2d
2 days ago
On-device
meeting notes for Mac (no bot, no cloud)
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for On-device meeting notes for Mac (no bot, no cloud)
huggingface.co
·
5d
5 days ago
NovaVest/VN-Noxa-v1-7B-Beta-Low
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for NovaVest/VN-Noxa-v1-7B-Beta-Low
hackernoon.com
·
4d
4 days ago
Local
LLMs Need More Than OpenAI-Compatible Endpoints
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Local LLMs Need More Than OpenAI-Compatible Endpoints
projecthuginn.com
·
4d
4 days ago
cheaper
AI
training on idle GPUs
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for cheaper AI training on idle GPUs
tablething.com
·
4d
4 days ago
local-first
database client with BYOK
AI
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for local-first database client with BYOK AI
dirge-code.github.io
·
4d
4 days ago
Dirge: Rust coding agent with intelligent steering and memory
Covers
5 stories
See all stories this covers
including
XiaomiMiMo/MiMo-Code: MiMo Code: Where Models and Agents Co-Evolve
Covered by
yogthos.net
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Dirge: Rust coding agent with intelligent steering and memory
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report