Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
lm studio
🤖 lm studio
Specific
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
106
posts in
6.4
ms
147th airhacks tv:
Local
LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens
🃏
Card Layout
Content type:
Blog
adambien.blog
·
1d
1 day ago
Actions for 147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens
MoQ
GGUFs
and GSQ: Low-Bit
GGUFs
Are About to Get Much Better
🎰
Procedural Generation
Content type:
News
Content type:
Blog
kaitchup.substack.com
·
5d
5 days ago
·
r/LocalLLaMA
Actions for MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better
Less-relevant results
I wired a fully offline voice loop to Ollama +
LM
Studio
— 100% CPU, no GPU, nothing leaves your machine (Silero VAD + Parakeet STT + Supertonic TTS 3)
🎲
Playtesting
Content type:
Code
github.com
·
13h
13 hours ago
·
r/LocalLLaMA
Actions for I wired a fully offline voice loop to Ollama + LM Studio — 100% CPU, no GPU, nothing leaves your machine (Silero VAD + Parakeet STT + Supertonic TTS 3)
Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
🧩
Heuristics
Content type:
Blog
dnhkng.github.io
·
2d
2 days ago
Actions for Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
🎰
Procedural Generation
local-llm.utop.workers.dev
·
4d
4 days ago
·
Hacker News
Actions for Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
Quality Is Not a Safety Proxy Under
Quantization
🎰
Procedural Generation
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Quality Is Not a Safety Proxy Under Quantization
DeskDash - a free Windows tool to easily manage your
GGUF
files
🗂️
Obsidian
gerry7.itch.io
·
4d
4 days ago
·
r/LocalLLaMA
Actions for DeskDash - a free Windows tool to easily manage your GGUF files
LM
Link launches on iPhone, bringing
local
AI model access to iOS devices
🎲
Playtesting
alternativeto.net
·
6d
6 days ago
Actions for LM Link launches on iPhone, bringing local AI model access to iOS devices
Apples to Apples: MLX vs.
Llama.cpp
for Gemma 4 12B on an M1 16GB
🎰
Procedural Generation
Content type:
Blog
ziraph.com
·
5d
5 days ago
·
Hacker News
Actions for Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB
Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive
llama.cpp
conversions suffer accuracy loss
🦀
rust
Content type:
News
digg.com
·
5d
5 days ago
Actions for Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss
Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support
🎰
Procedural Generation
alternativeto.net
·
3d
3 days ago
Actions for Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support
fix(memory): move
local
llama.cpp
runtime to provider plugin · openclaw/openclaw@3137110
🦀
rust
Content type:
Code
github.com
·
2d
2 days ago
Actions for fix(memory): move local llama.cpp runtime to provider plugin · openclaw/openclaw@3137110
Apple rebuilt its on-device AI stack at WWDC 2026
🎰
Procedural Generation
Content type:
Blog
ziraph.com
·
2d
2 days ago
·
Hacker News
Actions for Apple rebuilt its on-device AI stack at WWDC 2026
Show HN:
Ext-Infer
🦀
rust
infer.displace.tech
·
4d
4 days ago
·
Hacker News
Actions for Show HN: Ext-Infer
Gemma 4 12B: A unified, encoder-free multimodal model
🎰
Procedural Generation
Content type:
Discussion
news.ycombinator.com
·
4d
4 days ago
·
Hacker News
Actions for Gemma 4 12B: A unified, encoder-free multimodal model
KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for
llama.cpp
, fully measured on real hardware.
🎲
Playtesting
Content type:
Code
github.com
·
23h
23 hours ago
·
Hacker News
Actions for KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.
Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
🎰
Procedural Generation
Content type:
Blog
towardsai.net
·
3d
3 days ago
Actions for Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
When AI builds itself 👷, AI is not a line item 📝,
local
LLMs for agentic coding 🤖
🎰
Procedural Generation
tldr.tech
·
6d
6 days ago
Actions for When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖
Ideogram4
GGUF
is out!
✒️
Typography
huggingface.co
·
4d
4 days ago
·
r/StableDiffusion
Actions for Ideogram4 GGUF is out!
I got a Crush on this new Terminal-based AI coding tool
🎰
Procedural Generation
xda-developers.com
·
1d
1 day ago
Actions for I got a Crush on this new Terminal-based AI coding tool
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help