Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
lm studio
🤖 lm studio
Specific
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
108
posts in
6.9
ms
I got a Crush on this new Terminal-based AI coding tool
🎰
Procedural Generation
xda-developers.com
·
1d
1 day ago
Actions for I got a Crush on this new Terminal-based AI coding tool
BeeLlama.cpp
DFlash on Strix Halo: 2.7x Gemma 31B, But MTP Is Still Faster
🎲
Playtesting
sleepingrobots.com
·
4d
4 days ago
Actions for BeeLlama.cpp DFlash on Strix Halo: 2.7x Gemma 31B, But MTP Is Still Faster
A system programmer’s guide to
LLM
inference
🎰
Procedural Generation
Content type:
Blog
blog.xiangpeng.systems
·
3d
3 days ago
·
Hacker News
Actions for A system programmer’s guide to LLM inference
FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model
🎰
Procedural Generation
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model
6. Air-Gapped Claude Code - The Claude Code SRE Handbook
🤖
claude code
har-ki.github.io
·
39m
39 minutes ago
·
Hacker News
Actions for 6. Air-Gapped Claude Code - The Claude Code SRE Handbook
KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for
llama.cpp
, fully measured on real hardware.
🎲
Playtesting
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.
Launch HN: General Instinct (YC P26) – Frontier models on edge devices
🎲
Tabletop Simulators
Content type:
Discussion
news.ycombinator.com
·
6d
6 days ago
·
Hacker News
Actions for Launch HN: General Instinct (YC P26) – Frontier models on edge devices
fix(
lmstudio
): preserve wizard prompter binding · openclaw/openclaw@22276e6
🗂️
Obsidian
Content type:
Code
github.com
·
4d
4 days ago
Actions for fix(lmstudio): preserve wizard prompter binding · openclaw/openclaw@22276e6
How to Make Your SMALL
Local
AI Models 10X SMARTER
🎲
Tabletop Simulators
Content type:
Video
youtube.com
·
4d
4 days ago
Actions for How to Make Your SMALL Local AI Models 10X SMARTER
google/gemma-4-12B-it-qat-q4_
0-gguf
🤖
claude code
huggingface.co
·
5d
5 days ago
Actions for google/gemma-4-12B-it-qat-q4_0-gguf
Remove padding and multiple D2D copies for MTP by gaugarg-nv · Pull Request #24086 ·
ggml-org/llama.cpp
🃏
Card Layout
Content type:
Code
github.com
·
23h
23 hours ago
·
r/LocalLLaMA
Actions for Remove padding and multiple D2D copies for MTP by gaugarg-nv · Pull Request #24086 · ggml-org/llama.cpp
[AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo
🎰
Procedural Generation
Content type:
News
latent.space
·
13h
13 hours ago
Actions for [AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo
[AINews] not much happened today
🗂️
Obsidian
Content type:
News
latent.space
·
5d
5 days ago
Actions for [AINews] not much happened today
Inside Out
🗂️
Obsidian
inkdroid.org
·
1d
1 day ago
Actions for Inside Out
alexziskind1/model-shelf: Model Shelf is a
local-first
model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best
local
path for
GGUF
, MLX, safetensors, Ollama, vLLM, and other
local
AI workflows.
🎲
Playtesting
Content type:
Code
github.com
·
6d
6 days ago
Actions for alexziskind1/model-shelf: Model Shelf is a local-first model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.
vla.cpp
: A Unified
Inference
Runtime for Vision-Language-Action Models
🎰
Procedural Generation
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for vla.cpp: A Unified Inference Runtime for Vision-Language-Action Models
stable-diffusion.cpp/docs/quantization
_and_
gguf.md
at master ·
leejet/stable-diffusion.cpp
🦀
rust
Content type:
Code
github.com
·
4d
4 days ago
·
r/StableDiffusion
Actions for stable-diffusion.cpp/docs/quantization_and_gguf.md at master · leejet/stable-diffusion.cpp
I tested
local
AI vs. ChatGPT side-by-side — here are the 7 biggest differences
🎰
Procedural Generation
tomsguide.com
·
4d
4 days ago
Actions for I tested local AI vs. ChatGPT side-by-side — here are the 7 biggest differences
fix(codex): avoid guardian review for
local
models (#88630) · openclaw/openclaw@b4cdd92
🗂️
Obsidian
Content type:
Code
github.com
·
1d
1 day ago
Actions for fix(codex): avoid guardian review for local models (#88630) · openclaw/openclaw@b4cdd92
The smartest ChatGPT users are putting
local
AI in front of it — here's why
🎰
Procedural Generation
tomsguide.com
·
5d
5 days ago
Actions for The smartest ChatGPT users are putting local AI in front of it — here's why
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help