lm studio

Feeds to Scour
SubscribedAll
Scoured 108 posts in 6.9 ms

I got a Crush on this new Terminal-based AI coding tool

 🎰Procedural Generation
xda-developers.com·

BeeLlama.cpp DFlash on Strix Halo: 2.7x Gemma 31B, But MTP Is Still Faster

 🎲Playtesting
sleepingrobots.com·

A system programmer’s guide to LLM inference

 🎰Procedural Generation  Content type: Blog

FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model

 🎰Procedural Generation  Content type: Academic
arxiv.org·

6. Air-Gapped Claude Code - The Claude Code SRE Handbook

 🤖claude code
har-ki.github.io··Hacker News

KaiFelixBennett/gemma4-turboquant-rdna4: Run Gemma-4-31B at full 256K context on a $1,400 AMD RDNA4 GPU (gfx1201): TurboQuant KV cache + HIP-graph-safe Flash-Attention for llama.cpp, fully measured on real hardware.

 🎲Playtesting  Content type: Code
github.com··Hacker News

Launch HN: General Instinct (YC P26) – Frontier models on edge devices

 🎲Tabletop Simulators  Content type: Discussion

fix(lmstudio): preserve wizard prompter binding · openclaw/openclaw@22276e6

 🗂️Obsidian  Content type: Code
github.com·

How to Make Your SMALL Local AI Models 10X SMARTER

 🎲Tabletop Simulators  Content type: Video
youtube.com·

google/gemma-4-12B-it-qat-q4_0-gguf

 🤖claude code
huggingface.co·

Remove padding and multiple D2D copies for MTP by gaugarg-nv · Pull Request #24086 · ggml-org/llama.cpp

 🃏Card Layout  Content type: Code
github.com··r/LocalLLaMA

[AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo

 🎰Procedural Generation  Content type: News
latent.space
·

[AINews] not much happened today

 🗂️Obsidian  Content type: News
latent.space
·

Inside Out

 🗂️Obsidian
inkdroid.org·

alexziskind1/model-shelf: Model Shelf is a local-first model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.

 🎲Playtesting  Content type: Code
github.com·

vla.cpp: A Unified Inference Runtime for Vision-Language-Action Models

 🎰Procedural Generation  Content type: Academic
arxiv.org·

stable-diffusion.cpp/docs/quantization_and_gguf.md at master · leejet/stable-diffusion.cpp

 🦀rust  Content type: Code

I tested local AI vs. ChatGPT side-by-side — here are the 7 biggest differences

 🎰Procedural Generation
tomsguide.com
·

fix(codex): avoid guardian review for local models (#88630) · openclaw/openclaw@b4cdd92

 🗂️Obsidian  Content type: Code
github.com·

The smartest ChatGPT users are putting local AI in front of it — here's why

 🎰Procedural Generation
tomsguide.com
·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help