Local LLMs

Feeds to Scour
SubscribedAll
Scoured 440 posts in 13.8 ms

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

 🤗Open Source AI  Content type: Code
github.com··DEV

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

 🟢NVIDIA

What's in the Box? A Field Guide to AI Models

 🧠LLMs  Content type: Blog
iankduncan.com·

Optimal Post-Training Quantization Scales and Where to Find Them

 🧠LLMs  Content type: Academic
arxiv.org·

The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has

 🤗Open Source AI
xda-developers.com·

lightmetal: GPU LLM Inference From a Single Java 25 JAR

 🤗Open Source AI  Content type: Blog
adambien.blog·

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 🤗Open Source AI  Content type: News  Content type: Blog
blog.google··Hacker News
Less-relevant results

On-device AI is a margin decision

 🤗Open Source AI  Content type: Blog
ziraph.com··Hacker News

"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY

 🤗Open Source AI  Content type: News  Content type: Blog

LM Studio now lets you use your iPhone to talk to local models on your Mac

 🤗Open Source AI
9to5mac.com··r/apple

NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet

 🌐Web Dev

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

 🧠LLMs  Content type: News  Content type: Blog

Previewing nAnalyst, the layer that finally explains your network

 🤖AI Coding
ntop.org·

Using local LLMs for agentic coding

 🤗Open Source AI  Content type: Blog
blog.alexewerlof.com·

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

 🤗Open Source AI  Content type: Blog
towardsai.net·

martidu4/honey-ai: 🍯 All-in-one AI honeypot powered by local LLMs. SSH, HTTP, FTP, Telnet, SMTP, MySQL, Redis, Git, VNC, RDP — with canary tokens, tarpits, GZIP bombs, and threat intel reporting.

 🤗Open Source AI  Content type: Code
github.com··Hacker News

WWDC 2026: Foundation Models (& Anarlog)

 🤗Open Source AI
skushagra.com·

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

 🤗Open Source AI
androidauthority.com·

Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change

 🤗Open Source AI  Content type: News  Content type: Blog

Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB

 🤗Open Source AI  Content type: Blog
ziraph.com··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help