Local LLMs

Feeds to Scour
SubscribedAll
Scoured 416 posts in 7.2 ms

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 📝NLP  Content type: News  Content type: Blog
blog.google··Hacker News

The Inference Alpha: Maximizing Frontier Models on AMD

 🤖Transformers  Content type: Blog
digitalocean.com·

martidu4/honey-ai: 🍯 All-in-one AI honeypot powered by local LLMs. SSH, HTTP, FTP, Telnet, SMTP, MySQL, Redis, Git, VNC, RDP — with canary tokens, tarpits, GZIP bombs, and threat intel reporting.

 🤖Qwen  Content type: Code
github.com··Hacker News

Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM

 🧠OpenAI
everylocalai.com··DEV

local llm on laptop 780M GPU using llama + gemma 4 qat

 🤖LLMs  Content type: Blog
alper.bearblog.dev·

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

 🧠OpenAI  Content type: Blog
towardsai.net·

Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation

 🔥PyTorch  Content type: Academic
arxiv.org·

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

 🤖Agents  Content type: Blog
adambien.blog·

Tales of an Ollama Honeypot (Part 3): More Traffic, More Findings

 🧠OpenAI
posts.inthecyber.com·

Using Scikit-LLM with Open-Source LLMs

 🐍Python

HNSW vs LSH: How Elasticsearch hits 0.99 recall@10 at 15,000 QPS — and what it costs

 🔍RAG  Content type: Blog
elastic.co·

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

 🤖Transformers
androidauthority.com·

Purpose-built local AI agents

 🤖Agents  Content type: Blog

Trainable Smooth-Rotation Transforms with Learned Channel Scales for LLM Quantization

 🤖LLMs  Content type: Academic
arxiv.org·

I built an open-source persistent memory layer for AI coding agents

 💻Claude Code  Content type: Code

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

 📝NLP  Content type: News  Content type: Blog

NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet

 🔧Data Engineering

Using local LLMs for agentic coding

 🔬Deep Learning  Content type: Blog
blog.alexewerlof.com·
Less-relevant results

local AI agents for Cursor with pre-tuned marketplace/commu

 🤖Agents
locaible.com··Hacker News

What's in the Box? A Field Guide to AI Models

 📝NLP  Content type: Blog
iankduncan.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help