Local LLMs

Feeds to Scour
SubscribedAll
Scoured 132 posts in 13.9 ms

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

 🤗Open Source AI

Qwen 3.6 27B AutoRound GGUF, need your feedback

 🧠LLMs

What Ollama Reveals About Local AI, Agents, and Open Models

 🤗Open Source AI  Content type: Blog
odsc.medium.com·

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 🤗Open Source AI  Content type: Code
github.com··Hacker News

Integrate on-device AI models into your app using Core AI - WWDC26 - Videos

 🎵Vibe Coding

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

 🧠LLMs

Fixing a stuck Ollama runner and building a GPU watchdog

 🤗Open Source AI

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

 🟢NVIDIA

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

 🟢NVIDIA  Content type: News  Content type: Blog
developer.nvidia.com·
Less-relevant results

On-device AI is a margin decision

 🤗Open Source AI  Content type: Blog
ziraph.com··Hacker News

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 🤗Open Source AI  Content type: News  Content type: Blog
blog.google··Hacker News

Running Two LLMs on a Mini PC Sounds Great Until the Benchmarks Arrive

 🤗Open Source AI
hackernoon.com·

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

 🧠LLMs  Content type: News  Content type: Blog

Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change

 🤗Open Source AI  Content type: News  Content type: Blog

A system programmer’s guide to LLM inference

 🤗Open Source AI  Content type: Blog

Run (your largest) local models from your iPhone

 🧠LLMs  Content type: Blog

martidu4/honey-ai: 🍯 All-in-one AI honeypot powered by local LLMs. SSH, HTTP, FTP, Telnet, SMTP, MySQL, Redis, Git, VNC, RDP — with canary tokens, tarpits, GZIP bombs, and threat intel reporting.

 🤗Open Source AI  Content type: Code
github.com··Hacker News

Purpose-built local AI agents

 ✍️Prompt Engineering  Content type: Blog

DeskDash - a free Windows tool to easily manage your GGUF files

 💻Code Generation

Aspen: Own your intelligence

 🤗Open Source AI  Content type: Discussion  Content type: Tutorial

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help