🖥️ Local AI - lhoughton684 · Scour

Improved performance and model support with GGUF

🦙Ollama Blog

A system programmer’s guide to LLM inference

🦙Ollama Blog

blog.xiangpeng.systems··Hacker News

Token4Token — pay-per-token inference on Gnosis + Swarm

t4t.eth.link··Hacker News

Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support

alternativeto.net·

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

🤖AI Blog

adambien.blog·

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

vettedconsumer.com··Hacker News

Joint Structural Pruning and Mixed-Precision Quantization for LLM Compression

💬LLMs Academic

I built an open-source persistent memory layer for AI coding agents

🤖AI Code

github.com··r/GithubCopilot

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

deemwar-products.github.io··Hacker News

iOS 27's most advanced on-device AI needs 12GB of RAM – and most iPhones don't have it

🦾AI Agents News

NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet

huggingface.co··Hacker News

Unsloth Gemma 4 QAT

I added this open-source tool to my local AI stack, and my local LLM finally has persistent memory

xda-developers.com·

Fixing a stuck Ollama runner and building a GPU watchdog

patrickmccanna.net··Hacker News

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

🦙Ollama News Blog

blog.google··Hacker News

Apple's most advanced on-device AI features will only work on select devices

🤖AI News

DeskDash - a free Windows tool to easily manage your GGUF files

gerry7.itch.io··r/LocalLLaMA

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

🤖AI Blog

towardsai.net·

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

androidauthority.com·

On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.

venturebeat.com·

Log in to enable infinite scrolling