🤖 Local LLMs - samuelfastfinge · Scour

Improved performance and model support with GGUF

🍎Apple Blog

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

🍎Apple Code

github.com··Hacker News

A system programmer’s guide to LLM inference

🔊Screen Readers Blog

blog.xiangpeng.systems··Hacker News

Qwen 3.6 27B AutoRound GGUF, need your feedback

huggingface.co··r/LocalLLaMA

Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support

🔊Screen Readers

alternativeto.net·

Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB

🍎Apple Blog

ziraph.com··Hacker News

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

🍎Apple Blog

adambien.blog·

Fixing a stuck Ollama runner and building a GPU watchdog

🏠Self-hosting

patrickmccanna.net··Hacker News

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

🦋Akkoma Code

github.com··DEV

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

deemwar-products.github.io··Hacker News

I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why

🍎Apple News Tutorial

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

🔊Screen Readers Blog

towardsai.net·

Run local agentic AI on the Mac using MLX [video]

💻macOS Video

youtube.com··Hacker News

Using Scikit-LLM with Open-Source LLMs

machinelearningmastery.com·

I built an open-source persistent memory layer for AI coding agents

🏠Self-hosting Code

github.com··r/GithubCopilot

Distilling Safe LLM Systems via Soft Prompts for On Device Settings

🔊Screen Readers Academic

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

vettedconsumer.com··Hacker News

I added this open-source tool to my local AI stack, and my local LLM finally has persistent memory

xda-developers.com·

maziyarpanahi/openmed: open-source healthcare ai

🍎Apple Code

Using local LLMs for agentic coding

🍎Apple Blog

blog.alexewerlof.com·

Log in to enable infinite scrolling