🦙 Ollama - eaksquad · Scour

Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB

🤖Transformers Blog

ziraph.com··Hacker News

Less-relevant results

How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)

buy.polar.sh··DEV

Tales of an Ollama Honeypot (Part 3): More Traffic, More Findings

🤖n8n, automation, AI agents, Gemini, Claude, openrouter, grok, chatgpt

posts.inthecyber.com·

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

🗄️Vector Databases Code

github.com··DEV

How to Set Up Codebase Indexing in Kilo Code

🗄️Vector Databases News Blog

The week AI infrastructure crossed from a technology story to a financial one

🤖Automation News

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

🤖Transformers Blog

towardsai.net·

Token4Token — pay-per-token inference on Gnosis + Swarm

🤖Transformers

t4t.eth.link··Hacker News

On-device AI is a margin decision

🤖Transformers Blog

ziraph.com··Hacker News

🤖 AI Agents Weekly: Microsoft's Seven MAI Models, Gemma 4 12B, NVIDIA Nemotron 3 Ultra, Agents' Last Exam, Devin Desktop, and More

🤖n8n, automation, AI agents, Gemini, Claude, openrouter, grok, chatgpt News

nlp.elvissaravia.com

·

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

🤖n8n, automation, AI agents, Gemini, Claude, openrouter, grok, chatgpt

codehamr.com··r/SideProject

RakuOS fixes the one thing that annoys me most about immutable Linux distros

🏠Self-hosting News

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

🤖Transformers

vettedconsumer.com··Hacker News

Omnifs: APIs and data sources as files you can ls, cat, grep, and pipe

📁File Systems

omnifs.dev··Hacker News

DiffusionGemma 26B A4B results on my 5090

🤖Transformers

huggingface.co··r/LocalLLaMA

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

🤖Transformers

phoronix.com··r/artificial

Large companies can add a local LLM filter layer to considerably reducing their AI costs

umrashrf.github.io··Hacker News

fix(agents): project thinking catalog compat · openclaw/openclaw@68ec783

🤖n8n, automation, AI agents, Gemini, Claude, openrouter, grok, chatgpt Code

local AI agents for Cursor with pre-tuned marketplace/commu

🎨Low-Code Platforms

locaible.com··Hacker News

When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖

🤖n8n, automation, AI agents, Gemini, Claude, openrouter, grok, chatgpt

Log in to enable infinite scrolling