Ollama


Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB

 🤖Transformers  Content type: Blog
ziraph.com··Hacker News
Less-relevant results

How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)

 📝NLP
buy.polar.sh··DEV

Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.

 🗄️Vector Databases  Content type: Code
github.com··DEV

How to Set Up Codebase Indexing in Kilo Code

 🗄️Vector Databases  Content type: News  Content type: Blog
blog.kilo.ai·

The week AI infrastructure crossed from a technology story to a financial one

 🤖Automation  Content type: News
mlwhiz.com·

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

 🤖Transformers  Content type: Blog
towardsai.net·

Token4Token — pay-per-token inference on Gnosis + Swarm

 🤖Transformers
t4t.eth.link··Hacker News

On-device AI is a margin decision

 🤖Transformers  Content type: Blog
ziraph.com··Hacker News

🤖 AI Agents Weekly: Microsoft's Seven MAI Models, Gemma 4 12B, NVIDIA Nemotron 3 Ultra, Agents' Last Exam, Devin Desktop, and More

 🤖n8n, automation, AI agents, Gemini, Claude, openrouter, grok, chatgpt  Content type: News
nlp.elvissaravia.com·

RakuOS fixes the one thing that annoys me most about immutable Linux distros

 🏠Self-hosting  Content type: News
zdnet.com·

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

 🤖Transformers

Omnifs: APIs and data sources as files you can ls, cat, grep, and pipe

 📁File Systems
omnifs.dev··Hacker News

DiffusionGemma 26B A4B results on my 5090

 🤖Transformers

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

 🤖Transformers
phoronix.com··r/artificial

Large companies can add a local LLM filter layer to considerably reduce their AI costs

 📝NLP

fix(agents): project thinking catalog compat · openclaw/openclaw@68ec783

 🤖n8n, automation, AI agents, Gemini, Claude, openrouter, grok, chatgpt  Content type: Code
github.com·

local AI agents for Cursor with pre-tuned marketplace/commu

 🎨Low-Code Platforms

When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖

 🤖n8n, automation, AI agents, Gemini, Claude, openrouter, grok, chatgpt
tldr.tech·
