Local AI

I switched from LM Studio to llama.cpp, and I'm never going back to a bloated wrapper

 💻Local LLMs
howtogeek.com·

Doubling Qwen3.6-27B on One RTX 3090: ollama → llama.cpp + MTP, Lever by Lever (35.7 → 80.2 tok/s)

 💻Local LLMs  Content type: Blog
dev.to··DEV

alexziskind1/model-shelf: Model Shelf is a local-first model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.

 🔓Open Source AI  Content type: Code
github.com·

Tales of an Ollama Honeypot (Part 3): More Traffic, More Findings

 🍯Deception Technology

LM Studio now lets you use your iPhone to talk to local models on your Mac

 💻Local LLMs
9to5mac.com··r/apple

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 🔓Open Source AI  Content type: News  Content type: Blog
blog.google··Hacker News

I built a fully local AI coding assistant in Windows with Ollama and VS Code

 🧠LLM Tooling
howtogeek.com·

How to Tune llama.cpp --n-gpu-layers: A Practical VRAM Guide (2026)

 🟩Nvidia  Content type: Blog
dev.to··DEV

Google Gemma 4 12B: Architecture, Benchmarks, Access, and Hands-on Guide for Developers

 🔓Open Source AI  Content type: Blog
analyticsvidhya.com·

zaydmulani09/mnemo: Local-first AI memory layer for any LLM. Persistent knowledge graph, entity extraction, semantic retrieval. Works with Ollama, OpenAI, Anthropic, or any OpenAI-compatible backend.

 🦙Ollama  Content type: Code
github.com··Hacker News

Open-LLM-VTuber Review: Offline AI Companion with Live2D

 🧠LLM  Content type: Blog
dev.to··DEV

shoo99/paper-rag: A private, fully-local RAG over your own PDFs: BGE-M3 + embedded Qdrant + a local LLM via Ollama. ~150 lines, nothing leaves your machine.

 🤖Large Language Models  Content type: Code
github.com··DEV

Running a Local AI Engineering Agent with deepstrain: A Step-by-Step Tutorial

 🧠LLM Tooling  Content type: Blog
dev.to··DEV

sancheznot/Godot-AI-Assistant: Golem-AI is an AI-powered editor assistant for Godot 4. Chat with local or cloud models (Ollama, LM Studio, OpenAI, Anthropic, Gemini, Cursor) directly from an editor dock.

 ♟️Game Theory  Content type: Code
github.com··DEV

Run Coding Agents on Local AI — Zero Cloud, Full Control

 🧠LLM Tooling  Content type: Blog
dev.to··DEV

How to Tune --n-gpu-layers for Your VRAM Budget

 📊Compute Markets  Content type: Blog
dev.to··DEV

Running AI Locally: Skip the API Bills and Build Faster

 💻Local LLMs  Content type: Blog
dev.to··DEV

I built a self-hosted AI workspace for macOS — meet Odysee

 🧠LLM Tooling  Content type: Blog
dev.to··DEV

Run Gemma-4 12B on WSL2 with llama.cpp

 🔓Open Source AI  Content type: Blog
dev.to··DEV

Built an AI-Powered Spring Boot Log Analyzer Using RAG + Ollama

 🤖Large Language Models  Content type: Blog
dev.to··DEV
