🦙 llama.cpp - anarcher · Scour

Why and How to Run Local Models in Zed 🤖LLM Inference

Less-relevant results

Why I Invested ₹5 Lakhs in an M5 Max (64GB) Instead of Real Estate: An Architect’s Bet on On-Device AI and Global Freedom 🤖LLM Inference

whatsapp.com·23h·DEV

ROCm 7 on Strix Halo: Benchmarking the New Toolbox Images 🤖LLM Inference

sleepingrobots.com·4d

Qwen 3.7 Preview 🤖LLM Inference

news.ycombinator.com·2d·Hacker News

Surprising things I learned putting together a Home Brain 🤖LLM Inference

bitworking.org·3d·Hacker News

tokenspeed — feel LLM tokens-per-second 🤖LLM Inference

mikeveerman.github.io·1h

Lab notebook: Edit completion #1 ⚙️Zig

randomhacks.net·4d

My Zerto Docs MCP Server: Ask Claude (or Copilot, or Cursor) Real Questions 💾SQLite

Can You Run LLMs Locally Without a GPU? I Tested 8 Models on Linux 🤖LLM Inference

itsfoss.com·5d·Hacker News

Self-Hosted AI for Telegram/WhatsApp/Discord via Ollama, Zero Cloud 🤖LLM Inference

crustaidocs.netlify.app·1d·Hacker News

The Ultimate LLM Fine-Tuning Guide 🤖LLM Inference

promptinjection.net·3d·Hacker News

I built Mofakir: A native, local AI desktop assistant for Linux that actually interacts with your system 🤖LLM Inference

github.com·6h·r/linux

Ollama on Mac: Setup and Optimization Guide (2026) 🧠Memory Allocators

insiderllm.com·4d

Capturing ideas with voice, local LLMs, and obsidian ⚙️Zig

aidenredmondd.substack.com·2d·Substack

VectraYX-Nano: A 42M-Parameter Spanish Cybersecurity Language Model with Curriculum Learning and Native Tool Use 🤖LLM Inference

AMD just dropped a compact AI workstation that makes discrete GPUs look outdated for running LLMs 🤖LLM Inference

xda-developers.com·5h

Driving DeepSeek V4 Flash on your own Mac 🧠Memory Allocators

pi.audreyt.org·3d

Forensics First. AI Second. 🤖AI

brettshavers.com·2d

froggeric/Qwen3.6-27B-MTP-GGUF 🤖LLM Inference

huggingface.co·3d·DEV

michelangeloromerochisco/ternative: Inference engine for ternary-weight LLMs with runtime LoRA - the llama.cpp of BitNet models 🤖LLM Inference

github.com·1d·Hacker News
