🏠 Local LLM Deployment · Scour

🏠 Local LLM DeploymentSpecific

Model Optimization, GPU Acceleration, Inference, Privacy

Serving Large Language Models with a Minimalist Python CLI

Covers 2 stories including uv

Discussed on Hacker News

vettedconsumer.com·

GLM-5.2: The Most Powerful Open Model yet and the Brutal Reality of Running It

Covers 6 stories including zai-org/GLM-5.2 is here!

Covered by notes.dsebastien.net

Discussed on Hacker News

docs.mistral.ai·

Mistral AI Cookbooks

Covered by 3 sources including Mistral AI, VentureBeat

Show HN: Loqi, a "local-first" translation tool using Ollama/llama.cpp

Covers Ollama

Discussed on Hacker News

Valve Confirms Steam Machine Launches June 30 at $1,049 to $1,349 With Random Reservation Queue

Covered by kite.kagi.com

everything.one·

Everything*: An interactive voyage through all orders of magnitude

Covers Powers of Ten (1977) [video]

Discussed on Hacker News

We trained a real-time world model for $2k with Minecraft mod revenue

Discussed on Hacker News

DeepSeek V4 Flash optimized framework and model variants for DGX Spark

Covers Nvidia RTX Spark

Discussed on Hacker News

Run small local LLMs in browser 3x faster

Discussed on Hacker News

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

Covered by tldr.tech

AI coding: loop engineering a translator

Discussed on Hacker News

Steam Machine review

Covered by 7 sources including Kotaku, GamingOnLinux

Discussed on Hacker News

autonomy-landing-page.vercel.app·

Show HN: Autonomy – Self-Harness/Self-Directed AI Agent Core Under Development

Discussed on Hacker News

How do I set the right llama.cpp parameters?

Covers JSON Schema

Covered by DEV Community, Alex Ewerlöf Notes

Discussed on r/LocalLLaMA

The Steam Machine Is An Iconoclastic Computer Born In Unforgiving Times

Covers 4 stories including Exclusive

Covered by 5 sources including Kotaku, The Verge

Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon

Discussed on Hacker News

Show HN: Evaluating Local LLMs as language translators for my app

Discussed on Hacker News

Open-source security auditors for Supabase, Strapi, Hasura and Ollama

Discussed on Hacker News

teachmecoolstuff.com·

Fine Tuning a Tiny Local LLM to Categorize Questions

Discussed on Hacker News and Hacker News

Martin Alderson·

Expert-aware quantisation: near-Q4 quality at near-Q2 size?

Discussed on Hacker News

Log in to enable infinite scrolling