⚡ Fast AI Inference - emschwartz · Scour

Llama.cpp now has an official website: llama.app 🤖AI

llama.app·6d·Hacker News

Build Personal AI Agents on Windows PCs with New Tools from Microsoft and Nvidia 🤖AI Blog

developer.nvidia.com·2d·Hacker News

Dell's AI Surge, SpaceX Spending, & SoftBanks's New Play 🔬AI Labs

briefing.forwardfuture.ai·3d

How to Run Gemma 4 12B Locally - The Best AI For Consumer Laptops 🤖AI Video

youtube.com·1d

Location: Oslo, Norway (CET) Remote: Yes (EU/US time zones) Willing to relocate:... 🦀Rust Discussion

news.ycombinator.com·3d·Hacker News

paralleliq/piqc: Kubernetes scanner that discovers LLMs running on vLLM and extracts their deployment and runtime facts. 🏗️LLM Infrastructure Code

github.com·2d·Hacker News

We have built the first of it's kind interactive blog for matching open-source LLMs to GPUs. 🤖AI Blog

agentswarms.fyi·2d·r/ChatGPT, r/OpenAI

Micron Powers AI Everywhere at COMPUTEX 2026 🤖AI

cdrinfo.com·3d

GGUF vs MLX: A Decision Guide, Not Another Benchmark 🤖AI

muhammadraza.me·2d

[AINews] not much happened today 🤖AI News

latent.space·4h

Multi-Lora-Continual-Learning 📅Resource Scheduling

trajectory.ai·6d·Hacker News

Holo3.1: Fast & Local Computer Use Agents 🤖AI Blog

huggingface.co·2d

Lodestar: An Online-Learning LLM Inference Router 🏗️LLM Infrastructure Academic

Show HN: We built an LLM inference engine in pure Python 🏗️LLM Infrastructure Code

github.com·2d·Hacker News

A Sovereign Brain on a Laptop: Local LLM + Pi Agent + Markdown 🏠Self-Hosting

sovereignbrain.me·3d

Google makes Gemma 4 12B a local AI bet for startups 🆕New AI

startupfortune.com·1d

Location: Göttingen, Germany Remote: Yes (preferred; hybrid also fine) Willing t... 🤖AI Discussion

news.ycombinator.com·1d·Hacker News

Nemotron 3 Ultra announced: high-speed, leading US open weights intelligence 🆕New AI

artificialanalysis.ai·4d·Hacker News

Part 2 — Serve-Level Speed: System Design That Stabilizes P95/P99 🧠LLM Inference

towardsai.net·1d

Experience with "nvidia/LocateAnything-3B" 🤖AI

huggingface.co·6d·r/LocalLLaMA

Sign up or log in to see more results

Log in to enable infinite scrolling