🤖 AI - tionis · Scour

Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent

🐧unix Blog

dnhkng.github.io·

Large companies can add a local LLM filter layer to considerably reducing their AI costs

🧱data structures

umrashrf.github.io··Hacker News

Purpose-built local AI agents

🧩lisp Blog

samihonkonen.com··Hacker News

[AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo

🕸️graphs News

·

When AI builds itself 👷, AI is not a line item 📝, local LLMs for agentic coding 🤖

🧱data structures

Here's a llama.cpp CLI Command builder.

llamabuilding.com··r/LocalLLaMA

I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

🧱data structures

saintlex.sbs··DEV

Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss

🧱data structures News

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

🧱data structures

phoronix.com··r/artificial

Tokiyo Ooto & Orhythmoが新作アルバム『Play Nursery Rhymes and Children's Songs (童謡と子供の歌を唄う)』をEM Recordsよりリリース | AVE | CORNER PRINTING

ave-cornerprinting.com·

A system programmer’s guide to LLM inference

🧱data structures Blog

blog.xiangpeng.systems··Hacker News

Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change

🧱data structures News Blog

andreaborio.substack.com··Substack

Running LLM Inference on Kubernetes: What It Actually Takes

🧱data structures Blog

fairwinds.com·

Why Spring Teams Don’t Need a Second Runtime for AI Agents

🧱data structures

Local AI agents on Arduino UNO Q

🧱data structures Blog

blog.arduino.cc·

MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better

🧱data structures News Blog

kaitchup.substack.com··r/LocalLLaMA

Cyber Triage 3.18: New AI + Cloud Automation Capabilities

🧱data structures Blog Tutorial

cybertriage.com·

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

🧱data structures

androidauthority.com·

How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)

🧱data structures

buy.polar.sh··DEV

Qwen 3.6 27B AutoRound GGUF, need your feedback

🧱data structures

huggingface.co··r/LocalLLaMA

Sign up or log in to see more results

Log in to enable infinite scrolling