TIL: For long-lived LLM sessions, swapping KV Cache to RAM is ~10x faster than recalculating it. Why isn't this a standard feature?
Local LLM
AI Models Write Code with Security Flaws 18–50% of the Time, New Study Finds
Prometheus
Qwen3 VL 30b a3b is pure love
Self-Hosting
Inline vs. Pipeline Ray Tracing
Gleam
Improving in chess is hard. I built the world's most accurate human-like chess AI to help me.
Gleam
What is vibe coding? AI writes the code so developers can think big
infoworld.com · 21h
ArgoCD
I Processed the Internet on a Single Machine to Find Valuable Expired Domains
Prometheus
Thought Engineering
Prometheus
Introducing Agent-o-rama: build, trace, evaluate, and monitor stateful LLM agents in Java or Clojure
Prometheus
Engineering a Trillion-Parameter Architecture on Consumer Hardware
hackernoon.com · 2d
Self-Hosting
Weekly AI Startup Funding: October 26 - November 1, 2025
hackernoon.com · 8h
Prometheus
How Well Does RL Scale?
Prometheus