KV Cache


detects when ML research consensus is shifting using Bayesian CUSUM

 🔢Vector DBs
tattvaai.org · Hacker News

LLM Inference Guide: Temperature, KV Cache & Speed

 🧠LLM Inference  Content type: Blog
medium.com
Less-relevant results

Sors: a Rust proxy that reorders prompts to maximize vLLM prefix cache hits

 🧠LLM Inference  Content type: Code
github.com · Hacker News

DiffusionGemma: Discrete diffusion in a large language model

 🧠LLM Inference

Most people use Ollama or llama.cpp for local LLMs, but these are the tools I switch to when it gets serious

 🧠LLM Inference

vLLM Internalised: The Mechanics of Modern LLM Inference

 🧠LLM Inference  Content type: Blog
medium.com

Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit

 🧠LLM Inference

AnchorKV: Safety-Aware KV Cache Compression via Soft Penalty with a Refusal Anchor

 🧠LLM Inference  Content type: Academic
arxiv.org

Friday Five — June 12, 2026

 🧠LLM Inference
redhat.com

Running local LLMs on the Arduino® UNO™ Q board: a practical guide

 💬LLMs  Content type: Blog
blog.arduino.cc

China’s DeepSeek reportedly raises $7.4B in funding at $50B+ valuation

 🤖AI Agents

Why Transformer Models Get Costlier as Context Grows

 💬LLMs
siliconopera.com

New comment by Greenpants in "Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?"

 💬LLMs  Content type: Discussion

How Public AI delivers sovereign LLM inference on AWS and Intel

 🧠LLM Inference  Content type: Blog

Cosmicgpt – A GPT-in-space simulator to research SpaceX AI satellite viability

 💬LLMs  Content type: Code
github.com · Hacker News

ReMP: Low-Downtime Runtime Model-Parallelism Reconfiguration for LLM Serving

 🌐Distributed Systems  Content type: Academic
arxiv.org

Free LLM APIs Compared: Rate Limits, Models, and Real Costs (2026)

 📄ML Papers  Content type: Blog  Content type: Discussion
openrouter.ai · Covers 6 stories
