Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
💾 Prompt Caching
Context Reuse, KV Cache, Inference Optimization, Token Efficiency
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
20566
posts in
1.11
s
ProphetKV
: User-Query-Driven Selective
Recomputation
for Efficient KV Cache Reuse in Retrieval-Augmented Generation
arxiv.org
·
18h
⚡
Vectorized Execution
A Guide to
Effective
Prompt
Engineering
blog.bytebytego.com
·
7h
🪄
Prompt Engineering
Optimized
LLM Inference
Engines
rishirajacharya.com
·
9h
🏗️
LLM Infrastructure
Building a privacy-first,
EU-hosted
AI chat in Rust (
Leptos
)
limbochat.com
·
8h
·
Discuss:
Hacker News
📡
ESPHome
nilpunch/massive-ecs
:
Bitset-based
ECS with rollbacks. C# library and Unity package.
github.com
·
21h
🗄
LiteFS
PROBE: Co-Balancing Computation and Communication in
MoE
Inference via Real-Time Predictive
Prefetching
arxiv.org
·
18h
🧠
LLM Inference
Lessons from
BF-Tree
: Building a
Concurrent
Larger-Than-Memory Index in Rust
zhihanz.github.io
·
2h
·
Discuss:
Hacker News
💨
Cache-Friendly Algorithms
Inference Energy
Consumption
Diagnosed
: LLM Tasks Show 25% Energy Differences
quantumzeitgeist.com
·
1d
🏗️
LLM Infrastructure
The
Heartbeat
of Tetris 🟥🟥🟥🟥: What a
1x1
Pixel Taught Me About Concurrency
qianarthurwang.substack.com
·
1d
·
Discuss:
r/programming
🔓
Lock-Free Structures
Threads
- A context strategy for
humans
and LLMs
blog.sao.dev
·
2d
🪄
Prompt Engineering
Building an RSS
Aggregator
with
Astro
raymondcamden.com
·
2d
📡
RSS
Super speed, super quality: lessons from the
Aptos
Network site launch—Martian Chronicles, Evil
Martians
’ team blog
evilmartians.com
·
23h
🔤
Tokenization
The Top 10 Best
Practices
for AI/BI
Dashboards
Performance Optimization (Part 2)
databricks.com
·
1h
⚡
SQL Optimization
Mekara
:
Workflows
as Code Proof-of-Concept
meksys-dev.github.io
·
19h
·
Discuss:
Hacker News
🪄
Prompt Engineering
Show HN:
Tabstack
Research – An API for verified web research (by
Mozilla
)
news.ycombinator.com
·
6h
·
Discuss:
Hacker News
🔍
Quickwit
The control
layer
for AI
blog.dottxt.ai
·
23h
🛡️
AI Security
Polling vs. Long Polling vs.
SSE
vs. WebSockets vs.
Webhooks
blog.algomaster.io
·
1d
📡
Network Latency
The Art of Being
Lazy
(log): Lower latency and Higher Availability With Delayed
Sequencing
warpstream.com
·
3h
·
Discuss:
Hacker News
💫
IO_uring
Training a Small Language Model
elijahpotter.dev
·
1d
🔤
Tokenization
AI Safety at the
Frontier
: Paper Highlights of January 2026
lesswrong.com
·
1d
🛡️
AI Safety
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help