Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
💾 Prompt Caching
Context Reuse, KV Cache, Inference Optimization, Token Efficiency
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
32928
posts in
10.6
ms
🔗 Least
recently
used
cache
yellowduck.be
·
1h
🔮
Prefetching
yuechen-li-dev/GenerativeCompressionProtocol
: The first model-native prompt compression protocol
github.com
·
1d
💾
Binary Formats
fast-servers: an
interesting
pattern
geocar.sdf1.org
·
18h
·
Discuss:
Lobsters
🧵
Async
SideQuest
: Model-Driven
KV
Cache Management for Long-Horizon Agentic Reasoning
arxiv.org
·
2d
🧠
LLM Inference
DualPath
: Breaking the Storage
Bandwidth
Bottleneck in Agentic LLM Inference
mesuvash.github.io
·
1d
·
Discuss:
Hacker News
🏗️
LLM Infrastructure
NevaMind-AI/memU
: Memory for 24/7 proactive agents like openclaw (moltbot, clawdbot).
github.com
·
1d
💻
Coding Agents
Tools to generate realistic prompts help surprisingly little with
Petri
audit
realism
lesswrong.com
·
2h
🪄
Prompt Engineering
From
19k
to 4.2M events/SEC: story of a SQLite query
optimisation
mnt.io
·
2h
·
Discuss:
Hacker News
🗄️
libSQL
Optimal Heterogeneous Memory Configs for AI Tasks Under
Specified
Performance Metrics (Stanford,
UCSC
)
semiengineering.com
·
19m
🧠
Memory Hierarchy Design
Evaluating
Frameworks
for Mobile Performance
frontendmasters.com
·
18h
🚀
Web Performance
Dissecting
the CPU-memory relationship in garbage collection (
OpenJDK
26)
news.ycombinator.com
·
2d
·
Discuss:
Hacker News
🏊
Memory Pools
Building
Telco
Reasoning Models for Autonomous Networks with NVIDIA
NeMo
developer.nvidia.com
·
3h
💻
Coding Agents
RAC
:
Relation-Aware
Cache Replacement for Large Language Models
arxiv.org
·
3d
🧠
LLM Inference
The Weekly Challenge 362:
Spellbound
Echo
blog.firedrake.org
·
1h
📑
Inverted Indexes
Build Your Own Key-Value Storage Engine—Week 7
read.thecoder.cafe
·
2d
🌳
Data Structures
MicroGPT
Explained
Interactively
growingswe.com
·
10h
·
Discuss:
Hacker News
🔢
BitNet
MQTT
: The Protocol Behind Every Smart Device (
Golang
)
youtu.be
·
15h
·
Discuss:
r/golang
,
r/programming
🌐
Network Protocols
uCache
: A Customizable
Unikernel-based
IO Cache
usenix.org
·
4d
🎯
Data Locality
My AI development
stack
ultralinx.notion.site
·
3h
👨💻
AI Coding
Tokens and Context are a Modern Shopping
Cart
in an ‘AI
Supermarket
’
pub.towardsai.net
·
3d
🔤
Tokenization
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help