Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
⚡ Caching
cache invalidation, Redis, Memcached, cache eviction
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
232
posts in
15.2
ms
Building a basic
cache
with
SQLite
🗄️
Database Internals
alexwlchan.net
·
6d
·
Hacker News
When
benchmarks
go bad - what I learned from
measuring
performance wrong
⚡
Performance Engineering
hollycummins.com
·
2d
Performance
Predictability
in
Heterogeneous
Memory
🔁
Cache Coherence
danglingpointers.substack.com
·
19h
·
Substack
Study
compares
Rust and C languages for embedded
firmware
development
⚡
Low-Latency Systems
cnx-software.com
·
2h
SCION: Size-aware Policy Orchestration for
Nonstationary
Object
Caches
(Long Paper Version)
💾
Cache Optimization
arxiv.org
·
1d
My New
Ebook
(Free Download):
Quantization
for Modern AI Systems
📉
Embeddings Optimization
pawankjha.substack.com
·
3d
·
Substack
Embeddings
& Search
📐
Vector Search
yamsmemory.ai
·
2d
A post-quantum
cryptography
toolkit for
microcontrollers
🔐
Cryptography
blog.adafruit.com
·
1d
Quantum Error
Correction
Faces Another
Hurdle
🔐
Cryptography
link.aps.org
·
1d
Predictive
Multi-Tier Memory Management for
KV
Cache in Large-Scale GPU Inference
📦
CPU Caches
arxiv.org
·
5d
GhostServe
: A Lightweight
Checkpointing
System in the Shadow for Fault-Tolerant LLM Serving
🔁
Cache Coherence
arxiv.org
·
1d
A Semantic Quantum
Circuit
Cache for Scalable and Distributed
Quantum-Classical
Workflows
🔁
Cache Coherence
arxiv.org
·
6d
SplitZip
: Ultra Fast Lossless KV Compression for
Disaggregated
LLM Serving
⚡
Low-Latency Systems
arxiv.org
·
1d
FaaSMoE
: A Serverless Framework for
Multi-Tenant
Mixture-of-Experts Serving
🌐
Distributed Systems
arxiv.org
·
6d
Tempus: A Temporally Scalable Resource-Invariant
GEMM
Streaming Framework for
Versal
AI Edge
⚡
Hardware Acceleration
arxiv.org
·
2d
DUAL-BLADE: Dual-Path NVMe-Direct
KV-Cache
Offloading
for Edge LLM Inference
⚡
Low-Latency Systems
arxiv.org
·
6d
A Treasure
Trove
of Performance: Analyzing the
IO500
Submission Data
⚡
Performance
arxiv.org
·
1d
Caliper-in-the-Loop
: Black-Box Optimization for
Hyperledger
Fabric Performance Tuning
⚡
Low-Latency Systems
arxiv.org
·
1d
DAK
: Direct-Access-Enabled GPU Memory
Offloading
with Optimal Efficiency for LLM Inference
⚡
DMA
arxiv.org
·
6d
Stochastic
Sparse
Attention for Memory-Bound Inference
⚡
Vectorized Execution
arxiv.org
·
1d
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help