Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚡ Cache Optimization
Data Locality, Cache-Friendly Code, Memory Hierarchy, Performance
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
154236
posts in
19.6
ms
Data
Oriented
Design by
example
(2017)
🚀
Performance
nikitablack.github.io
·
1d
·
Hacker News
Solution
for a high performance data
lake
🗄️
Databases
beacon.maris.nl
·
16h
Presentation: When Every Bit Counts: How
Valkey
Rebuilt Its
Hashtable
for Modern Hardware
⚡
Caching Strategies
infoq.com
·
3d
Optimizing Python
Compiler
Project in Rust:
Balancing
Organization, Focus, and Community Engagement
🚀
Performance
github.com
·
3h
·
DEV
A Full-Stack Performance Evaluation Infrastructure for
3D-DRAM-based
LLM
Accelerators
🚀
Performance
arxiv.org
·
1d
Principles of
Mechanical
Sympathy
⚡
Caching Strategies
martinfowler.com
·
3d
·
Hacker News
,
Hacker News
Runahead
Execution vs. Conventional Data Prefetching in the IBM
POWER6
Microprocessor (2010)
🚀
Performance
pages.cs.wisc.edu
·
2d
·
Lobsters
Low-Rank Key Value Attention: Reducing
KV
Cache Memory and
Maintaining
Head Diversity
🧱
Chunking
fin.ai
·
1d
·
Hacker News
The Hidden Value of
CPU-Intensive
Compression
on Modern Hardware
📉
Model Quantization
klarasystems.com
·
2d
Garbage
Collection in the Era of Virtual Threads (Project
Loom
)
🚀
Performance
blog.gceasy.io
·
2d
Cloudflare and ETH
Zurich
Outline
Approaches for AI-Driven Cache Optimization
⚡
Caching Strategies
infoq.com
·
2d
3D-Stacked
NMP
, LLM Decoding, Systolic Array
Microarchitecture
, Multi-Core Scheduling
🚀
Performance
arxiv.org
·
4d
Fast
Cross-Operator
Optimization of Attention
Dataflow
🚀
Performance
arxiv.org
·
4d
CIDER
: Boosting
Memory-Disaggregated
Key-Value Stores with Pessimistic Synchronization
🚀
Performance
arxiv.org
·
5d
Sparsity-Aware
Roofline
Models for Sparse Matrix-Matrix
Multiplication
🚀
Performance
arxiv.org
·
2d
NEURA
: A Unified and
Retargetable
Compilation Framework for Coarse-Grained Reconfigurable Architectures
⚡
Performance Engineering
arxiv.org
·
4d
JZ-Tree
: GPU friendly neighbour search and friends-of-friends with dual tree walks in
JAX
plus CUDA
🚀
Performance
arxiv.org
·
3d
AutoLALA
: Automatic Loop Algebraic
Locality
Analysis for AI and HPC Kernels
💸
Affordable LLMs
arxiv.org
·
3d
DeepStack
: Scalable and Accurate Design Space Exploration for Distributed 3D-Stacked AI
Accelerators
🚀
Performance
arxiv.org
·
4d
Computer Architecture's
AlphaZero
Moment: Automated Discovery in an
Encircled
World
🚀
Performance
arxiv.org
·
4d
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help