Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚡ Cache-Aware Algorithms
Memory Hierarchy, Data Locality, Performance Optimization, NUMA
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112011
posts in
419.1
ms
UMEM
: Unified Memory Extraction and Management Framework for
Generalizable
Memory
arxiv.org
·
1d
🧠
Memory Allocators
Breaking the
Tractability
Barrier: A Generic Low-Level Solver for
NP-Hard
Instances (N=63) on Commodity 64-Bit Silicon
zenodo.org
·
1h
·
Discuss:
Hacker News
🐹
Minimal Go
How
caching
helps
in LLM Application?
dev.to
·
16h
·
Discuss:
DEV
🧠
Memory Models
Execution-Centric Characterization of
FP8
Matrix Cores, Asynchronous Execution, and Structured Sparsity on AMD
MI300A
arxiv.org
·
1d
🎯
CPU Dispatch
Supercharging
Inference for AI Factories: KV Cache
Offload
as a Memory-Hierarchy Problem
blog.min.io
·
20h
🧠
Memory Hierarchy
Building an Embedding API with Rust, Arm, and
EmbeddingGemma
on AWS
Lambda
sobolev.substack.com
·
44m
·
Discuss:
Substack
📋
JSON Parsing
Minimum
Energy Per
Query
semiengineering.com
·
1d
⏲️
Embedded GC
How
octorus
Renders
300K
Lines of Diff at High Speed
dev.to
·
4h
·
Discuss:
DEV
🌊
Async Compilers
Scaling llama.cpp On
Neoverse
N2: Solving
Cross-NUMA
Performance Issues
semiengineering.com
·
1d
💾
Zero-Copy
The
Fourth
Wave
of Computing
lucibrowser.com
·
1h
·
Discuss:
Hacker News
🌱
Green Threads
Intel Posts 2026 Update For
Cache
Aware
Scheduling
On Linux
phoronix.com
·
14h
💾
Cache Algorithms
Cache-aware
disaggregated
inference for up to 40% faster long-context LLM
serving
together.ai
·
2d
·
Discuss:
Hacker News
,
r/LocalLLaMA
⏲️
Embedded GC
Optimizing the
MongoDB
Java Driver: How minor
optimizations
led to macro gains
linkedin.com
·
1d
·
Discuss:
DEV
⚡
Interpreter Optimization
AI in Multiple
GPUs
: Understanding the Host and Device
Paradigm
towardsdatascience.com
·
22h
🤝
Cooperative Threading
Avoiding
UB
but "safe" data race in a lock-free slab
allocator
- help - The Rust Programming Language Forum
users.rust-lang.org
·
1d
🔒
Rust Borrowing
C++20 matching engine - arena allocator, lock-free
SPSC
, intrusive linked lists, 255ns
p50
latency
github.com
·
6h
·
Discuss:
r/cpp
🔢
Algebraic Datatypes
Zero State
Architecture
deep
dive
news.ycombinator.com
·
18h
·
Discuss:
Hacker News
📡
Erlang BEAM
Best CPU 2026 – the top AMD
Ryzen
and Intel Core
processors
tested
club386.com
·
1h
🔀
SIMD Programming
Performance Tip of the Week #62:
Identifying
and reducing memory
bandwidth
needs
abseil.io
·
5d
⚡
Cache Optimization
Nvidia’s new
technique
cuts LLM reasoning costs by 8x without losing
accuracy
venturebeat.com
·
13h
🗺️
Region Inference
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help