🔲 Loop Tiling
Cache Optimization, Blocking, Matrix Multiplication, Locality
The Avatar Cache: Enabling On-Demand Security with Morphable Cache Architecture
arxiv.org · 1d
⚡ CUDA Programming Patterns
Same Engine, Multiple Gears: Parallelizing Fixpoint Iteration at Different Granularities (Extended Version)
arxiv.org · 1d
🌊 CUDA Streams
Local-First AI: How SLMs are Fixing the Latency Gap 💻✨
dev.to · 1d · Discuss: DEV
⚡ Flash Attention
Your VCL App: 4x to 11x Faster Math Performance with Elements
blogs.remobjects.com · 1d · Discuss: Hacker News
✂️ CUTLASS
Comparing accumulate to C++23s fold_left
meetingcpp.com · 2d
🚀 Compiler Optimization
What should I program?
jamesmcm.github.io · 2d
✂️ CUTLASS
Your Ray Data Pipeline Works at 10K Samples. Here's Why It Crashes at 1M.
dev.to · 1d · Discuss: DEV
🌐 Distributed Computing
Faster than Dijkstra?
systemsapproach.org · 1d · Discuss: Hacker News
📊 CUDA Graphs
Lucene HNSW performance: A deep dive into the OS page cache
opensearch.org · 1d
📊 Profiling Tools
How the GNU C Compiler became the Clippy of cryptography
theregister.com · 1d · Discuss: Hacker News, r/programming
🚀 Compiler Optimization
Why JavaScript Needs Structured Concurrency | Blog
frontside.com · 1d · Discuss: Hacker News, r/javascript
🚀 Compiler Optimization
January 2026 Monthly report | Alternative Rust Compiler for GCC
rust-gcc.github.io · 8h · Discuss: r/rust
📦 uv
Understanding the Go Runtime: The Bootstrap
internals-for-interns.com · 1d · Discuss: Hacker News, r/golang
📊 Profiling Tools
the mathematics of compression in database systems
bitsxpages.com · 1d · Discuss: Hacker News
📉 Model Quantization
Sculptor: The missing UI for coding agents
imbue.com · 18h
🤖 AI Coding Tools
John Carmack muses using a long fiber line as an L2 cache for streaming AI data; programmer imagines fiber as alternative to DRAM
tomshardware.com · 1d · Discuss: Hacker News
⚡ Flash Attention
Geospatial System Design Patterns
systemdr.substack.com · 2d · Discuss: Substack
⚡ CUDA Programming Patterns
Concurrent vs. Parallel Execution in LLM API Calls: From an AI Engineer’s Perspective
pub.towardsai.net · 1d
🤖 AI Coding Tools
Intel Core Ultra "Arrow Lake Refresh" Chips Focus on E-core Count and L3 Cache Uplifts
techpowerup.com · 1d
🧠 CPU Architecture
How To Go Slow
artima.com · 2d
📊 Profiling Tools