🔲 Loop Tiling
Cache Optimization, Blocking, Matrix Multiplication, Locality
Scoured 112754 posts in 441.5 ms
Better Diameter Bounds for Efficient Shortcuts and a Structural Criterion for Constructiveness
arxiv.org · 1d
📊 CUDA Graphs
Beyond Bilinear Complexity: What Works and What Breaks with Many Modes?
arxiv.org · 21h
🔗 Kernel Fusion
AI Infra HPC
dev.to · 10h · Discuss: DEV
🔄 SIMD Programming
Linux 7.0 MM Changes Bring Some Very Nice Performance Optimizations
phoronix.com · 1d
📊 Profiling Tools
ml-rust/fluxbench: Benchmarking framework with crash isolation, bootstrap statistics, and CI integration
github.com · 13h · Discuss: r/rust
📊 Profiling Tools
Introduction To Concurrency | Concurrency Interview | AlgoMaster.io
algomaster.io · 2d
🌊 CUDA Streams
A stack-buffer-overflow exercise with AddressSanitizer and PostgreSQL
enterprisedb.com · 1d · Discuss: Lobsters, Hacker News
📊 Profiling Tools
GPU-Serving Two-Tower Models for Lightweight Ads Engagement Prediction
medium.com · 2h
⚡ Flash Attention
The cache as a product category
janmeppe.com · 1d
🏗️ Build Optimization
Olmix: A framework for data mixing throughout LM development
allenai.org · 10h
⚡ ONNX Runtime
Show HN: Solving Sudoku reasoning via Energy Geometric models
davisgeometric.com · 1d · Discuss: Hacker News
✂️ CUTLASS
January in TigerLand
kill-the-newsletter.com · 12h
🐕 Ruff
Get 32GB of DDR5-6400 RAM for $150 when you buy the new AMD Ryzen 7 9850X3D, thanks to this Newegg combo deal
tomshardware.com · 14h
⚡ Flash Attention
Savior: Low-Level Design
dev.to · 19h · Discuss: DEV
⚙️ Systems Programming
The Fourth Wave of Computing
lucibrowser.com · 16h · Discuss: Hacker News
💡 LSP
Index Compression, Query Execution Improvements
marginalia.nu · 1d
📊 Profiling Tools
Memory Bandwidth Napkin Math
forrestthewoods.com · 6d
⚡ CUDA Programming Patterns
Atomistic, but non-complete lattices
dominiczypen.wordpress.com · 16h
✂️ CUTLASS
Reflections on prototyping a sysadmin benchmark
samek.fyi · 6h
📊 Profiling Tools
The 4 Precision Formats: How to Train AI 2× Faster with Half the Memory
pub.towardsai.net · 12h
📉 Model Quantization