Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔲 Loop Tiling
Cache Optimization, Blocking, Matrix Multiplication, Locality
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
83098
posts in
869.9
ms
DSB
: Dynamic
Sliding
Block Scheduling for Diffusion LLMs
arxiv.org
·
1d
📊
Gradient Accumulation
Reducing the Computational Cost Scaling of Tensor Network Algorithms via
Field-Programmable
Gate Array
Parallelism
arxiv.org
·
1d
🎯
Tensor Cores
⚖️ Beginner-Friendly Guide 'Minimum
Removals
to Balance
Array
' - Problem 3634 (C++, Python, JavaScript)
dev.to
·
1d
·
Discuss:
DEV
🔄
SIMD Programming
**Abstract:** This paper introduces a novel framework for automated verification of deformations applied to Hilbert
polytopes
, a crucial step in
understandin
...
freederia.com
·
1d
✂️
CUTLASS
Show HN: C discrete event SIM w
stackful
coroutines runs 45x faster than
SimPy
github.com
·
3d
·
Discuss:
Hacker News
⏱️
CUDA Events
Sharding
Databases with Spring Boot: Patterns,
Pitfalls
, and Failure Modes
dev.to
·
1d
·
Discuss:
DEV
🌳
Git Internals
The
Null
Pointer
Crisis: Running God-Mode Software on Legacy Hardware
news.ycombinator.com
·
1d
·
Discuss:
Hacker News
⚙️
Systems Programming
avoiding
trigonometry
- Inigo
Quilez
:: computer graphics, maths, shaders, fractals, demoscene
iquilezles.org
·
3d
✂️
CUTLASS
DC
Byte
analysis warns the era of ‘cheap and
abundant
RAM’ is over
kitguru.net
·
2d
⚡
Flash Attention
Designing
energy-efficient AI chips: Why power must be an early
consideration
edn.com
·
1d
🧠
CPU Architecture
Anthropic
's Performance Take-Home: A 65x Optimization (For
Dummies
)
ikot.blog
·
3d
·
Discuss:
Hacker News
🎛️
CUDA Optimization
ML for Energy-Performance-Aware Scheduling On Heterogeneous
Multicore
Architectures (
Cambridge
)
semiengineering.com
·
4d
📈
Occupancy Optimization
Engineering
Ethereum
's Speed: How we made
Ethrex
20x faster
blog.lambdaclass.com
·
2d
⏱️
Benchmarking
A Faster
WBT/SBT
Implementation Than Linux
RBT
typecombinator.github.io
·
3d
·
Discuss:
r/cpp
🏗️
Build Optimization
Claude Code's
renderer
is more
complex
than a game engine
spader.zone
·
4d
·
Discuss:
Hacker News
📈
GPU Occupancy
Sukr
: A minimal static site
compiler
in Rust with zero-JS output
lobste.rs
·
2d
·
Discuss:
Lobsters
🐕
Ruff
**Abstract:** This paper introduces a novel framework for automated vulnerability discovery and
patching
of
neuromorphic
hardware, leveraging hyper-dimension...
freederia.com
·
1d
🔄
SIMD Programming
A few
CPU
hardware
bugs
taricorp.net
·
2d
·
Discuss:
Lobsters
,
Hacker News
🧠
CPU Architecture
Sampling the
Oxford
CS
Library
blog.computationalcomplexity.org
·
2d
·
Discuss:
blog.computationalcomplexity.org
🔬
Static Analysis
The Linux
graphics
stack in a
nutshell
, part 1
lwn.net
·
2d
·
Discuss:
Hacker News
🔧
PTX
Loading...
Loading more...
« Page 4
•
Page 6 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help