🔲 Loop Tiling
Cache Optimization, Blocking, Matrix Multiplication, Locality
Constrained parallel tempering in traveling-salesman problems with circular neighborhoods
link.aps.org · 3d
🌐 Distributed Computing
Accelerate your discovery by parallelizing experiments
magellink.com · 2d · Discuss: Hacker News
🌐 Distributed Computing
miniKanren.org
minikanren.org · 2d · Discuss: Lobsters
✂️ CUTLASS
How Virtual Textures Really Work
shlom.dev · 4d · Discuss: Hacker News
📈 GPU Occupancy
Continual learning and the post monolith AI era
baseten.co · 4d · Discuss: Hacker News
📊 Gradient Accumulation
C++ Latch and Barrier
leimao.github.io · 4d
⚡ CUDA Programming Patterns
Show HN: 289x speedup over MLP using Spectral Graphs
zenodo.org · 3d · Discuss: Hacker News
🔀 Operator Fusion
Local-First AI: How SLMs are Fixing the Latency Gap 💻✨
dev.to · 1d · Discuss: DEV
⚡ Flash Attention
Investigating Energy Bounds of Analog Compute-in-Memory with Local Normalization
arxiv.org · 18h
🎯 Tensor Cores
tmilovan/composite-machine: Composite Machine: Automatic Calculus via Dimensional Arithmetic
github.com · 1d · Discuss: Hacker News
✂️ CUTLASS
2026W05
jordivillar.com · 2d
🔍 Nsight
Definitive Guide to Multi-Threaded Rendering on the Web
hackernoon.com · 3d
⚡ CUDA Programming Patterns
Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory
news.ycombinator.com · 2d · Discuss: Hacker News
🦀 PyO3
Atomics in C++26?
meetingcpp.com · 3d
📊 Profiling Tools
How I synced Cursor, Claude, and Windsurf with one shared brain (MCP)
dev.to · 18h · Discuss: DEV
⚡ CUDA Programming Patterns
An attempt at a First-Proof AI challenge
abhvio.us · 2d · Discuss: Hacker News
🔗 Kernel Fusion
Performance Tip of the Week #7: Optimizing for application productivity
abseil.io · 3d
⚙️ Systems Programming
Cooperative Visitor: A Template Technique for Visitor Creation
artima.com · 3d
✂️ CUTLASS
*Abstract* We present a scalable, fully‑implemented framework for enumerating Goldbach partitions of large even integers in the range \(10^{12}\!\leq n \leq ...
freederia.com · 4d
🌊 CUDA Streams
MemFly: On-the-Fly Memory Optimization via Information Bottleneck
arxiv.org · 18h
⚡ Flash Attention