🔲 Loop Tiling
Cache Optimization, Blocking, Matrix Multiplication, Locality
Scoured 115128 posts in 1.74 s
Beyond a Single Queue: Multi-Level-Multi-Queue as an Effective Design for SSSP problems on GPUs
arxiv.org · 1h
🌊 CUDA Streams
Series-Parallel-Loop Decompositions of Control-flow Graphs
arxiv.org · 1d
🔀 Operator Fusion
Memory Bandwidth Napkin Math
forrestthewoods.com · 3d
⚡ CUDA Programming Patterns
An introduction to lockless algorithms [LWN.net]
lwn.net · 1d
⚡ CUDA Programming Patterns
Squares of a Sorted Array: Coding Problem Explained
dev.to · 1h · Discuss: DEV
🚀 Compiler Optimization
Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization
machinelearning.apple.com · 1d
⏱️ CUDA Events
Benchmarking Claude C Compiler
dineshgdk.substack.com · 23h · Discuss: Substack, r/programming
🚀 Compiler Optimization
Building a Regex Engine with a team of parallel Claudes
lesswrong.com · 6h
⚙️ Code Generation
CS 6120: The Self-Guided Course
cs.cornell.edu · 3h
🤖 AI Coding Tools
DFlash: Block Diffusion for Flash Speculative Decoding
z-lab.ai · 1d · Discuss: Hacker News
📜 TorchScript
A Note on Flat Abstract Syntax Trees
gist.github.com · 1d · Discuss: Hacker News
⚙️ Code Generation
Faster AI Training Unlocked With New System For Massive Language Models
quantumzeitgeist.com · 1d
🎯 Tensor Cores
Hitting 1,000 tokens per second on a single RTX 5090
blog.alpindale.net · 2d · Discuss: Hacker News
🎛️ CUDA Optimization
Performance Tip of the Week #62: Identifying and reducing memory bandwidth needs
abseil.io · 3d
📊 Profiling Tools
AFMTJ Model For In-Memory Computing (University of Arizona)
semiengineering.com · 13h
⚡ CUDA Programming Patterns
I Assumed Java Streams Had Minimal Overhead. They Didn’t
ilusr.com · 3h · Discuss: DEV
🏗️ Build Optimization
Backtracking Algorithms
algos.khourani.com · 15h
🚀 Compiler Optimization
Interesting things about the Lua interpreter
thesephist.com · 9h
⚙️ Code Generation
🌌 Beginner-Friendly Guide 'Longest Balanced Subarray II' - Leetcode 3721 (C++, Python, JavaScript)
dev.to · 2h · Discuss: DEV
🔍 Type Checkers
A Local Code Copilot for Edits: Why sweep-next-edit-1.5B Is Built for Speed
hackernoon.com · 4h
💡 LSP