🔄 SIMD Programming
AVX512, Vector Instructions, Loop Unrolling, Auto-vectorization
Scoured 82692 posts in 687.7 ms
Show HN: C discrete event SIM w stackful coroutines runs 45x faster than SimPy
github.com · 1d · Discuss: Hacker News
⏱️ CUDA Events
Mitigating Staleness in Asynchronous Pipeline Parallelism via Basis Rotation
arxiv.org · 11h
🌊 CUDA Streams
Anthropic's Performance Take-Home: A 65x Optimization (For Dummies)
ikot.blog · 1d · Discuss: Hacker News
🎛️ CUDA Optimization
Spin your own Micro
hackster.io · 5h
⚡ Flash Attention
WebGPU Cameras
webgpufundamentals.org · 8h
🎮 NVIDIA
Fast Reconfiguration for Programmable Matter
arxiv.org · 11h
✂️ CUTLASS
**Abstract:** This research introduces a novel framework for rapid and accurate analysis of transient flow characteristics within microchannel heat sinks, a ...
freederia.com · 5h
📊 Profiling Tools
Demystifying ARM SME to Optimize General Matrix Multiplications
news.ycombinator.com · 2d · Discuss: Hacker News
🚀 Compiler Optimization
AMD Intros Kintex UltraScale+ Gen 2 FPGAs
servethehome.com · 9h
🔧 PTX
Cpu Work (2)
dev.to · 2d · Discuss: DEV
🚀 Compiler Optimization
Engineering Ethereum's Speed: How we made Ethrex 20x faster
blog.lambdaclass.com · 1h
⏱️ Benchmarking
Optimized LLM Inference Engines
rishirajacharya.com · 1h
⚡ ONNX Runtime
The Launch of RISC-V Now! A New Chapter in Open Computing
semiwiki.com · 34m
🔧 PTX
“Parallelizing MCMC Across the Sequence Length”: This one is really cool.
statmodeling.stat.columbia.edu · 1d
⚡ ONNX Runtime
Scaling Video Encoding with Edge AI Power
dev.to · 11h · Discuss: DEV
⚡ Flash Attention
Converting data to hexadecimal outputs quickly
lemire.me · 2d · Discuss: Hacker News
✂️ CUTLASS
A Demonstration of Self-Profiling
geoffchappell.com · 23h · Discuss: Hacker News
📊 Profiling Tools
The Linux graphics stack in a nutshell, part 1
lwn.net · 7h · Discuss: Hacker News
🔧 PTX
The Heartbeat of Tetris 🟥🟥🟥🟥: What a 1x1 Pixel Taught Me About Concurrency
qianarthurwang.substack.com · 23h · Discuss: r/programming
⚡ CUDA Programming Patterns
Taking on Anthropic's Public Performance Engineering Interview Challenge
matthewtejo.substack.com · 14h · Discuss: r/programming
🤖 AI Coding Tools