Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔄 SIMD Programming
Specific
AVX512, Vector Instructions, Loop Unrolling, Auto-vectorization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
175468
posts in
24.6
ms
Vectorization
of
Verilog
Designs and its Effects on Verification and Synthesis
arxiv.org
·
5h
🚀
Compiler Optimization
wexionar/fast-simplex
: Fast 2D interpolation engine: 20-40x faster than
Delaunay
, 99.5% success rate, handles 10M+ points. Proximity-first philosophy for practical performance.
github.com
·
13h
·
Discuss:
r/Python
✂️
CUTLASS
ACPV-Net
: All-Class
Polygonal
Vectorization for Seamless Vector Map Generation from Aerial Imagery
arxiv.org
·
1d
🔗
Kernel Fusion
Why real-world AI performance
depends
on the control
layer
theregister.com
·
53m
⏱️
CUDA Events
High-Performance AST
Extraction
in Rust: An
Architectural
Deep Dive
medium.com
·
20h
🔬
Static Analysis
From Exact
kNN
to
DiskANN
: The Evolution of High-Performance Vector Search
hackernoon.com
·
1d
⚡
ONNX Runtime
seanwevans/lockstep
: Data-oriented systems programming language for high-throughput, deterministic compute pipelines, enforcing a straight-line SIMD execution model and static memory topology for maximum CPU vectorization.
github.com
·
3d
·
Discuss:
Hacker News
✂️
CUTLASS
Testing the performance of
various
chip
programmers
blog.adafruit.com
·
2d
📊
Profiling Tools
RISC-V based
Vectorization
of Classic
McEliece
Key Generation
eprint.iacr.org
·
3d
✂️
CUTLASS
33. C# (
Loop
Performance)
dev.to
·
4d
·
Discuss:
DEV
🚀
Compiler Optimization
Spanda
: The High-Performance
Animation
Engine Built for Every Platform in Rust
aarambhdevhub.medium.com
·
1d
✂️
CUTLASS
Less-relevant results
GeForce
RTX path
tracing
performance will be a million times faster in the future
tweaktown.com
·
3d
🔍
Nsight
AI on
HPC
Workshop
2026
ai-on-hpc.github.io
·
16h
⚡
ONNX Runtime
Fast
Arcsine
in Apple
blog.vmchale.com
·
4d
⚡
Flash Attention
Time-Travel
Debugging
in State Management: Part 2 — Performance & Advanced
Topics
dev.to
·
5d
·
Discuss:
DEV
📊
Profiling Tools
IDA
Plugin
Updates on 2026-03-17
williballenthin.com
·
2d
📦
uv
SkillsBench
—
Benchmarking
How Well Agent Skills Work
skillsbench.ai
·
2d
🤖
AI Coding Tools
GPU-Optimized
PyTorch
Builds Made Easy with
Flox
and Nix
flox.dev
·
3d
📜
TorchScript
Apple
M5
GPU
Roofline
Analysis
michaelstinkerings.org
·
3d
🔍
Nsight
Enabling Efficient Sparse
Computations
using Linear Algebra Aware
Compilers
(Technical Report)
osti.gov
·
6d
·
Discuss:
Hacker News
,
r/programming
✂️
CUTLASS
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help