Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 GPU Kernels
CUDA Kernels, Optimization, Memory Coalescing, Shared Memory
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
121791
posts in
1.95
s
ZipFlow
: a Compiler-based Framework to Unleash
Compressed
Data Movement for Modern GPUs
arxiv.org
·
2d
🌊
CUDA Streams
FlashSketch
: Sketch-Kernel Co-Design for Fast Sparse
Sketching
on GPUs
arxiv.org
·
3d
🎛️
CUDA Optimization
[News] SK
hynix
Unveils AI Chip Architecture with
HBF
, Reportedly Boosts Performance per Watt by Up to 2.69×
trendforce.com
·
5h
·
Discuss:
r/hardware
⚡
Flash Attention
Discussion - Investigation of Single Thread CPU "
Thoughput/cycle
"
forums.anandtech.com
·
7h
📊
Profiling Tools
datavorous/spheni
: An in-memory vector search library in C++ with Python bindings
github.com
·
16h
·
Discuss:
Hacker News
✂️
CUTLASS
building
cuda-gdb
from sources
redplait.blogspot.com
·
3d
·
Discuss:
redplait.blogspot.com
⚡
CUDA Programming Patterns
DeepComputing
Unveils
RVA23-Compliant
Mainboard III for Linux on Framework 13
lxer.com
·
16h
🎯
Tensor Cores
New AMD
Adrenalin
Driver
bluesnews.com
·
5h
🎮
NVIDIA
AI agent
sandboxing
in 2026: how to choose between primitives,
runtimes
, and platforms
manveerc.substack.com
·
12h
·
Discuss:
Substack
🏗️
Bazel
Ph42oN
/
dxvk-gplasync
gitlab.com
·
9h
⏱️
CUDA Events
How a ‘
zombie
’
chipmaker
became Nvidia’s vital AI ally
ft.com
·
1d
⚡
CUDA Programming Patterns
Hitting
1,000
tokens
per second on a single RTX 5090
blog.alpindale.net
·
3d
·
Discuss:
Hacker News
,
Hacker News
🎛️
CUDA Optimization
What Nvidia, Google and Meta Are Building Beyond
Chips
and
Compute
pymnts.com
·
1d
🔍
Nsight
Intel Releases New Compute Runtime,
Upstreams
More
SYCL
Code To LLVM
phoronix.com
·
17h
🔧
PTX
Guney-olu/nanoslg
: A from-scratch implementation of distributed LLM inference in simple readable Python
github.com
·
2d
·
Discuss:
Hacker News
,
r/LLM
⏱️
CUDA Events
Nvidia bundles Resident Evil Requiem as AMD
counters
with
Crimson
Desert
techspot.com
·
19h
🎮
NVIDIA
How to connect
Convex
to
RunPod
for serverless GPU workloads
stack.convex.dev
·
2d
🔧
PTX
How PCIe,
NVLink
, and
NUMA
Topology Affect GPU Scheduling Outcomes
dev.to
·
3d
·
Discuss:
DEV
📊
CUDA Graphs
AMD's 3D
V-Cache
is still the best gaming upgrade money can buy
xda-developers.com
·
8h
🔧
PTX
GeForce Now on Linux
Feels
Like a Real
Turning
Point for Cloud Gaming
cgmagonline.com
·
1d
🎮
NVIDIA
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help