Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔧 PTX
GPU Assembly, CUDA ISA, Kernel Optimization, Low-level Programming
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112734
posts in
896.6
ms
Mesa 26.0:
RADV
catapults
Radeon ray tracing forward on Linux
igorslab.de
·
10h
🔍
Nsight
Distributed Training Across Mixed GPUs:
Solving
the
Heterogeneous
Fleet Problem
shardpool.aurora-sentient.net
·
3h
·
Discuss:
DEV
🔗
NCCL
CUDA
Shared Memory Bank Conflict-Free
Vectorized
Access
leimao.github.io
·
1d
🎛️
CUDA Optimization
5 Days, One GPU
Gameboy
Swarm
bkase.io
·
1d
·
Discuss:
Hacker News
⏱️
CUDA Events
qwatts-dev/bitnet-webgpu-poc
: An experimental WebGPU compute shader proof-of-concept for running
BitNet
b1.58 (1-bit ternary) matrix math natively in JavaScript.
github.com
·
8h
·
Discuss:
r/LocalLLaMA
✂️
CUTLASS
A
RISC-V
vector
extension primer
blog.adafruit.com
·
1d
🔄
SIMD Programming
a Linux
VM
manager with easy
GPU-passthrough
and more
vm-curator.org
·
1d
·
Discuss:
Hacker News
⏱️
CUDA Events
How low-bit
inference
enables
efficient AI
dropbox.tech
·
3h
·
Discuss:
Hacker News
🎯
Tensor Cores
Fine-Tuning
GPT-5 for GPU
Kernel
Generation
arxiv.org
·
2d
·
Discuss:
Hacker News
🎯
GPU Kernels
Show HN: Skill that lets Claude
Code/Codex
spin up
VMs
and GPUs
news.ycombinator.com
·
1h
·
Discuss:
Hacker News
🤖
AI Coding Tools
AI in Multiple
GPUs
: Point-to-Point and
Collective
Operations
towardsdatascience.com
·
1d
🔗
NCCL
Arch Linux Running Well On LoongArch -
Loongson
3B6000
Benchmarks
tcaf.com
·
6h
⚙️
Systems Programming
Security Assessment of Intel
TDX
with support for Live
Migration
arxiv.org
·
1d
⏱️
CUDA Events
European Chip Startup Pulls Off Working
RISC-V
Solution on the Intel 3 Node,
Marking
One ‘Small’ Step Towards Having Sovereign Infrastructure
wccftech.com
·
1d
🧠
CPU Architecture
Can Logs Help Optimize Databases? Using
Grafana
Loki
as Support
dev.to
·
4h
·
Discuss:
DEV
📊
Profiling Tools
Nvidia
DGX
Spark update cuts idle power by 32% or more — hot-plug detection on
ConnectX
NIC makes for a more efficient AI workstation
tomshardware.com
·
1d
📈
GPU Occupancy
Building a Zero-Dependency
secp256k1
CUDA
Engine from Scratch (2.5B ops/SEC)
github.com
·
3d
·
Discuss:
Hacker News
⚡
CUDA Programming Patterns
My First
Vulkan
Extension
christian-gmeiner.info
·
1d
·
Discuss:
Lobsters
,
Hacker News
🔍
Nsight
Custom
Kernels
for All from
Codex
and Claude
huggingface.co
·
1d
·
Discuss:
Hacker News
🎯
GPU Kernels
Radeon Vega lives on, Acemagic
N3A
NAS launches with Ryzen 7
3750H
“Picasso” APU
videocardz.com
·
4h
📈
GPU Occupancy
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help