Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 GPU Kernels
CUDA Kernels, Optimization, Memory Coalescing, Shared Memory
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112741
posts in
794.0
ms
Beyond a Single
Queue
:
Multi-Level-Multi-Queue
as an Effective Design for
SSSP
problems on GPUs
arxiv.org
·
3d
🌊
CUDA Streams
AMD Ryzen 7
9850X3D
vs Ryzen 7 9800X3D
faceoff
— an extra $30 buys you very little performance
tomshardware.com
·
4h
🔧
PTX
a Linux
VM
manager with easy
GPU-passthrough
and more
vm-curator.org
·
1d
·
Discuss:
Hacker News
🔧
PTX
NVIDIA
DGX
Spark
Powers Big Projects in Higher Education
blogs.nvidia.com
·
2d
🔗
NCCL
Moss
: A Linux-compatible Rust
async
kernel, 3 months on
news.ycombinator.com
·
1d
·
Discuss:
Hacker News
📦
uv
Oxide
plans new rack attack,
packing
in Zen 5 CPUs and DDR5 RAM
theregister.com
·
17h
·
Discuss:
Hacker News
🧠
CPU Architecture
Breaking the
Tractability
Barrier: A Generic Low-Level Solver for
NP-Hard
Instances (N=63) on Commodity 64-Bit Silicon
zenodo.org
·
1d
·
Discuss:
Hacker News
🎯
Tensor Cores
BOute
: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via
Multi-Objective
Bayesian Optimization
arxiv.org
·
2d
🔗
NCCL
From hand-tuned to generated: A
reproducible
Triton
GPU kernel benchmark across different vendors
next.redhat.com
·
1d
⏱️
CUDA Events
NVIDIA RTX 5070 vs
Radeon
RX
9070: Which GPU should you buy in 2026?
tech.sportskeeda.com
·
3h
🔍
Nsight
The 5
Distributed
Training
Methods
: How to Train Models Too Large for One GPU
pub.towardsai.net
·
1d
🔗
NCCL
OpenAI GPT-5.3-Codex-Spark Now Running at 1K Tokens Per
Secondon
BIG
Cerebras
Chips
servethehome.com
·
21h
·
Discuss:
Hacker News
⚡
Flash Attention
Nvidia Deepens AI Inference Push With
Groq
Deal And
Rubin
Platform
finance.yahoo.com
·
1d
🎮
NVIDIA
AI, GPU, And
HPC
Data
Centers
: The Infrastructure Behind Modern AI
semiengineering.com
·
2d
⏱️
CUDA Events
Show HN: GPU
ROI
simulator
based on token usage and model architecture
axiomos.ai
·
4d
·
Discuss:
Hacker News
📈
GPU Occupancy
Nvidia’s new
technique
cuts LLM reasoning costs by 8x without losing
accuracy
venturebeat.com
·
1d
·
Discuss:
r/LocalLLaMA
🔗
NCCL
Building a Zero-Dependency
secp256k1
CUDA
Engine from Scratch (2.5B ops/SEC)
github.com
·
3d
·
Discuss:
Hacker News
🔧
PTX
Nvidia-Leased
Data Center Wraps Up In-Demand $
3.8B
Bond
bloomberg.com
·
15h
🔗
NCCL
Linux 7.0
MM
Changes Bring Some Very Nice Performance
Optimizations
phoronix.com
·
1d
📊
Profiling Tools
NVIDIA RTX
6000D
PCB Spotted With 84GB
GDDR7
Using 28x 3GB Chip Configuration
eteknix.com
·
20h
🔍
Nsight
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help