Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
๐ฏ GPU Kernels
CUDA Kernels, Optimization, Memory Coalescing, Shared Memory
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81116
posts in
738.9
ms
CUDA
Guide:
Workflow
for Performance Tuning
digitalocean.com
ยท
4d
โก
CUDA Programming Patterns
How
Anam
Achieved 250% Faster Inference Using
Zymtrace
Continuous GPU Profiling
zymtrace.com
ยท
7h
๐
Nsight
Hitting
1,000
tokens
per second on a single RTX 5090
blog.alpindale.net
ยท
10h
ยท
Discuss:
Hacker News
๐๏ธ
CUDA Optimization
building
cuda-gdb
from sources
redplait.blogspot.com
ยท
20h
ยท
Discuss:
redplait.blogspot.com
โก
CUDA Programming Patterns
How PCIe,
NVLink
, and
NUMA
Topology Affect GPU Scheduling Outcomes
dev.to
ยท
3h
ยท
Discuss:
DEV
๐
CUDA Graphs
GeForce RTX 6090 in 2028 at the
earliest
: When memory shortages
dictate
Nvidia's roadmap
igorslab.de
ยท
4h
โฑ๏ธ
CUDA Events
llama.cpp
guide - Running LLMs
locally
, on any hardware, from scratch
blog.steelph0enix.dev
ยท
5h
๐ก
LSP
Hardware
Acceleration
jellyfin.org
ยท
1d
โฑ๏ธ
CUDA Events
ggml
: backend-agnostic tensor parallelism by
JohannesGaessler
ยท Pull Request #19378
github.com
ยท
3d
ยท
Discuss:
r/LocalLLaMA
๐ฏ
Tensor Cores
12 years ago, I left AMD for NVIDIA, and AMD has never
given
me a
reason
to come back
xda-developers.com
ยท
10h
๐ฎ
NVIDIA
From Sequential to Parallel:
Reformulating
Dynamic Programming as GPU Kernels for Large-Scale Stochastic
Combinatorial
Optimization
arxiv.org
ยท
3d
๐
CUDA Graphs
Heterogeneous
Processing: A Strategy for
Augmenting
Moore's Law (2006)
linuxjournal.com
ยท
19h
ยท
Discuss:
Hacker News
โก
CUDA Programming Patterns
H100
GPU:
Powering
the Next Era of AI and High-Performance Computing
dev.to
ยท
2d
ยท
Discuss:
DEV
๐
NCCL
Graphics
Programming
Conference
graphicsprogrammingconference.com
ยท
17h
๐ฎ
NVIDIA
Linux 6.19 Released With Better Support For
Older
AMD GPUs,
DRM
Color Pipeline API
phoronix.com
ยท
12h
๐
Nsight
The Avatar Cache:
Enabling
On-Demand Security with
Morphable
Cache Architecture
arxiv.org
ยท
4h
โก
CUDA Programming Patterns
๐ฆ Lance x
DuckDB
SQL Retrieval, ๐ Uber-Scale Storage, โก 1.5M
IOPS
lancedb.com
ยท
9h
๐ณ
Git Internals
**Abstract:** Modern ray tracing implementations in real-time rendering engines face significant performance bottlenecks in
fragment
shaders due to the
compl
...
freederia.com
ยท
4d
๐
Nsight
30,000 NVIDIA
Engineers
Use Generative AI for 3x Higher Code
Output
techpowerup.com
ยท
1d
๐ฎ
NVIDIA
GeForce Now
officially
arrives
on Linux with native beta application for Ubuntu users
mixvale.com.br
ยท
6h
๐ฎ
NVIDIA
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help