🔍 Nsight
GPU Profiling, CUDA Debugging, Performance Analysis, Trace
Scoured 115854 posts in 2.09 s
How Anam Achieved 250% Faster Inference Using Zymtrace Continuous GPU Profiling
zymtrace.com · 2d
⏱️ CUDA Events
ollama on bazzite.gg on AMD GPU
lemmy.ml · 36m
🎮 NVIDIA
Show HN: GPU ROI simulator based on token usage and model architecture
axiomos.ai · 21h · Discuss: Hacker News
📈 GPU Occupancy
lightonai/next-plaid: Multi-vector search
github.com · 3h
🔄 ONNX
Inside Mesa 26.0's RADV RT improvements
pixelcluster.github.io · 1d · Discuss: Lobsters, Hacker News, r/linux_gaming
🔧 PTX
Software Space Analytics: Towards Visualization and Statistics of Internal Software Execution
arxiv.org · 1d
📊 Profiling Tools
LLM Performance in Astro, React, Tailwind and Cloudflare
10xbench.ai · 13h · Discuss: Hacker News
⏱️ Benchmarking
Not So Fast: Analyzing the Performance of WebAssembly vs. Native Code
usenix.org · 3h
🏗️ Build Optimization
Megapixel Ventana Deep Matte at ISE 2026 + HELIOS + AMD compute: 1000 nit HDR microLED tile demo
armdevices.net · 1h
⏱️ Benchmarking
Anubis OSS — Local LLM Benchmarking for Apple Silicon
devpadapp.com · 1d · Discuss: r/opensource
📊 Profiling Tools
DVTRGA2 The Official Graphics Engine of Neuro‑OS Genesis Enters a New Era
dev.to · 9h · Discuss: DEV
🔧 PTX
What Nvidia, Google and Meta Are Building Beyond Chips and Compute
pymnts.com · 10h
⏱️ CUDA Events
tencent/hunyuan-3d-3.1
replicate.com · 2h
🔄 ONNX
Guney-olu/nanoslg: A from-scratch implementation of distributed LLM inference in simple readable Python
github.com · 2d · Discuss: Hacker News, r/LLM
⏱️ CUDA Events
DLLM Agent: See Farther, Run Faster
arxiv.org · 1d
⚡ ONNX Runtime
Hitting 1,000 tokens per second on a single RTX 5090
blog.alpindale.net · 2d · Discuss: Hacker News, Hacker News
🎛️ CUDA Optimization
RTX 2060
forums.anandtech.com · 13h
📈 GPU Occupancy
Container Timing: measuring web components performance
blogs.igalia.com · 17h · Discuss: Hacker News
⏱️ CUDA Events
Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization
machinelearning.apple.com · 1d
⏱️ CUDA Events
[AINews] Qwen Image 2 and Seedance 2
latent.space · 5h
🔄 ONNX