🎮 GPU Microarchitecture
GPU ISA, shader cores, warp scheduling, SIMT execution
Scoured 184446 posts in 25.0 ms

FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
🌐 Distributed Systems · arxiv.org · 10h

Pragmata: Benchmarks with NVIDIA DLSS 4 and NVIDIA DLSS 4.5
⚡ PTX · en.gamegpu.com · 5d

Tile Kernels: An optimized GPU kernels library written in TileLang
⚡ PTX · github.com · 2d · Hacker News

Microsoft previews Shader Model 6.10 with a matrix math API, making neural rendering a standard DirectX feature
🔴 ROCm · tweaktown.com · 1h

Announcing Shader Model 6.10 Preview, Including Batched Asynchronous Command List APIs
⚙️ PTX-to-SASS · devblogs.microsoft.com · 2d · r/hardware

Building an x86 Gaming PC Without Intel, NVIDIA or AMD parts
🔧 Custom CPUs · hackaday.com · 6h

GPU Scheduling in Kubernetes: Why It Starts Before the Scheduler
⏱️ Scheduler Internals · rack2cloud.com · 2h · DEV

Gluon & Linear Layouts Deep-Dive: Tile-Based GPU Programming with Low-Level Control [video]
🗄️ CUDA Memory · youtube.com · 6d · Hacker News

Facilitating Complex SoC Design Through Automation And Integration
⚙️ ISA Design · semiengineering.com · 7h

ASUS GeForce RTX 5090 Matrix Platinum Review - 800 W Powerhouse
🔴 ROCm · techpowerup.com · 1d

Godot 4 Rendering Backends: A Technical Comparison
🖥️ GPU Drivers · slicker.me · 9h · r/godot

I've been running some of the biggest open-weight LLMs for free on Nvidia's cloud
🔴 ROCm · xda-developers.com · 4h

Reimagining Kernel Generation at the PTX Layer: An LLM System Learning from DSLs to Outperform Them
⚡ PTX · standardkernel.com · 2d · Hacker News

GPU vs CPU Inference: 5 Scenarios, Real Costs & Latency
🔴 ROCm · tildalice.io · 6d

What it takes to transpose a matrix (2024)
⚡ PTX · gudok.xyz · 13h · Lobsters, Hacker News

Aphelion Benchmarks & PC Performance Analysis
🔴 ROCm · dsogaming.com · 2d

Computational Sculpture
⚡ PTX · hackster.io · 21h

Expanding Processing’s Future With a Rust Rendering Engine (lgm2026)
🔧 Compilers · cdn.media.ccc.de · 6d

How we built the most performant DeepSeek V3.2, MiniMax-M2.5 and Qwen 3.5 397B on DigitalOcean NVIDIA HGX™ B300 GPU Droplets
🏗️ AI Infrastructure · digitalocean.com · 2d

From Standards To Systems: The Chiplet Era On Arm
⚙️ ISA Design · semiengineering.com · 7h