GPU Assembly, CUDA ISA, Kernel Optimization, Low-level Programming

Feeds to Scour
SubscribedAll
Scoured 82819 posts in 1.32 s
The Linux graphics stack in a nutshell, part 1
lwn.net·4h·
Discuss: Hacker News
📈GPU Occupancy
Preview
Report Post
grgalex/nvshare: Practical GPU Sharing Without Memory Size Constraints
github.com·21h
🎯GPU Kernels
Preview
Report Post
Anthropic's Performance Take-Home: A 65x Optimization (For Dummies)
ikot.blog·23h·
Discuss: Hacker News
🎛️CUDA Optimization
Preview
Report Post
I Burned $500 on GPU Cloud Credits: A Developer's Pivot to Multi-Model APIs
dev.to·6h·
Discuss: DEV
🔄ONNX
Preview
Report Post
Design of a GPU with Heterogeneous Cores for Graphics
arxiv.org·2d
🎯GPU Kernels
Preview
Report Post
TACC Explores Mixed Precision And FP64 Emulation For HPC With Horizon
nextplatform.com·16h
🔍Nsight
Preview
Report Post
Using Nsight Compute with large codebases - Part 2 : Profiling large code bases
blog.ncompass.tech·21h·
Discuss: Hacker News
🔍Nsight
Preview
Report Post
AMD Intros Kintex UltraScale+ Gen 2 FPGAs
servethehome.com·6h
🔄SIMD Programming
Preview
Report Post
Hetccl Shows Scaling Of Multi-Vendor GPU Clusters For Large Language Models
quantumzeitgeist.com·13h
🔗NCCL
Preview
Report Post
The Heartbeat of Tetris 🟥🟥🟥🟥: What a 1x1 Pixel Taught Me About Concurrency
qianarthurwang.substack.com·20h·
Discuss: r/programming
CUDA Programming Patterns
Preview
Report Post
The SPECviewperf benchmark reaches milestone
jonpeddie.com·19h
🔍Nsight
Preview
Report Post
Intel Arc GPUs are having their moment, and nobody is noticing
xda-developers.com·1d
📈GPU Occupancy
Preview
Report Post
**Abstract:** This paper introduces a novel framework for solving complex geometric inequalities based on the Arithmetic-Geometric Mean Inequality (AM-GM) an...
freederia.com·5h
🔢cuBLAS
Preview
Report Post
HW-Triggered Backdoors Across Common GPU Accelerators (BIFOLD, TU Berlin, CISPA)
semiengineering.com·1d
⏱️CUDA Events
Preview
Report Post
WebGPU Compute Shaders
webgpufundamentals.org·5h
🎮NVIDIA
Preview
Report Post
DaguangZhou/TitanShell: 🦾 Advanced tactical desktop client for OpenClaw — Secure, fast, and beautiful
github.com·6h·
Discuss: Hacker News
📦uv
Preview
Report Post
Millets: A practical memory-safety and thread-safety experiment
eagledot.xyz·1d·
⚙️Systems Programming
Preview
Report Post
Claude Code's renderer is more complex than a game engine
spader.zone·1d·
Discuss: Hacker News
📈GPU Occupancy
Preview
Report Post
How Virtual Textures Really Work
shlom.dev·1h·
Discuss: Hacker News
📈GPU Occupancy
Preview
Report Post
Intel attacks the workstation segment with Xeon 600 featuring up to 86 cores and a new platform
igorslab.de·8h
🧠CPU Architecture
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help