90% RAM usage while gaming
GPU Occupancy
A hitchhiker's guide to CUDA programming
GPU Kernels
TIL: For long-lived LLM sessions, swapping KV Cache to RAM is ~10x faster than recalculating it. Why isn't this a standard feature?
Loop Tiling
Some Fun Videos on Optimizing NES Code
bumbershootsoft.wordpress.com·14h
Compiler Optimization
Utilizing Chiplet-Locality For Efficient Memory Mapping In MCM GPUs (ETRI, Sungkyunkwan Univ.)
semiengineering.com·2d
CUDA Memory Management
Microstutter in games? Your RGB software might be why
howtogeek.com·18h
CUDA Events
I tested Arc Raiders across four GPUs of different ages - optimization still exists
xda-developers.com·17h
PTX
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
NCCL
A unified threshold-constrained optimization framework for consistent and interpretable cross-machine condition monitoring
sciencedirect.com·13h
Benchmarking
Beyond LZ4 Limits, Logging at high speed with on-the-fly compression
Profiling Tools
DGX Spark UMA can trick you
CUDA Memory Management
Well-Typed.Com: Case Study: Debugging a Haskell space leak
well-typed.com·2d
Profiling Tools
Challenging the Fastest OSS Workflow Engine
PTX
PCI Resizable BAR Improvements Heading To Linux 6.19
phoronix.com·21h
Profiling Tools