A hitchhiker's guide to CUDA programming
๐ฏGPU Kernels
Flag this post
A unified threshold-constrained optimization framework for consistent and interpretable cross-machine condition monitoring
sciencedirect.comยท13h
โฑ๏ธBenchmarking
Flag this post
Challenging the Fastest OSS Workflow Engine
๐งPTX
Flag this post
Opportunistically Parallel Lambda Calculus
๐กLSP
Flag this post
The next RISC-V processor frontier: AI
edn.comยท1d
๐ง CPU Architecture
Flag this post
Async/Await is finally back in Zig
โฑ๏ธCUDA Events
Flag this post
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
โกFlash Attention
Flag this post
Rubin, Vera and the 1800-watt question: Nvidia shows off its future and prepares for the next AI storm
igorslab.deยท2d
โฑ๏ธCUDA Events
Flag this post
A Hybrid Reconstruction Framework for Efficient High-Order Shock-Capturing on Unstructured Meshes
arxiv.orgยท2d
โ๏ธCUTLASS
Flag this post
Machine-learning predictive autoscaling for Flink
engineering.grab.comยท3d
โฑ๏ธCUDA Events
Flag this post
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
๐NCCL
Flag this post
How fast can an LLM go?
๐๏ธTensorRT
Flag this post
Inference Acceleration from the Ground Up
semiwiki.comยท3d
๐ฏTensor Cores
Flag this post
A portable picokernel for async I/O
๐Profiling Tools
Flag this post
Loading...Loading more...