🔢 cuBLAS - miterion · Scour

My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X

gau-nernst.github.io·4h

Discuss: Hacker News

🎯GPU Kernels

Can-t stop till you get enough

cant.bearblog.dev·11h

Discuss: Hacker News

📜TorchScript

Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs

arxiv.org·42m

onedraw — a GPU-driven 2D renderer

dev.to·16h

Discuss: DEV

A hitchhiker's guide to CUDA programming

seanzhang.me·3d

Discuss: Hacker News

🎯GPU Kernels

ZkML Breakthrough: 13B Models Verified in 15 Minutes

lightcapai.medium.com·13h

Discuss: Hacker News

🎯Tensor Cores

Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)

sebastianraschka.com·2h

Discuss: r/LLM

👁️Attention Optimization

I made a tensor runtime & inference framework in C (good for learning how inference works)

github.com·4h

Discuss: r/C_Programming

📜TorchScript

Intel's killed-off BMG-X3/X4 GPUs: 3D stacked die, up to 40 GPU cores, 512MB Adamantine cache

tweaktown.com·8h

A Practitioner's Guide to Kolmogorov-Arnold Networks

arxiviq.substack.com·11h

Discuss: Substack

📉Model Quantization

Scalable In-Memory Associative Processing for Graph Neural Network Inference

dev.to·17h

Discuss: DEV

⚡Flash Attention

A unified threshold-constrained optimization framework for consistent and interpretable cross-machine condition monitoring

sciencedirect.com·1d

⏱️Benchmarking

Performance evaluation of image convolution with gradient filters in OpenCL

milania.de·4d

Discuss: Hacker News

Deep Neural Watermarking for Robust Copyright Protection in 3D Point Clouds

arxiv.org·42m

Text rendering and effects using GPU-computed distances

blog.pkh.me·1d

The Evolution of GPUs: How Floating-Point Changed Computing

dell.com·15h

Discuss: Hacker News

🎯Tensor Cores

ClipTagger-12B VLM: Frame Captioning Tutorial

dev.to·13h

Discuss: DEV

Programming for Computations: Matlab/Octave

link.springer.com·58m

Discuss: Hacker News

🔄SIMD Programming

[CrabGraph] A Modern, Safe, and Ergonomic Rust Cryptography Library

reddit.com·17h

Discuss: r/rust

Integer overflow checking with C23

blog.gnoack.org·9h

🔬Static Analysis
