CUDA Linear Algebra, Matrix Operations, GPU BLAS, cuBLASLt

Jordan triple system
ncatlab.org·2d
🔗Kernel Fusion
Flag this post
Math's New Muse: AI as a Reasoning Partner
dev.to·8h·
Discuss: DEV
🎯Tensor Cores
Flag this post
Deep Integration and the Convergence of Model Architecture and Hardware in AI
dev.to·9h·
Discuss: DEV
🎯Tensor Cores
Flag this post
Cells, Queries, and Chaos: The Game of Life in SQL!
dev.to·15h·
Discuss: DEV
✂️CUTLASS
Flag this post
Opportunistically Parallel Lambda Calculus
dl.acm.org·3d·
Discuss: Hacker News
💡LSP
Flag this post
Unlock Linear Solver Speed: Symbolic Preconditioning for Hyper-Performance
dev.to·4d·
Discuss: DEV
🎯Tensor Cores
Flag this post
Comparing images with AVX
dev.to·12h·
Discuss: DEV
🔄SIMD Programming
Flag this post
I'm a beginner at C and I would like feedback about the optimisation of my code
reddit.com·15h·
🔍Type Checkers
Flag this post
FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model
paperium.net·4h·
Discuss: DEV
🧩Attention Kernels
Flag this post
CEO Interview with Wilfred Gomes of Mueon Corporation
semiwiki.com·13h
Flash Attention
Flag this post
Weak-To-Strong Generalization
lesswrong.com·1d
📉Model Quantization
Flag this post
DC4GS: Directional Consistency-Driven Adaptive Density Control for 3D Gaussian Splatting
arxiv.org·41m
🧮cuDNN
Flag this post
You Don't Always Need Grafana for GPU Monitoring
dev.to·1d·
Discuss: DEV
🔍Nsight
Flag this post
Iterators - Dive into Lazy, Composable Processing
itsfoxstudio.substack.com·19h·
Discuss: r/rust
🦀Rust
Flag this post
Computational Complexity and Explanations in Physics
gilkalai.wordpress.com·8h
🔄ONNX
Flag this post
Generation at the Speed of Thought: Speculative Decoding
bittere.substack.com·18h·
Discuss: Substack
Flash Attention
Flag this post
Using GNU toolchain for Windows kernel-mode drivers
dev.to·41m·
Discuss: DEV
🏗️Build Systems
Flag this post
Adaptive continuity-preserving simplification of street networks
sciencedirect.com·1d
🌐Distributed Computing
Flag this post