Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
arxiv.orgยท42m
๐ฏTensor Cores
Flag this post
Deep Integration and the Convergence of Model Architecture and Hardware in AI
๐ฏTensor Cores
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
๐๏ธAttention Optimization
Flag this post
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
๐คAI Coding Tools
Flag this post
GPU Pro โ Master Your AI Workflow
๐Nsight
Flag this post
A faster problem-solving tool that guarantees feasibility
news.mit.eduยท42m
โกONNX Runtime
Flag this post
AndesVL Technical Report: An Efficient Mobile-side Multimodal Large LanguageModel
๐๏ธTensorRT
Flag this post
onedraw โ a GPU-driven 2D renderer
โ๏ธCUTLASS
Flag this post
VISTA Score: Verification In Sequential Turn-based Assessment
arxiv.orgยท42m
๐ONNX
Flag this post
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.comยท1d
๐ง CPU Architecture
Flag this post
I made a tensor runtime & inference framework in C (good for learning how inference works)
๐TorchScript
Flag this post
Rethinking Networking for the AI/ML Era
lukew.comยท2d
๐Distributed Computing
Flag this post
Federico Biancuzzi, Shane Warden, & Anders Hejlsberg
deprogrammaticaipsum.comยท2h
๐กLSP
Flag this post
Opportunistically Parallel Lambda Calculus
๐กLSP
Flag this post
Loading...Loading more...