Lowering in Reverse
buttondown.comยท13h
๐Ruff
Flag this post
Design of quasi phase matching crystal based on differential gray wolf algorithm
arxiv.orgยท3h
๐Distributed Computing
Flag this post
LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
arxiv.orgยท3h
โกFlash Attention
Flag this post
A hitchhiker's guide to CUDA programming
๐ฏGPU Kernels
Flag this post
Building blobd: single-machine object store with sub-millisecond reads and 15 GB/s uploads
๐ณGit Internals
Flag this post
Intel's killed-off BMG-X3/X4 GPUs: 3D stacked die, up to 40 GPU cores, 512MB Adamantine cache
tweaktown.comยท1d
๐งPTX
Flag this post
Tetris: An SLA-aware Application Placement Strategy in the Edge-Cloud Continuum
arxiv.orgยท3h
๐Distributed Computing
Flag this post
A unified threshold-constrained optimization framework for consistent and interpretable cross-machine condition monitoring
sciencedirect.comยท2d
โฑ๏ธBenchmarking
Flag this post
Fungus: The Befunge CPU(2015)
โ๏ธSystems Programming
Flag this post
rqlite 9.2 โ the Distributed SQLite Database โ Fast Restarts with GB Datasets
๐๏ธBuild Optimization
Flag this post
Entropy in algorithm analysis
11011110.github.ioยท2d
๐Kernel Fusion
Flag this post
Transformer-Based Decoding in Concatenated Coding Schemes Under Synchronization Errors
arxiv.orgยท3h
โกFlash Attention
Flag this post
Dissecting my MiniBanners program โ part 1
subethasoftware.comยท13h
โ๏ธCUTLASS
Flag this post
Loading...Loading more...