Operator Fusion, Memory Bandwidth, Graph Optimization, Intermediate Elimination

MobileNetV3 Paper Walkthrough: The Tiny Giant Getting Even Smarter
towardsdatascience.com·1d
🎯Tensor Cores
Flag this post
Can-t stop till you get enough
cant.bearblog.dev·1d·
Discuss: Hacker News
📜TorchScript
Flag this post
Co-Simulation Framework for Parallel DNN Execution on Chiplet-Based Systems (UW–Madison, Washington State)
semiengineering.com·5h
🌊CUDA Streams
Flag this post
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
arxiv.org·21h
🔗NCCL
Flag this post
FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model
paperium.net·1d·
Discuss: DEV
🧩Attention Kernels
Flag this post
ZkML Breakthrough: 13B Models Verified in 15 Minutes
lightcapai.medium.com·1d·
Discuss: Hacker News
🎯Tensor Cores
Flag this post
Defeating KASLR by Doing Nothing at All
googleprojectzero.blogspot.com·7h·
📊Profiling Tools
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.com·22h·
Discuss: r/LLM
👁️Attention Optimization
Flag this post
Contrastive Knowledge Transfer and Robust Optimization for Secure Alignment of Large Language Models
arxiv.org·21h
🎓Model Distillation
Flag this post
CHIP8 – writing emulator, assembler, example game and VHDL hardware impl
blog.dominikrudnik.pl·5h·
Discuss: Hacker News
🔄SIMD Programming
Flag this post
Enhanced Slater Determinant Calculation via Hybrid Tensor Decomposition & Adaptive Mesh Refinement
dev.to·1d·
Discuss: DEV
🔀Operator Fusion
Flag this post
Don’t Just Normalize, Batch Normalize! A Guide to Stable Neural Networks
pub.towardsai.net·19h
📊Gradient Accumulation
Flag this post
Automated Defect Prediction via Cross-Entropy Regularized Graph Neural Networks for Microservice Architectures
dev.to·59m·
Discuss: DEV
ONNX Runtime
Flag this post
Weak-To-Strong Generalization
lesswrong.com·1d
📉Model Quantization
Flag this post
LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss
machinelearning.apple.com·1d
Flash Attention
Flag this post
Dive into Systems
diveintosystems.org·9h·
Discuss: Hacker News
⚙️Systems Programming
Flag this post
Spiking Neural Networks: The Future of Brain-Inspired Computing
arxiv.org·21h
Flash Attention
Flag this post
Understanding Federated Learning: Best Practices for Implementing Privacy-Preserving AI in C# Projects
dev.to·18h·
Discuss: DEV
🔄ONNX
Flag this post