Mixed Precision, FP16, WMMA, Matrix Multiplication, Deep Learning Acceleration

Generation at the Speed of Thought: Speculative Decoding
bittere.substack.com·1d·
Discuss: Substack
Flash Attention
Flag this post
Engineering.ai: A Platform for Teams of AI Engineers in Computational Design
arxiv.org·4h
🔄ONNX
Flag this post
A Thesis and Playbook for Edge AI
ondeviceguy.substack.com·22h·
Discuss: Substack
ONNX Runtime
Flag this post
Spiking Neural Networks: The Future of Brain-Inspired Computing
arxiv.org·1d
Flash Attention
Flag this post
Linear Differential Vision Transformer: Learning Visual Contrasts via Pairwise Differentials
arxiv.org·4h
👁️Attention Optimization
Flag this post
The best AI inference for your project. Blazing fast responses.
dev.to·9h·
Discuss: DEV
Flash Attention
Flag this post
I repurposed my old GPU for self-hosted AI and it changed my life
xda-developers.com·17h
🤖AI Coding Tools
Flag this post
Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation
arxiv.org·4h
🧮cuDNN
Flag this post
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.org·4h
🏎️TensorRT
Flag this post
DCcluster-Opt: Benchmarking Dynamic Multi-Objective Optimization for Geo-Distributed Data Center Workloads
arxiv.org·4h
🔗NCCL
Flag this post
Panther: A Cost-Effective Privacy-Preserving Framework for GNN Training and Inference Services in Cloud Environments
arxiv.org·4h
🔄ONNX
Flag this post
Multi-refined Feature Enhanced Sentiment Analysis Using Contextual Instruction
arxiv.org·4h
🛠Ml-eng
Flag this post
InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames
arxiv.org·1d
🏎️TensorRT
Flag this post
Region-Aware Reconstruction Strategy for Pre-training fMRI Foundation Model
arxiv.org·4h
📊Gradient Accumulation
Flag this post
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com·1d·
Discuss: Substack
📉Model Quantization
Flag this post
The True Cost of AI Integrations: Comparing Performance and Pricing Models for C# Libraries
dev.to·8h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Unlocking AI Potential: Squeezing Giant Models into Tiny Spaces
dev.to·1d·
Discuss: DEV
📉Model Quantization
Flag this post
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts
arxiv.org·4h
🧮cuDNN
Flag this post