Mixed Precision, FP16, WMMA, Matrix Multiplication, Deep Learning Acceleration

How We Built a Custom Vision LLM to Improve Document Processing at Grab
engineering.grab.com·17h·Discuss: Hacker News
🏎️TensorRT
The Role of GPUs in Accelerating Deep Learning Training
acecloud.ai·5d·Discuss: DEV
🔗NCCL
Inference Acceleration from the Ground Up
semiwiki.com·6d
🧠CPU Architecture
Honest take: I tested 12+ AI vibe coding tools, but this one actually surprised me
vibe.forem.com·5h·Discuss: DEV
🤖AI Coding Tools
Probabilistic Robustness for Free? Revisiting Training via a Benchmark
arxiv.org·12h
📊Gradient Accumulation
A Comparative Analysis of LLM Adaptation: SFT, LoRA, and ICL in Data-Scarce Scenarios
arxiv.org·12h
🎓Model Distillation
Scalable In-Memory Associative Processing for Graph Neural Network Inference
dev.to·2d·Discuss: DEV
Flash Attention
Will Spiking Neural Nets Revolutionize AI by Mimicking Brain Efficiency? by Arvind Sundararajan
dev.to·14h·Discuss: DEV
Flash Attention
Building WriteRight: My Journey Creating an AI Writing Assistant with Mastra
dev.to·17h·Discuss: DEV
🤖AI Coding Tools
T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
arxiv.org·1d
🏎️TensorRT
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
arxiv.org·12h
📊Gradient Accumulation
Beyond Bandwidth: AI's Quantum Leap in Image Transmission
dev.to·6h·Discuss: DEV
Flash Attention
DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models
arxiv.org·1d
🛠Ml-eng
VISAT: Benchmarking Adversarial and Distribution Shift Robustness in Traffic Sign Recognition with Visual Attributes
arxiv.org·1d
🧮cuDNN
Bayesian Natural Gradient Fine-Tuning of CLIP Models via Kalman Filtering
arxiv.org·12h
📊Gradient Accumulation
LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss
machinelearning.apple.com·1d
Flash Attention
Algorithmic Alchemy: Transmuting Dynamic Programming with Gradients by Arvind Sundararajan
dev.to·53m·Discuss: DEV
📊Gradient Accumulation
Running MiniMax-M2 locally - Existing Hardware Advice
reddit.com·1h·Discuss: r/LocalLLaMA
🔧PTX