Mixed Precision, FP16, WMMA, Matrix Multiplication, Deep Learning Acceleration

Jeff Su: 4 Next-Level ChatGPT Techniques (Save 15+ Hours Weekly)
youtube.com·3h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Modeling the geopolitics of AI development
lesswrong.com·23m
🤖AI Coding Tools
Flag this post
We found embedding indexing bottleneck in the least expected place: JSON parsing
nixiesearch.substack.com·1d·
Discuss: Substack
🐕Ruff
Flag this post
Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT
arxiv.org·12h
📉Model Quantization
Flag this post
This feels like the early Internet moment for AI.
threadreaderapp.com·8h
ONNX Runtime
Flag this post
Quantum-Resistant Federated Learning with Homomorphic Encryption for Medical Imaging Diagnostics
dev.to·2d·
Discuss: DEV
🎓Model Distillation
Flag this post
Small Vs. Large Language Models
semiengineering.com·1d·
Discuss: Hacker News, r/LLM
Flash Attention
Flag this post
PDE-SHARP: PDE Solver Hybrids Through Analysis & Refinement Passes
arxiv.org·12h
✂️CUTLASS
Flag this post
DM-QPMNET: Dual-modality fusion network for cell segmentation in quantitative phase microscopy
arxiv.org·12h
🔗Kernel Fusion
Flag this post
Fast, Scalable LDA in C++ with Stochastic Variational Inference
github.com·1d·
Discuss: r/cpp
🏎️TensorRT
Flag this post
The Curvature Rate {\lambda}: A Scalar Measure of Input-Space Sharpness in Neural Networks
arxiv.org·12h
📉Model Quantization
Flag this post
OmniBrainBench: A Comprehensive Multimodal Benchmark for Brain Imaging Analysis Across Multi-stage Clinical Tasks
arxiv.org·12h
🏎️TensorRT
Flag this post
Prog8
github.com·2h·
Discuss: Hacker News
🚀Compiler Optimization
Flag this post
MoSa: Motion Generation with Scalable Autoregressive Modeling
arxiv.org·12h
🏎️TensorRT
Flag this post
ClipTagger-12B VLM: Frame Captioning Tutorial
dev.to·2d·
Discuss: DEV
🔄ONNX
Flag this post
End-to-End Framework Integrating Generative AI and Deep Reinforcement Learning for Autonomous Ultrasound Scanning
arxiv.org·12h
🧮cuDNN
Flag this post
Enhanced Block Copolymer Lithography via Adaptive Stochastic Gradient Descent and Dynamic Mask Optimization
dev.to·4h·
Discuss: DEV
⏱️Benchmarking
Flag this post
Spatial Sense: Unleashing Language Models on Location Data by Arvind Sundararajan
dev.to·20h·
Discuss: DEV
ONNX Runtime
Flag this post
Adversarial Spatio-Temporal Attention Networks for Epileptic Seizure Forecasting
arxiv.org·12h
👁️Attention Optimization
Flag this post
Fleming-VL: Towards Universal Medical Visual Reasoning with Multimodal LLMs
arxiv.org·12h
🏎️TensorRT
Flag this post