Mixed Precision, FP16, WMMA, Matrix Multiplication, Deep Learning Acceleration

InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames
arxiv.org·1d
🏎️TensorRT
LinEAS: End-to-end Learning of Activation Steering with a Distributional Loss
machinelearning.apple.com·1d
Flash Attention
Region-Aware Reconstruction Strategy for Pre-training fMRI Foundation Model
arxiv.org·9h
📊Gradient Accumulation
How can I use an STM32 and FPGA together for a CNN-based face recognition project?
reddit.com·1d·Discuss: r/embedded
📉Model Quantization
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com·1d·Discuss: Substack
📉Model Quantization
The True Cost of AI Integrations: Comparing Performance and Pricing Models for C# Libraries
dev.to·13h·Discuss: DEV
🤖AI Coding Tools
Unlocking AI Potential: Squeezing Giant Models into Tiny Spaces
dev.to·1d·Discuss: DEV
📉Model Quantization
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts
arxiv.org·9h
🧮cuDNN
From Uniform to Adaptive: General Skip-Block Mechanisms for Efficient PDE Neural Operators
arxiv.org·9h
🏎️TensorRT
Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT
arxiv.org·9h
📉Model Quantization
Alpamayo-R1: Bridging Reasoning and Action Prediction for Generalizable Autonomous Driving in the Long Tail
arxiv.org·9h
🏎️TensorRT
QuantumBench: A Benchmark for Quantum Problem Solving
arxiv.org·9h
🔄ONNX
This feels like the early Internet moment for AI.
threadreaderapp.com·4h
ONNX Runtime
Quantum-Resistant Federated Learning with Homomorphic Encryption for Medical Imaging Diagnostics
dev.to·2d·Discuss: DEV
🎓Model Distillation
CueBench: Advancing Unified Understanding of Context-Aware Video Anomalies in Real-World
arxiv.org·9h
🧮cuDNN
Small Vs. Large Language Models
semiengineering.com·1d·Discuss: Hacker News, r/LLM
Flash Attention
PDE-SHARP: PDE Solver Hybrids Through Analysis & Refinement Passes
arxiv.org·9h
✂️CUTLASS
DM-QPMNET: Dual-modality fusion network for cell segmentation in quantitative phase microscopy
arxiv.org·9h
🔗Kernel Fusion
Fast, Scalable LDA in C++ with Stochastic Variational Inference
github.com·23h·Discuss: r/cpp
🏎️TensorRT