Mixed Precision, FP16, WMMA, Matrix Multiplication, Deep Learning Acceleration

Mechlands Vibe 99: A superb keyboard with one questionable layout decision
zmescience.com·15h
🧠BF16
Flag this post
Intel, Cisco Collaboration Delivers Industry’s First Systems Approach for AI Workloads at the Edge
newsroom.intel.com·17h
🔗NCCL
Flag this post
Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT
arxiv.org·2d
📉Model Quantization
Flag this post
DRAM Prices Surge 172% YoY with No Signs of Slowing Down
techpowerup.com·1h
Flash Attention
Flag this post
Announcing the fastest inference for realtime voice AI agents
together.ai·2d
🤖AI Coding Tools
Flag this post
AMD Ryzen 7 9700X3D Appears in Leaked PassMark Benchmark
techpowerup.com·17h
📈GPU Occupancy
Flag this post
World Simulation with Video Foundation Models for Physical AI
arxiv.org·2d
🏎️TensorRT
Flag this post
Automated Prompt Generation for Code Intelligence: An Empirical study and Experience in WeChat
arxiv.org·4h
🤖AI Coding Tools
Flag this post
A brief guide for those who slept (on AI) the last two years
dev.to·17h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
arxiv.org·3d
🏎️TensorRT
Flag this post
Tech With Tim: I Let 3 AIs Compete to Build the Same App…
dev.to·7h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
arxiv.org·2d
📊Gradient Accumulation
Flag this post
A nonsurgical brain implant for focal neuromodulation
nature.com·5h·
Discuss: Hacker News
Flash Attention
Flag this post
Kiroween Hackathon: Resurrecting Punch Cards and Discovering Exciting New Experiences with My Old Friend, Kiro IDE
dev.to·1h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Beyond Bandwidth: AI's Quantum Leap in Image Transmission
dev.to·1d·
Discuss: DEV
Flash Attention
Flag this post
VISAT: Benchmarking Adversarial and Distribution Shift Robustness in Traffic Sign Recognition with Visual Attributes
arxiv.org·3d
🧮cuDNN
Flag this post
Bayesian Natural Gradient Fine-Tuning of CLIP Models via Kalman Filtering
arxiv.org·2d
📊Gradient Accumulation
Flag this post
Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries
arxiv.org·2d
ONNX Runtime
Flag this post
BoolSkel: Unlocking Boolean Network Efficiency Through Structural Pruning by Arvind Sundararajan
dev.to·20h·
Discuss: DEV
🔗Kernel Fusion
Flag this post
CARMA: Comprehensive Automatically-annotated Reddit Mental Health Dataset for Arabic
arxiv.org·4h
🛠Ml-eng
Flag this post