Mixed Precision, FP16, WMMA, Matrix Multiplication, Deep Learning Acceleration

UCD buys €724,000 Nvidia supercomputer for AI-led research boost
siliconrepublic.com·6h
📊Gradient Accumulation
My computer has $18 million worth of RAM in it
aardvark.co.nz·57m
Flash Attention
mkinitcpio v40 released and now in core-testing
lists.archlinux.org·18h
📊Profiling Tools
Indexing sparse vectors with Turso
turso.tech·17h
🔗Kernel Fusion
The Craft of Science with AI: Evidence, Judgment, and Practice
datasociety.net·2h
🤖AI Coding Tools
We found embedding indexing bottleneck in the least expected place: JSON parsing
nixiesearch.substack.com·2d
🐕Ruff
Continuous cell-type diversification in mouse visual cortex development
nature.com·4m
🧩Attention Kernels
A Decade of AI Platform at Pinterest
medium.com·23h
🚀MLOps
This feels like the early Internet moment for AI.
threadreaderapp.com·1d
ONNX Runtime
Real-time Semantic Segmentation for AR Glasses: Dynamic Occlusion Handling via Bayesian Fusion
dev.to·1d
🏎️TensorRT
Spatial Sense: Unleashing Language Models on Location Data by Arvind Sundararajan
dev.to·1d
ONNX Runtime
Variational Geometric Information Bottleneck: Learning the Shape of Understanding
arxiv.org·12h
🏎️TensorRT
A probabilistic histological atlas of the human brain for MRI segmentation
nature.com·4m
📊Gradient Accumulation
Chaos-inspired active learning for physics-informed neural networks to assess the reliability of multi-state systems
sciencedirect.com·1h
🎓Model Distillation
A brief guide for those who slept (on AI) the last two years
dev.to·1h
🤖AI Coding Tools
T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis
arxiv.org·2d
🏎️TensorRT
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
arxiv.org·1d
📊Gradient Accumulation
Beyond Bandwidth: AI's Quantum Leap in Image Transmission
dev.to·1d
Flash Attention
Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries
arxiv.org·1d
ONNX Runtime