Mixed Precision, FP16, WMMA, Matrix Multiplication, Deep Learning Acceleration

Jeff Su: 4 Next-Level ChatGPT Techniques (Save 15+ Hours Weekly)
youtube.com·15h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Modeling the geopolitics of AI development
lesswrong.com·12h
🤖AI Coding Tools
Flag this post
This feels like the early Internet moment for AI.
threadreaderapp.com·20h
ONNX Runtime
Flag this post
A Decade of AI Platform at Pinterest
medium.com·11h
🚀MLOps
Flag this post
Real-time Semantic Segmentation for AR Glasses: Dynamic Occlusion Handling via Bayesian Fusion
dev.to·21h·
Discuss: DEV
🏎️TensorRT
Flag this post
Fleming-VL: Towards Universal Medical Visual Reasoning with Multimodal LLMs
arxiv.org·1d
🏎️TensorRT
Flag this post
Measuring the Intrinsic Dimension of Earth Representations
arxiv.org·49m
🏎️TensorRT
Flag this post
AndesVL Technical Report: An Efficient Mobile-side Multimodal Large LanguageModel
paperium.net·3d·
Discuss: DEV
🏎️TensorRT
Flag this post
Benchmarking Generative AI Against Bayesian Optimization for Constrained Multi-Objective Inverse Design
arxiv.org·1d
🎓Model Distillation
Flag this post
ClipTagger-12B VLM: Frame Captioning Tutorial
dev.to·2d·
Discuss: DEV
🔄ONNX
Flag this post
PDE-SHARP: PDE Solver Hybrids Through Analysis & Refinement Passes
arxiv.org·1d
✂️CUTLASS
Flag this post
Announcing the fastest inference for realtime voice AI agents
together.ai·1d
🤖AI Coding Tools
Flag this post
BondBERT: What we learn when assigning sentiment in the bond market
arxiv.org·49m
🏎️TensorRT
Flag this post
Hyper Hawkes Processes: Interpretable Models of Marked Temporal Point Processes
arxiv.org·1d
🏎️TensorRT
Flag this post
Multi-refined Feature Enhanced Sentiment Analysis Using Contextual Instruction
arxiv.org·1d
🛠Ml-eng
Flag this post
DecompSR: A dataset for decomposed analyses of compositional multihop spatial reasoning
arxiv.org·49m
🔄ONNX
Flag this post
Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play
arxiv.org·1d
🔄ONNX
Flag this post
Spiking Neural Networks: The Next Leap in AI Power Efficiency by Arvind Sundararajan
dev.to·4h·
Discuss: DEV
ONNX Runtime
Flag this post
ARC-GEN: A Mimetic Procedural Benchmark Generator for the Abstraction and Reasoning Corpus
arxiv.org·1d
🤖AI Coding Tools
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·1d
🔄ONNX
Flag this post