Resource-Efficient and Robust Inference of Deep and Bayesian Neural Networks on Embedded and Analog Computing Platforms
arxiv.org·2d
🏎️TensorRT
VerfCNN, Optimal Complexity zkSNARK for Convolutional Neural Networks
eprint.iacr.org·2d
📉Model Quantization
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
⚡Flash Attention
Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations
arxiv.org·1d
👁️Attention Optimization
RF-DETR Under the Hood: The Insights of a Real-Time Transformer Detection
towardsdatascience.com·1d
👁️Attention Optimization
Deep Learning — 7 : Optimize your Neural Networks through Dropouts & Regularization.
pub.towardsai.net·1d
📊Gradient Accumulation
New Dataset PerSense-D Enables Model-Agnostic Dense Object Segmentation
hackernoon.com·3d
📊Gradient Accumulation
Show HN: Hot or Slop – Visual Turing test on how well humans detect AI images
👁️Attention Optimization
A hitchhiker's guide to CUDA programming
🎯GPU Kernels
Developing a Multi-task Ensemble Geometric Deep Network for Supply Chain Sustainability and Risk Management
arxiv.org·1d
📊Gradient Accumulation
Contribution-Guided Asymmetric Learning for Robust Multimodal Fusion under Imbalance and Noise
arxiv.org·1d
📉Model Quantization
Run Multimodal Reasoning Agents with NVIDIA Nemotron on vLLM
blog.vllm.ai·1d
🏎️TensorRT
Ubuntu Blog: Why we brought hardware-optimized GenAI inference to Ubuntu
ubuntu.com·2d
⚡ONNX Runtime
Reinforcement learning driven adaptive graph construction for fault diagnosis of chemical processes
sciencedirect.com·2d
🔄ONNX
An intro to the Tensor Economics blog
lesswrong.com·3d
🏎️TensorRT