Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🏎️ TensorRT
Inference Optimization, Model Deployment, NVIDIA, Quantization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80694
posts in
719.6
ms
Automating Inference Optimizations with NVIDIA
TensorRT
LLM
AutoDeploy
developer.nvidia.com
·
23h
⚡
ONNX Runtime
TreeTensor
: Boost AI System on
Nested
Data with Constrained Tree-Like Tensor
arxiv.org
·
13h
🎯
Tensor Cores
Tutorial – What is a
variational
autoencoder
?
jaan.io
·
1d
·
Discuss:
Hacker News
📉
Model Quantization
Quantized
Tensor Train Compression For Turbulent Flow Simulation: O(log N) Scaling with
Reynolds-Independent
Bond Dimension
zenodo.org
·
1d
·
Discuss:
Hacker News
📉
Model Quantization
Parallel Track Transformers:
Enabling
Fast GPU Inference with Reduced
Synchronization
machinelearning.apple.com
·
18h
⏱️
CUDA Events
Writing a
ONNX
Neural Network Inference Engine from Scratch in C to run image classification with
MobileNetV2
flexw.github.io
·
1d
·
Discuss:
r/C_Programming
⚡
ONNX Runtime
Turning Any Model into an XAI-Ready Model:
Formats
and
Gradient
Flow
dev.to
·
6h
·
Discuss:
DEV
📜
TorchScript
Quantization-Aware
Distillation
ternarysearch.blogspot.com
·
2d
·
Discuss:
Hacker News
📉
Model Quantization
Antirez
Strikes Again: The Creator of
Redis
Builds a Bare-Metal Vision AI in Pure C — And It Actually Works
webpronews.com
·
4h
🎯
Tensor Cores
LQA
: A Lightweight
Quantized-Adaptive
Framework for Vision-Language Models on the Edge
arxiv.org
·
13h
📉
Model Quantization
Faster
AI Training
Unlocked
With New System For Massive Language Models
quantumzeitgeist.com
·
1d
🎯
Tensor Cores
Deep transfer learning based on cross-domain
subsequence
alignment and feature contribution
interpretation
for remaining useful life prediction
sciencedirect.com
·
2h
🧮
cuDNN
A
Time-Synchronized
Multi-Sensor drone dataset acquired from multiple
radars
and RF receiver
nature.com
·
5h
🔗
Kernel Fusion
Autoregressive
Model Beats Diffusion:
Llama
for Scalable Image Generation
paperium.net
·
4d
·
Discuss:
DEV
📊
Gradient Accumulation
Colab
marketplace.visualstudio.com
·
4h
✂️
CUTLASS
How2Everything
: Mining the web to evaluate and improve LLMs on real-world
procedures
allenai.org
·
1h
·
Discuss:
Hacker News
⏱️
Benchmarking
Build Voice AI in Python: Complete Speech-to-Text Developer Guide (2026)
dev.to
·
3h
·
Discuss:
DEV
🤖
AI Coding Tools
Trainy-ai/pluto
: Next Generation Experimental Tracking for Machine Learning Operations
github.com
·
22h
·
Discuss:
Hacker News
🚀
MLOps
How
Anam
Achieved 250% Faster Inference Using
Zymtrace
Continuous GPU Profiling
zymtrace.com
·
1d
🔍
Nsight
MiRAGE
: Open-source framework for multimodal
RAG
evaluation
news.ycombinator.com
·
2h
·
Discuss:
Hacker News
🧮
cuDNN
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help