Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🏎️ TensorRT
Specific
Inference Optimization, Model Deployment, NVIDIA, Quantization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
146590
posts in
12.4
ms
3DTurboQuant
: Training-Free Near-Optimal
Quantization
for 3D Reconstruction Models
📉
Model Quantization
arxiv.org
·
1d
How Do You Actually Scale
High-Throughput
LLM Serving in Production with
vLLM
?
🔍
Nsight
medium.com
·
4d
I Ran My
KYB
Engine at Three
Quantization
Levels. Accuracy Didn't Move. Cost Dropped 6x.
⏱️
Benchmarking
walsenburgtech.com
·
1h
·
Hacker News
0din-ai/ai-scanner
: AI model safety scanner built on NVIDIA
garak
🤖
AI Coding Tools
github.com
·
6h
·
Hacker News
Attn-QAT
: Making 4-Bit Attention Actually Work
📉
Model Quantization
haoailab.com
·
1d
Fast Isn’t Fast Enough:
Redefining
Metrics
for Edge AI
🎯
Tensor Cores
semiengineering.com
·
11h
NVIDIA's
N1
SoC
pictured
on an engineering board with 128GB of memory for local AI
🔍
Nsight
tweaktown.com
·
4h
Google’s
Gemma
4 Model Can Now Be Deployed on NVIDIA’s RTX GPUs,
Delivering
Optimized Performance for a ‘Personalized’ Agentic AI Environment
🎯
GPU Kernels
wccftech.com
·
6d
Build an AI-Powered GPU Fleet
Optimizer
with Gradient
ADK
🔍
Nsight
digitalocean.com
·
2d
Nvidia
Invests
In CPU Chip Startup
SiFive
🎮
NVIDIA
investors.com
·
4h
NVIDIA
DGX
Spark brings
sovereign
AI to your desktop
🔍
Nsight
yourstory.com
·
12h
Cosmos-Predict2.5-2B
Inference: NVIDIA H200 vs AMD
MI300X
🔍
Nsight
moonmath.ai
·
2d
·
Hacker News
Breaking the Memory Wall:
TurboQuant
KV
Cache Quantization on Apple Silicon
⚡
Flash Attention
pub.towardsai.net
·
14h
NVIDIA Neural
Texture
Compression Slashes
VRAM
Usage Over 80% In Games
🎯
GPU Kernels
hothardware.com
·
4d
The $21 billion AI bet: Meta and CoreWeave
ink
deal for NVIDIA’s next-gen
superchips
⚡
ONNX Runtime
coindesk.com
·
6h
SOTA
Normalization
Performance with Torch.compile
⚡
torch.compile
pytorch.org
·
1d
·
Hacker News
Even Nvidia’s own research teams can’t get enough
GPUs
amid the race for AI
computing
power
🔍
Nsight
fortune.com
·
2h
Laptops
Are About to Get Very
Exciting
Again… Thanks to Nvidia
🎮
NVIDIA
gizmodo.com
·
3h
Nvidia's Artificial Intelligence (AI) Chips Still Need Memory. Here's Why the
Micron
Sell-Off Has
Gone
Too Far.
⚡
Flash Attention
finance.yahoo.com
·
9h
Integrate
Physical AI Capabilities into Existing Apps with NVIDIA
Omniverse
Libraries
⏱️
CUDA Events
developer.nvidia.com
·
1d
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help