Scour
🏎️ TensorRT
Specific
Inference Optimization, Model Deployment, NVIDIA, Quantization
Scoured 104 posts in 8.4 ms
Cerebras: The $56.4 Billion IPO Challenging NVIDIA’s Memory Wall
⚡ Flash Attention
artificialintelligencemadesimple.com · 2d
Unleashing the Power of ONNX for Speedier SBERT Inference
🔄 ONNX
towardsai.net · 2d
AMD makes FSR 4 upscaling official for Radeon RX 7000- and 6000-series cards — RDNA 3 and RDNA 2 chips will soon enjoy improved visuals
🎮 NVIDIA
tomshardware.com · 1w
Instant GPU Efficiency Visibility at Fleet Scale
⏱️ CUDA Events
arxiv.org · 13h
TFLite Model Conversion: 10 Commands That Actually Work
📉 Model Quantization
tildalice.io · 3d
kouhxp/yapsnap: Snap any video URL or audio file into plaintext. No GPU. No cloud. One command.
🔓 Open-source
github.com · 19h · Hacker News
Google Tensor SDK Beta with LiteRT
🎯 Tensor Cores
developers.googleblog.com · 1d
AMD Confirms FSR 4.1 Support for Radeon RX 7000 in July, RX 6000 GPUs Get it in 2027
🔍 Nsight
gizchina.com · 6d
ADI to Acquire IVR Tech to Join Data Center’s Power Gold Rush
🔧 PTX
eetimes.com · 2d
Show HN: FlashAttention-2 in Cute, from Scratch
⚡ Flash Attention
blog.echen.io · 3d · Hacker News
PyTorch Triton Kernel Transparent Tracing and Compilation
⚡ torch.compile
leimao.github.io · 17h
Training a 22MB prompt injection classifier
📊 Gradient Accumulation
stackone.com · 1d · Hacker News
AMD FSR 4.1 Coming to RDNA 2 Will Benefit Xbox Series X More Than PS5 Due to Hardware, SDK – Rumor
🔧 PTX
gamingbolt.com · 6d
Token-Space Mask Prediction for Efficient Vision Transformer Segmentation
🧩 Attention Kernels
arxiv.org · 2d
Notes on pretraining parallelisms and failed training runs.
⏱️ CUDA Events
dwarkesh.com · 4d · Hacker News
Understanding KV Cache: The Hidden Memory Cost of Serving LLMs
⚡ Flash Attention
melchi.me · 2d · Hacker News
An LLM on a Sony PSP
⚙️ Systems Programming
granda.org · 5d
Coding Agent Inference Benchmark Revealed
⚡ ONNX Runtime
startuphub.ai · 1d
Ollama vs vLLM vs llama.cpp: Which Wins for Your Use Case
📊 Profiling Tools
tildalice.io · 5d
AMD's FSR 4 coming to RDNA 2 could give the Xbox Series X a PS5 Pro-like upgrade
🔧 PTX
tweaktown.com · 6d