Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🏎️ TensorRT
Inference Optimization, Model Deployment, NVIDIA, Quantization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81831
posts in
898.0
ms
AnyThermal
: Towards Learning Universal
Representations
for Thermal Perception
arxiv.org
·
17h
🧩
Attention Kernels
PackInfer
: Compute- and I/O-Efficient Attention for
Batched
LLM Inference
arxiv.org
·
17h
🎓
Model Distillation
How to Automate
Multilingual
Video at the Speed of
Localization
transifex.com
·
11h
💡
LSP
AI
doomers
: What uses of generative AI are you actually
excited
about?
tildes.net
·
3h
🤖
AI Coding Tools
Expectation
and
Copysets
buttondown.com
·
3h
·
Discuss:
Hacker News
📉
Model Quantization
What I've Learned From
Digitizing
20 Million
Historical
Documents
noahdasanaike.github.io
·
8h
·
Discuss:
r/LocalLLaMA
🔄
ONNX
DeepChopper
model improves RNA sequencing research by mitigating
chimera
artifacts
phys.org
·
1h
🔄
ONNX
Join our new study on AI and data-driven computing in UK
primary
classrooms
raspberrypi.org
·
10h
🎓
Model Distillation
Top AI
Libraries
for React
Developers
in 2026
dev.to
·
8h
·
Discuss:
DEV
🤖
AI Coding Tools
A
practical
systems engineering guide:
Architecting
AI-ready infrastructure for the agentic era
thenewstack.io
·
11m
🤖
AI Coding Tools
Bytedance
shows impressive progress in AI video with
Seedance
2.0
the-decoder.com
·
3h
⚡
Flash Attention
Fastfood
: Approximate Kernel Expansions in
Loglinear
Time
dev.to
·
1d
·
Discuss:
DEV
🔗
Kernel Fusion
Building Highly Efficient Inference System for
Recommenders
Using
PyTorch
pytorch.org
·
3d
·
Discuss:
Hacker News
📜
TorchScript
New Apple-backed AI model can
generate
sound and speech from
silent
videos
9to5mac.com
·
7h
🧮
cuDNN
This viral AI
caricature
trend is
everywhere
– here’s how to make one in ChatGPT
creativebloq.com
·
9h
🔄
ONNX
TTT-Discover
optimizes
GPU kernels 2x faster than human experts — by training during inference
venturebeat.com
·
4d
⚡
ONNX Runtime
On AI:
Environmental
Impact
jmoiron.net
·
12h
⚡
ONNX Runtime
World Models and the Data Problem in
Robotics
joeljang.github.io
·
5h
·
Discuss:
Hacker News
📊
Gradient Accumulation
Understanding LLM Inference
Engines
: Inside
Nano-vLLM
(Part 2)
neutree.ai
·
3d
·
Discuss:
Hacker News
🔄
ONNX
Fast
Autoscheduling
for Sparse ML
Frameworks
ajroot.pl
·
5d
·
Discuss:
Hacker News
,
r/Compilers
🎯
Tensor Cores
Loading...
Loading more...
« Page 3
•
Page 5 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help