Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🏎️ TensorRT
Inference Optimization, Model Deployment, NVIDIA, Quantization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81888
posts in
547.4
ms
CNN-based
Segmentation
of Medical
Imaging
Data
dev.to
·
11h
·
Discuss:
DEV
🧮
cuDNN
ggml
: backend-agnostic tensor parallelism by
JohannesGaessler
· Pull Request #19378
github.com
·
3d
·
Discuss:
r/LocalLLaMA
🎯
Tensor Cores
Hypernetworks
: Neural Networks for
Hierarchical
Data
blog.sturdystatistics.com
·
3d
·
Discuss:
Hacker News
📊
Gradient Accumulation
Recursive
Deductive
Verification: A framework for reducing AI
hallucinations
news.ycombinator.com
·
12h
·
Discuss:
Hacker News
📊
Gradient Accumulation
Main
Content ||
Math
∩ Programming
jeremykun.com
·
3h
📉
Model Quantization
A Chinese Traditional
Opera
Video Super-Resolution Dataset Based on the “Real-world+”
Degradation
Fusion
nature.com
·
1d
🧮
cuDNN
Prompt injection in Google
Translate
reveals base model
behaviors
behind task-specific fine-tuning
lesswrong.com
·
1d
·
Discuss:
Hacker News
🤖
AI Coding Tools
Matching
the right LLM for your GPU feels like an art, but I finally
cracked
it
xda-developers.com
·
1d
📈
GPU Occupancy
Teon
Demonstrates
Improved Pre-Training With Language Models Up To 1B Parameters
quantumzeitgeist.com
·
3d
📉
Model Quantization
Needed
10K
prompts for my ML dataset, so I made this tool instead of
copy-pasting
for hours
promptanvil.com
·
10h
·
Discuss:
r/SideProject
🤖
AI Coding Tools
Fastfood
: Approximate Kernel Expansions in
Loglinear
Time
dev.to
·
1d
·
Discuss:
DEV
🔗
Kernel Fusion
Pathwise
Test-Time Correction for
Autoregressive
Long Video Generation
arxiv.org
·
2d
📊
Gradient Accumulation
Building Highly Efficient Inference System for
Recommenders
Using
PyTorch
pytorch.org
·
2d
·
Discuss:
Hacker News
📜
TorchScript
Why
Penguins
Don't Build
Nests
in Trees and Why That Matters for AI
erikzaadi.com
·
15h
🤖
AI Coding Tools
A Field Guide To
Gaussian
Splatting
rd.nytimes.com
·
5h
⚡
Flash Attention
Understanding LLM Inference
Engines
: Inside
Nano-vLLM
(Part 2)
neutree.ai
·
2d
·
Discuss:
Hacker News
🔄
ONNX
Reducing the Computational Cost Scaling of Tensor Network Algorithms via
Field-Programmable
Gate Array
Parallelism
arxiv.org
·
2d
🎯
Tensor Cores
TTT-Discover
optimizes
GPU kernels 2x faster than human experts — by training during inference
venturebeat.com
·
3d
⚡
ONNX Runtime
Fast
Autoscheduling
for Sparse ML
Frameworks
ajroot.pl
·
4d
·
Discuss:
Hacker News
,
r/Compilers
🎯
Tensor Cores
Three AI
engines
walk
into a bar in single file...
theregister.com
·
11h
🤖
AI Coding Tools
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help