Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🏎️ TensorRT
Inference Optimization, Model Deployment, NVIDIA, Quantization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
82396
posts in
344.9
ms
Quantization-Aware
Distillation
ternarysearch.blogspot.com
·
14h
·
Discuss:
Hacker News
📉
Model Quantization
Autoregressive
Model Beats Diffusion:
Llama
for Scalable Image Generation
paperium.net
·
2d
·
Discuss:
DEV
📊
Gradient Accumulation
How I
squeezed
a
BERT
sentiment analyzer into 1GB RAM on a $5 VPS
mohammedeabdelaziz.github.io
·
1d
·
Discuss:
Hacker News
⚡
ONNX Runtime
Show HN: Model Training Memory
Simulator
czheo.github.io
·
7h
·
Discuss:
Hacker News
📊
Gradient Accumulation
🥇Top AI
Papers
of the Week
nlp.elvissaravia.com
·
2h
⚡
ONNX Runtime
Physics-Informed Neural Networks for
Inverse
PDE
Problems
pub.towardsai.net
·
1d
📊
Gradient Accumulation
NVIDIA
VibeTensor
: AI Just Built Its Own Deep Learning Engine… And It Actually Works (AI
Revolution
youtube.com
·
7h
🤖
AI Coding Tools
Stochastic Gradient Descent
Optimizes
Over-parameterized Deep
ReLU
Networks
dev.to
·
18h
·
Discuss:
DEV
📊
Gradient Accumulation
— ### Abstract We introduce a
rigorously
engineered hybrid pipeline that transforms deep generative neural architectures into quadratic
unconstrained
b...
freederia.com
·
2d
📉
Model Quantization
qrafty-ai/teleop
_xr: Transforms your VR/AR headset into a powerful, precise robot controller
github.com
·
12h
·
Discuss:
Hacker News
🤖
AI Coding Tools
StatLLM
: A Dataset for Evaluating the Performance of Large Language Models in
Statistical
Analysis
nature.com
·
2d
🔄
ONNX
Logarithmic-time
Schedules
for Scaling Language Models with Momentum
arxiv.org
·
2d
📊
Gradient Accumulation
Run
Voxtral
Mini 4B Realtime on
vLLM
with Red Hat AI on Day 1: A step-by-step guide
developers.redhat.com
·
1d
⚡
ONNX Runtime
Crafting the Eyes for Thinking Machines: Rewiring the
Retina
- The Anatomy of
ViTStruct
pub.towardsai.net
·
1d
👁️
Attention Optimization
CNN-based
Segmentation
of Medical
Imaging
Data
dev.to
·
2h
·
Discuss:
DEV
🧮
cuDNN
Hypernetworks
: Neural Networks for
Hierarchical
Data
blog.sturdystatistics.com
·
3d
·
Discuss:
Hacker News
📊
Gradient Accumulation
Training language models on
TPUs
shouldn't be
scary
dogac.dev
·
3d
·
Discuss:
Hacker News
📜
TorchScript
ggml
: backend-agnostic tensor parallelism by
JohannesGaessler
· Pull Request #19378
github.com
·
2d
·
Discuss:
r/LocalLLaMA
🎯
Tensor Cores
Recursive
Deductive
Verification: A framework for reducing AI
hallucinations
news.ycombinator.com
·
3h
·
Discuss:
Hacker News
📊
Gradient Accumulation
A Chinese Traditional
Opera
Video Super-Resolution Dataset Based on the “Real-world+”
Degradation
Fusion
nature.com
·
1d
🧮
cuDNN
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help