📉 Model Quantization
INT8, Post-Training, QAT, Pruning, Model Compression
Scoured 81821 posts in 772.9 ms
Quantization-Aware Distillation
ternarysearch.blogspot.com · 1d · Discuss: Hacker News
🎓 Model Distillation
Regularized Calibration with Successive Rounding for Post-Training Quantization
arxiv.org · 3d
🏎️ TensorRT
Image Classification with Convolutional Neural Networks
dev.to · 59m · Discuss: DEV
🧮 cuDNN
Main Content || Math ∩ Programming
jeremykun.com · 22h
🔗 Kernel Fusion
Tutorial – What is a variational autoencoder?
jaan.io · 3h · Discuss: Hacker News
🏎️ TensorRT
ProtoQuant: Quantization of Prototypical Parts For General and Fine-Grained Image Classification
arxiv.org · 15h
🏎️ TensorRT
Quantized Tensor Train Compression For Turbulent Flow Simulation: O(log N) Scaling with Reynolds-Independent Bond Dimension
zenodo.org · 8h · Discuss: Hacker News
🏎️ TensorRT
Faster AI Training Unlocked With New System For Massive Language Models
quantumzeitgeist.com · 6h
🎯 Tensor Cores
Writing a ONNX Neural Network Inference Engine from Scratch in C to run image classification with MobileNetV2
flexw.github.io · 1d · Discuss: r/C_Programming
⚡ ONNX Runtime
Expectation and Copysets
buttondown.com · 1h
🔄 ONNX
A Note on Flat Abstract Syntax Trees
gist.github.com · 2h · Discuss: Hacker News
🔬 Static Analysis
Manufacturing QMS Software
samrian.com · 5h · Discuss: Hacker News
⏱️ Benchmarking
the mathematics of compression in database systems
bitsxpages.com · 1h
📈 Occupancy Optimization
Scale LLM fine-tuning with Hugging Face and Amazon SageMaker AI
aws.amazon.com · 3h
🎓 Model Distillation
Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy
developer.nvidia.com · 2h
🏎️ TensorRT
Show HN: C-CMCP – Validated AI development workflow with quality gates
news.ycombinator.com · 4h · Discuss: Hacker News
🤖 AI Coding Tools
Adaptive Neuro-Symbolic Planning for smart agriculture microgrid orchestration in hybrid quantum-classical pipelines
dev.to · 1d · Discuss: DEV
⚡ ONNX Runtime
Drifting models
breno.bearblog.dev · 9h
🎓 Model Distillation
25W06. Learning a language with the machine
z1nz0l1n.com · 1d
🛠 Ml-eng
What I've Learned From Digitizing 20 Million Historical Documents
noahdasanaike.github.io · 6h · Discuss: r/LocalLLaMA
🔄 ONNX