Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📉 Model Quantization
INT8, Post-Training, QAT, Pruning, Model Compression
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
82991
posts in
676.8
ms
Regularized Calibration with
Successive
Rounding
for Post-Training Quantization
arxiv.org
·
1d
🏎️
TensorRT
=============================================================================================================================== **Abstract**
freederia.com
·
1d
🏎️
TensorRT
D$^2$
Quant
:
Accurate
Low-bit Post-Training Weight Quantization for LLMs
arxiv.org
·
3d
🎯
Tensor Cores
AI Sees And
Understands
Images Far More
Efficiently
With New Embedding Technique
quantumzeitgeist.com
·
18h
👁️
Attention Optimization
Running Local LLMs as Your AI Coding
Assistant
on Apple
Silicon
dev.to
·
6h
·
Discuss:
DEV
🚀
MLOps
Hello Edge: Keyword
Spotting
on
Microcontrollers
paperium.net
·
14h
·
Discuss:
DEV
🎯
Tensor Cores
Writing an LLM from scratch, part
32c
– Interventions: removing
dropout
gilesthomas.com
·
1d
·
Discuss:
Hacker News
📊
Gradient Accumulation
Building Highly Efficient Inference System for
Recommenders
Using
PyTorch
pytorch.org
·
1d
·
Discuss:
Hacker News
📜
TorchScript
deepmriprep
: voxel-based
morphometry
preprocessing via deep neural networks
nature.com
·
1d
🏎️
TensorRT
Proposal: A Framework for
Discovering
Alien Physics via Optimal
Compression
lesswrong.com
·
16h
⚡
ONNX Runtime
Crafting the Eyes for Thinking Machines: Rewiring the
Retina
- The Anatomy of
ViTStruct
pub.towardsai.net
·
8h
👁️
Attention Optimization
The Little Book of
Linear
Algebra
little-book-of.github.io
·
21h
🔢
cuBLAS
Text classification with Python 3.14's
zstd
module • Max
Halford
maxhalford.github.io
·
1d
·
Discuss:
Lobsters
,
Hacker News
🔍
Type Checkers
So
whats
the next word, then? Almost-no-math
intro
to transformer models
matthias-kainer.de
·
1d
·
Discuss:
Hacker News
🧩
Attention Kernels
Actualización
de embeddings en
producción
con LangChain + pgvector
platform.openai.com
·
1d
·
Discuss:
DEV
🛠
Ml-eng
Is Your Machine Learning
Pipeline
as Efficient as it Could Be?
kdnuggets.com
·
21h
📊
Gradient Accumulation
Hypernetworks
: Neural Networks for
Hierarchical
Data
blog.sturdystatistics.com
·
1d
·
Discuss:
Hacker News
📊
Gradient Accumulation
Fast
Autoscheduling
for Sparse ML
Frameworks
ajroot.pl
·
2d
·
Discuss:
Hacker News
🎯
Tensor Cores
Linear
Regression
: An
Overview
dev.to
·
20h
·
Discuss:
DEV
🎓
Model Distillation
Mechanistic
Interpretability:
Peeking
Inside an LLM
towardsdatascience.com
·
1d
📊
Gradient Accumulation
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help