📉 Model Quantization
INT8, Post-Training, QAT, Pruning, Model Compression
Scoured 83069 posts in 2.39 s
D$^2$Quant: Accurate Low-bit Post-Training Weight Quantization for LLMs
arxiv.org · 18h
🎯 Tensor Cores
Quantization-Aware Regularizers for Deep Neural Networks Compression
arxiv.org · 18h
📊 Gradient Accumulation
Fast Autoscheduling for Sparse ML Frameworks
ajroot.pl · 1h · Discuss: Hacker News
🎯 Tensor Cores
Training Design for Text-to-Image Models: Lessons from Ablations
huggingface.co · 1d
📊 Gradient Accumulation
Shows Learnable Permutation Improves Transformer Model Sparsity Performance
quantumzeitgeist.com · 2h
📊 Gradient Accumulation
7 Advanced Feature Engineering Tricks Using LLM Embeddings
machinelearningmastery.com · 1d
🎓 Model Distillation
Neural Bitcoin Small Class Number Attack
leetarxiv.substack.com · 5h · Discuss: Substack
⚡ ONNX Runtime
Cross Entropy Derivatives, Part 4: Solving for other output classes
dev.to · 3h · Discuss: DEV
🏎️ TensorRT
What is Overfitting? - Overfitting in Machine Learning Explained
aws.amazon.com · 20h · Discuss: Hacker News
📊 Gradient Accumulation
qcc4cp/qcc: Source code for the book "Quantum Computing for Programmers", Cambridge University Press
github.com · 16h · Discuss: Hacker News
🔄 ONNX
The 4-Step Magic: How Neural Networks Actually Learn (With Real Examples)
pub.towardsai.net · 10h
📊 Gradient Accumulation
Deep Learning-Driven Predictive Modeling of Shell Degradation in Juvenile *Crassostrea virginica* under Simulated Ocean Acidification Conditions: A Multi-Modal Approach
freederia.com · 11h
🔄 ONNX
Convert & Compress
frontendmasters.com · 1d
🐕 Ruff
Automatic RGBA Decomposition
image-layered.app · 2d · Discuss: Hacker News
🧮 cuDNN
Training a Small Language Model
elijahpotter.dev · 1d
🏎️ TensorRT
Future leakage in block-quantized attention
matx.com · 1d · Discuss: Hacker News
⚡ Flash Attention
Information Retrieval Part 2: How To Get Into Model Training Data
searchenginejournal.com · 9h
📊 Gradient Accumulation
As Rocks May Think
evjang.com · 23h · Discuss: Hacker News
⚡ ONNX Runtime
I built a free ML practice platform - would love your feedback [P]
reddit.com · 15h · Discuss: r/MachineLearning
🚀 MLOps
Optimized LLM Inference Engines
rishirajacharya.com · 9h
⚡ ONNX Runtime