Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📉 Model Quantization
INT8, Post-Training, QAT, Pruning, Model Compression
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112449
posts in
381.8
ms
Train Less,
Infer
Faster: Efficient Model
Finetuning
and Compression via Structured Sparsity
arxiv.org
·
2d
🎓
Model Distillation
Astro: Activation-guided Structured
Regularization
for
Outlier-Robust
LLM Post-Training Quantization
arxiv.org
·
3d
📊
Gradient Accumulation
Digitizing
the "
Shokunin
": How we encoded a Master's hammer strike into AI
yusukekaizen.substack.com
·
1d
·
Discuss:
Substack
🤖
AI Coding Tools
Finding cancer cells in a
cocktail
of complex
tissues
sciworthy.com
·
19h
🧩
Attention Kernels
Grassmannian
Manifold
Learning: Optimization and Deep Learning Architectures
hackernoon.com
·
1d
🏎️
TensorRT
New
Ovis2.6-30B-A3B
, a lil better than
Qwen3-VL-30B-A3B
huggingface.co
·
19h
·
Discuss:
r/LocalLLaMA
🔄
ONNX
Storing
Image Data As
Analog
Audio
hackaday.com
·
10h
🧮
cuDNN
Deterministic
Inference with
EigenAI
deterministicinference.com
·
1d
🏎️
TensorRT
Karpathy
's
Micro
LLM in JavaScript
github.com
·
15h
·
Discuss:
Hacker News
🤖
AI Coding Tools
A C implementation of the inference pipeline for the Mistral AI’s
Voxtral
Realtime
4B model
blog.adafruit.com
·
14h
🏎️
TensorRT
Issue 638
datascienceweekly.substack.com
·
11h
·
Discuss:
Substack
⏱️
Benchmarking
The 5 Model
Compression
Techniques: How to
Shrink
AI 10× Without Losing Accuracy
pub.towardsai.net
·
6d
🎓
Model Distillation
Don't give away to the
gradient
descent
carteakey.dev
·
1d
·
Discuss:
Hacker News
📊
Gradient Accumulation
Ai’s
Inner
Workings
Revealed By Model Trained On One Billion Data Points
quantumzeitgeist.com
·
14h
📊
Gradient Accumulation
How
Andrej
Karpathy
Built a Working Transformer in 243 Lines of Code
analyticsvidhya.com
·
18h
📜
TorchScript
Recursive
Language Models: Stop
Stuffing
the Context Window
nlp.elvissaravia.com
·
11h
⚡
ONNX Runtime
microgpt
karpathy.github.io
·
1d
📜
TorchScript
Show HN: Latent-k –
Persistent
dependency
map to reduce AI coding token usage
latentk.org
·
1d
·
Discuss:
Hacker News
🤖
AI Coding Tools
Large Language Models for
Mortals
book
andrewpwheeler.com
·
1d
🎓
Model Distillation
(Re)
Discovering
Natural
Laws
lesswrong.com
·
9h
🛠
Ml-eng
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help