⚡ Quantization
Model Compression, INT8, Weight Quantization
OmniZip: Learning a Unified and Lightweight Lossless Compressor for Multi-Modal Data
arxiv.org · 1d
📊 Vector Quantization
Deep Dive into Transformer Encoders by Hand ✍️
pub.towardsai.net · 11h
⚡ Transformers
InnerQ: Hardware-aware Tuning-free Quantization of KV Cache for Large Language Models
arxiv.org · 1d
📊 Vector Quantization
[Python/Sage] Extended Hidden Number Problem
leetarxiv.substack.com · 1d · Discuss: r/programming
🔐 Cryptography
Unsloth Dynamic 2.0 GGUFs
unsloth.ai · 7h · Discuss: Hacker News, r/LocalLLaMA
🎛️ Fine-tuning
μpack: Faster & more flexible integer compression
blog.cf8.gg · 1d · Discuss: r/programming, r/rust
⚡ Hardware Acceleration
Accuracy vs. Speed in Local LLMs: Finding Your Sweet Spot
grigio.org · 6h · Discuss: Hacker News
🎛️ Fine-tuning
yuechen-li-dev/GenerativeCompressionProtocol: The first model-native prompt compression protocol
github.com · 23h
⚡ Speculative Decoding
Asura: Looped Language Models done better
neel04.github.io · 2d · Discuss: Hacker News
💬 LLMs
VLM Validation and Document Intelligence
pub.towardsai.net · 18h
🤖 Machine Learning
Fast KV Compaction Makes Long Context LLMs Practical
hackernoon.com · 1d
💬 LLMs
pplx-embed: State-of-the-Art Embedding Models for Web-Scale Retrieval
research.perplexity.ai · 1d · Discuss: Hacker News
📊 Embeddings
Scaling ML Inference on Databricks: Liquid or Partitioned? Salted or Not?
towardsdatascience.com · 2h
⚙️ MLOps
KV Caching in LLMs: A Guide for Developers
machinelearningmastery.com · 2d
⚡ Speculative Decoding
Current language model training leaves large parts of the internet on the table
the-decoder.com · 4h
💬 LLMs
Clojure + NumPy Interop: The 2026 Guide to Hybrid Machine Learning Pipelines
flexiana.com · 1d
⚙️ MLOps
The Lie algebra of XY-mixer topologies and warm starting QAOA for constrained optimization
nature.com · 1d
⚛️ Quantum Computing
DeepSeek updated its low-level operator library DeepGEMM, basically confirming the implementation of mHC and next-generation hardware support in V4
github.com · 16h · Discuss: r/LocalLLaMA
⚡ Hardware Acceleration
Coherent Care
lesswrong.com · 17h
∀ Mathematical Logic
Analyzing ReLUfication Limitations: Enhancing LLM Sparsity via Up Projection
hackernoon.com · 1d
💬 LLMs