📊 IVF Indexes
Specific
Inverted File Index, Vector Clustering, Quantization, ANN Search
Scoured 188493 posts in 59.0 ms
Coverage-Based Calibration for Post-Training Quantization via Weighted Set Cover over Outlier Channels
🎯 Vector Quantization
arxiv.org · 5d
My New Ebook (Free Download): Quantization for Modern AI Systems
🗜️ Vector Compression
pawankjha.substack.com · 1d · Substack
LLM Quantization
🗜️ Vector Compression
huggingface.co · 2d · Hacker News
Effective KV Compression with TurboQuant
🗜️ Vector Compression
machinelearningmastery.com · 3d
A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility...
🧠 LLM Inference
lemmy.ml · 1d
AmSach/kvquant: Drop-in KV cache compressor for local LLM inference - Run 70B models on 8GB RAM
🧠 LLM Inference
github.com · 3d · DEV
Release v0.22.1-rc0: New models (#15861)
🆕 New AI
github.com · 4d
VitaLLM: A Versatile, Ultra-Compact Ternary LLM Accelerator with Dependency-Aware Scheduling
🏗️ LLM Infrastructure
arxiv.org · 2d
Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization
🔬 RaBitQ
arxiv.org · 5d
CARD: Non-Uniform Quantization of Visual Semantic Unit for Generative Recommendation
🎯 Semantic Tokens
arxiv.org · 3d
Transformer-Based Rhythm Quantization of Performance MIDI Using Beat Annotations
🔤 Tokenization
arxiv.org · 6d
QFlash: Bridging Quantization and Memory Efficiency in Vision Transformer Attention
🧠 LLM Inference
arxiv.org · 4d
Quantamination: Dynamic Quantization Leaks Your Data Across the Batch
📦 Batch Embeddings
arxiv.org · 3d
CoQuant: Joint Weight-Activation Subspace Projection for Mixed-Precision LLMs
🎯 Vector Quantization
arxiv.org · 3d
MedSynapse-V: Bridging Visual Perception and Clinical Intuition via Latent Memory Evolution
✨ Gemini
arxiv.org · 3d
QuantClaw: Precision Where It Matters for OpenClaw
🔍 Quickwit
arxiv.org · 6d
MesonGS++: Post-training Compression of 3D Gaussian Splatting with Hyperparameter Searching
📦 Batch Embeddings
arxiv.org · 3d
Joint Design of Doppler-Resilient Unimodular Discrete-Phase Waveforms and Receiving Filters for MIMO Radars
🔬 RaBitQ
arxiv.org · 4d
QB-LIF: Learnable-Scale Quantized Burst Neurons for Efficient SNNs
🔢 BitNet
arxiv.org · 4d
Feasible-First Exploration for Constrained ML Deployment Optimization in Crash-Prone Hierarchical Search Spaces
🏗️ LLM Infrastructure
arxiv.org · 4d