Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Quantization
🗜️ Quantization
Specific
model quantization, int8, mixed precision, weight compression
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
46
posts in
15.4
ms
Trainable Smooth-Rotation Transforms with Learned Channel Scales for LLM
Quantization
📐
Model Architecture
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Trainable Smooth-Rotation Transforms with Learned Channel Scales for LLM Quantization
Less-relevant results
Daily Hacker News for 2026-06-06
🔄
MLOps
daemonology.net
·
4d
4 days ago
Actions for Daily Hacker News for 2026-06-06
[AINews] FrontierCode: Benchmarking for Code Quality over Slop
⚡
ML Inference
Content type:
News
latent.space
·
2d
2 days ago
Actions for [AINews] FrontierCode: Benchmarking for Code Quality over Slop
MiMo-v2.5-Pro-UltraSpeed: 1T
model
with 1000 TPS
🖥️
Systems ML
Content type:
Blog
mimo.xiaomi.com
·
3d
3 days ago
·
Hacker News
,
r/LocalLLaMA
Actions for MiMo-v2.5-Pro-UltraSpeed: 1T model with 1000 TPS
not much happened today | AINews
🤖
Machine Learning
news.smol.ai
·
6d
6 days ago
Actions for not much happened today | AINews
Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!
🔧
MLIR
gizchina.com
·
1d
1 day ago
Actions for Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!
Minimizing the Hidden Cost of Scales: Graph-Guided
Ultra-Low-Bit
Quantization
for Large Language
Models
🖥️
Systems ML
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Minimizing the Hidden Cost of Scales: Graph-Guided Ultra-Low-Bit Quantization for Large Language Models
Apple rebuilt its on-device AI stack at WWDC 2026
🤖
Machine Learning
Content type:
Blog
ziraph.com
·
1d
1 day ago
·
Hacker News
Actions for Apple rebuilt its on-device AI stack at WWDC 2026
OpenAI govt stake 🇺🇸, Google compute deal 🚀, Microsoft Scout launch 🤖
🧠
Deep Learning
tldr.tech
·
3d
3 days ago
Actions for OpenAI govt stake 🇺🇸, Google compute deal 🚀, Microsoft Scout launch 🤖
☕🤖 Claude Now Writes Most of Its Own Code
⚙️
Systems Programming
Content type:
News
Content type:
Blog
theaibreak.substack.com
·
2d
2 days ago
·
Substack
Actions for ☕🤖 Claude Now Writes Most of Its Own Code
UniSVQ:
2-bit
Unified Scalar-Vector
Quantization
🖥️
Systems ML
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for UniSVQ: 2-bit Unified Scalar-Vector Quantization
MoQ GGUFs and GSQ:
Low-Bit
GGUFs Are About to Get Much Better
⚡
ML Inference
Content type:
News
Content type:
Blog
kaitchup.substack.com
·
5d
5 days ago
·
r/LocalLLaMA
Actions for MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better
On
Low-Bit
Quantization
Errors in Speaker Verification: Diagnostic and Mitigation
🖥️
Systems ML
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for On Low-Bit Quantization Errors in Speaker Verification: Diagnostic and Mitigation
Where to Host Your Open-Source
Model
(Under 10B Parameters)
⚡
ML Inference
digitalocean.com
·
6d
6 days ago
Actions for Where to Host Your Open-Source Model (Under 10B Parameters)
FAIR-Calib:
Frontier-Aware
Instability-Reweighted Calibration for
Post-Training
Quantization of Diffusion Large Language Models
⚙️
Model Training
Content type:
Academic
arxiv.org
·
3d
3 days ago
Actions for FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models
ScaleSweep: Accurate NVFP4
Post-Training
Quantization
of LLMs via Block Scale Initialization
🖥️
Systems ML
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for ScaleSweep: Accurate NVFP4 Post-Training Quantization of LLMs via Block Scale Initialization
alexziskind1/model-shelf
:
Model
Shelf is a local-first
model
resolver that helps AI agents and scripts find
model
weights
on your own storage before downloading from Hugging Face. Point it at an
internal
SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.
🧠
Deep Learning
Content type:
Code
github.com
·
6d
6 days ago
Actions for alexziskind1/model-shelf: Model Shelf is a local-first model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.
Dew Drop - June 8, 2026 (#4685)
🔄
MLOps
alvinashcraft.com
·
2d
2 days ago
Actions for Dew Drop - June 8, 2026 (#4685)
#068 - Apple runs Siri on Google's Gemini, OpenAI files a secret IPO at $852B, Xiaomi clocks 1,000 tps
⚡
ML Inference
indiehacker.news
·
2d
2 days ago
Actions for #068 - Apple runs Siri on Google's Gemini, OpenAI files a secret IPO at $852B, Xiaomi clocks 1,000 tps
AI Week in Review 26.06.06
🧠
Deep Learning
Content type:
News
Content type:
Blog
patmcguinness.substack.com
·
4d
4 days ago
·
Substack
Actions for AI Week in Review 26.06.06
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help