Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Model Compression
馃摝 Model Compression
Specific
Quantization, Pruning, Distillation, Efficient AI
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
46
posts in
6.5
ms
Trainable Smooth-Rotation Transforms with Learned Channel Scales for LLM
Quantization
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for Trainable Smooth-Rotation Transforms with Learned Channel Scales for LLM Quantization
Less-relevant results
iblameandrew/open-deepthink: Grok-heavy at the price of API cost. You choose the
model
. An unlimited army to think about your problem.
聽
馃
Multi-Agent Systems
聽
Content type:
Code
github.com
路
4d
4 days ago
路
r/LocalLLaMA
Actions for iblameandrew/open-deepthink: Grok-heavy at the price of API cost. You choose the model. An unlimited army to think about your problem.
Heterophily-Aware Adaptive
Knowledge
Distillation
for Hypergraph
Neural
Networks
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
2d
2 days ago
Actions for Heterophily-Aware Adaptive Knowledge Distillation for Hypergraph Neural Networks
MODF-SIR
: A Multi-agent Omni-modal
Distilled
Framework for Social Intelligence Reasoning
聽
馃挰
Prompt Engineering
聽
Content type:
Academic
arxiv.org
路
16h
16 hours ago
Actions for MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning
Understanding
Quantization-Aware
Training
: Gradients at Quantized
Weights
Bias to the Low-Loss Basin
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
2d
2 days ago
Actions for Understanding Quantization-Aware Training: Gradients at Quantized Weights Bias to the Low-Loss Basin
ScaleSweep: Accurate NVFP4
Post-Training
Quantization
of LLMs via Block Scale Initialization
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
2d
2 days ago
Actions for ScaleSweep: Accurate NVFP4 Post-Training Quantization of LLMs via Block Scale Initialization
Dew Drop - June 8, 2026 (#4685)
聽
馃挰
Prompt Engineering
alvinashcraft.com
路
3d
3 days ago
Actions for Dew Drop - June 8, 2026 (#4685)
LLM-Based User Personas for Recommendations at Scale
聽
馃攳
Vector Databases
聽
Content type:
Academic
arxiv.org
路
16h
16 hours ago
Actions for LLM-Based User Personas for Recommendations at Scale
PADD: Path-Aligned Decompression
Distillation
for Non-Router Teacher to Guide MoE Student Learning
聽
馃攳
Vector Databases
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for PADD: Path-Aligned Decompression Distillation for Non-Router Teacher to Guide MoE Student Learning
Cross-Modal
Knowledge
Distillation
without Paired Data: Theoretical Foundation and Algorithm
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for Cross-Modal Knowledge Distillation without Paired Data: Theoretical Foundation and Algorithm
Minimizing the Hidden Cost of Scales: Graph-Guided Ultra-Low-Bit
Quantization
for
Large
Language
Models
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
Actions for Minimizing the Hidden Cost of Scales: Graph-Guided Ultra-Low-Bit Quantization for Large Language Models
Beyond Dark
Knowledge
: Mixup-Based
Distillation
for Reliable Predictions
聽
馃攳
Vector Databases
聽
Content type:
Academic
arxiv.org
路
16h
16 hours ago
Actions for Beyond Dark Knowledge: Mixup-Based Distillation for Reliable Predictions
TENP: Trapezoidal Expert Neuron
Pruning
For Mixture-of-Experts
聽
馃挰
Prompt Engineering
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for TENP: Trapezoidal Expert Neuron Pruning For Mixture-of-Experts
FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for
Post-Training
Quantization
of Diffusion Large Language Models
聽
馃挰
Prompt Engineering
聽
Content type:
Academic
arxiv.org
路
3d
3 days ago
Actions for FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models
LLMCodec: Adapting Video Codecs for
Efficient
Weight
Compression
of Large Language Models
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
Actions for LLMCodec: Adapting Video Codecs for Efficient Weight Compression of Large Language Models
Sigma-Branch: Hierarchical Single-Path
Network
Reconstruction for Dynamic
Inference
with Reduced Active
Parameters
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for Sigma-Branch: Hierarchical Single-Path Network Reconstruction for Dynamic Inference with Reduced Active Parameters
Value-and-Structure Alignment for Routing-Consistent
Quantization
of Mixture-of-Experts
Models
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
Actions for Value-and-Structure Alignment for Routing-Consistent Quantization of Mixture-of-Experts Models
Unsupervised Continual Clustering via Forward-Backward
Knowledge
Distillation
聽
鈿欙笍
AutoML
聽
Content type:
Academic
arxiv.org
路
3d
3 days ago
Actions for Unsupervised Continual Clustering via Forward-Backward Knowledge Distillation
Compress-Distill
: Reasoning Trace Compression for
Efficient
Knowledge Distillation
聽
馃挰
Prompt Engineering
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
Actions for Compress-Distill: Reasoning Trace Compression for Efficient Knowledge Distillation
Distilling
first-principles accuracy into compact machine learning potentials for condensed-phase chemistry
聽
馃挰
Prompt Engineering
聽
Content type:
Academic
arxiv.org
路
3d
3 days ago
Actions for Distilling first-principles accuracy into compact machine learning potentials for condensed-phase chemistry
« Page 1
路
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help