Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Model Optimization
⚖️ Model Optimization
model compression, quantization, model size
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
22
posts in
47.6
ms
🚀
Frontier AI
arXiv
·
1d
1 day ago
On the Expressive Power of
Weight
Quantization
in Large Language
Models
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for On the Expressive Power of Weight Quantization in Large Language Models
🤖
AI
GitHub
·
8h
8 hours ago
Deltatensors – store
model
fine-tunes as
compressed
weight
deltas
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Deltatensors – store model fine-tunes as compressed weight deltas
📱
Edge AI Optimization
deepgate.ai
·
5d
5 days ago
Automating
model
design for edge AI
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Automating model design for edge AI
🗜️
Vector Compression
moorcheh.ai
·
1d
1 day ago
Information-Theoretic Vector Search Is Having Its Moment
Covered by
GitHub
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Information-Theoretic Vector Search Is Having Its Moment
🤖
AI
latent.space
·
5d
5 days ago
[AINews] GLM > GPT? GLM-5.2 passes vibe check; Z.ai forecasts Open Fable by December
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for [AINews] GLM > GPT? GLM-5.2 passes vibe check; Z.ai forecasts Open Fable by December
📱
Edge AI Optimization
arXiv
·
8h
8 hours ago
ARIA: Adaptive Region-Based Importance Allocation for Conditional Diffusion
Distillation
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for ARIA: Adaptive Region-Based Importance Allocation for Conditional Diffusion Distillation
🤖
AI
Qt Blog
·
2d
2 days ago
Qt Creator 20 and local AI
Covers
10 stories
See all stories this covers
including
Pi.dev: There are many coding agents, but this one is mine
Covered by
Techrights
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Qt Creator 20 and local AI
🆕
New AI
techaffiliate.in
·
6d
6 days ago
GLM-5.2: Benchmarks, Architecture and How to Run It
Covers
2 stories
See all stories this covers
including
zai-org/GLM-5.2 is here!
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for GLM-5.2: Benchmarks, Architecture and How to Run It
🏭
Industrial Policy
arXiv
·
8h
8 hours ago
Lightweight Transformer
Models
for On-Device Fault Detection: A Benchmark Study on Resource-Constrained Deployment
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Lightweight Transformer Models for On-Device Fault Detection: A Benchmark Study on Resource-Constrained Deployment
🤖
AI
GitHub
·
3d
3 days ago
Show HN: Callimachus – Local search across your AI coding-agent history
Covers
2 stories
See all stories this covers
including
Open VSX Registry Is Down
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Callimachus – Local search across your AI coding-agent history
📱
Edge AI Optimization
arXiv
·
1d
1 day ago
An Empirical Study of OpenPangu
Quantization
on Ascend NPUs
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for An Empirical Study of OpenPangu Quantization on Ascend NPUs
📱
Edge AI Optimization
arXiv
·
1d
1 day ago
Understanding
Knowledge
Distillation
in Post-Training: When It Helps and When It Fails
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Understanding Knowledge Distillation in Post-Training: When It Helps and When It Fails
🤖
Unmanned Systems
arXiv
·
1d
1 day ago
Denoising-Enhanced Coarse-to-Fine Infrared Small Target Detection with Attention Prior-Guided
Knowledge
Distillation
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Denoising-Enhanced Coarse-to-Fine Infrared Small Target Detection with Attention Prior-Guided Knowledge Distillation
ℹ️
Information Theory
arXiv
·
5d
5 days ago
StreamKL: Fast and Memory-Efficient KL Divergence for Boosting Attention
Distillation
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for StreamKL: Fast and Memory-Efficient KL Divergence for Boosting Attention Distillation
📱
Edge AI Optimization
arXiv
·
1d
1 day ago
Efficient Network Inference via Hardware-Aware Architecture Search,
Model
Pruning
&
Quantization
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Efficient Network Inference via Hardware-Aware Architecture Search, Model Pruning & Quantization
📱
Edge AI Optimization
arXiv
·
5d
5 days ago
HilDA: Hierarchical
Distillation
with Diffusion for Advancing Self-Supervised LiDAR Pre-trainin
Covered by
ai-brief.liziran.com
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for HilDA: Hierarchical Distillation with Diffusion for Advancing Self-Supervised LiDAR Pre-trainin
📱
Edge AI Optimization
arXiv
·
1d
1 day ago
PRIDE: Privileged Information-enhanced
Distillation
for Empathetic Dialogue Generation
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for PRIDE: Privileged Information-enhanced Distillation for Empathetic Dialogue Generation
📱
Edge AI Optimization
arXiv
·
5d
5 days ago
Wisdom of Committee: Diverse
Distillation
from Large Foundation
Models
and Domain Experts
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Wisdom of Committee: Diverse Distillation from Large Foundation Models and Domain Experts
🔓
Open Source AI
arXiv
·
1d
1 day ago
SVD-Surgeon:
Optimal
Singular-Value Surgery for Large Language
Model
Compression
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for SVD-Surgeon: Optimal Singular-Value Surgery for Large Language Model Compression
ℹ️
Information Theory
arXiv
·
6d
6 days ago
Learning from Own Solutions: Self-Conditioned Credit Assignment for Reinforcement Learning with Verifiable Rewards
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Learning from Own Solutions: Self-Conditioned Credit Assignment for Reinforcement Learning with Verifiable Rewards
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report