⚖️ Model Optimization - emschwartz

Discussed on Hacker News

📱Edge AI Optimization deepgate.ai·

Automating model design for edge AI

Discussed on Hacker News

🗜️Vector Compression moorcheh.ai·

Information-Theoretic Vector Search Is Having Its Moment

Covered by GitHub

Discussed on Hacker News

🤖AI latent.space

[AINews] GLM > GPT? GLM-5.2 passes vibe check; Z.ai forecasts Open Fable by December

📱Edge AI Optimization arXiv·

ARIA: Adaptive Region-Based Importance Allocation for Conditional Diffusion Distillation

🤖AI Qt Blog·

Qt Creator 20 and local AI

Covers 10 stories including Pi.dev: There are many coding agents, but this one is mine

Covered by Techrights

Discussed on r/LocalLLaMA

🆕New AI techaffiliate.in·

GLM-5.2: Benchmarks, Architecture and How to Run It

Covers 2 stories including zai-org/GLM-5.2 is here!

Discussed on Hacker News

🏭Industrial Policy arXiv·

Lightweight Transformer Models for On-Device Fault Detection: A Benchmark Study on Resource-Constrained Deployment

🤖AI GitHub·

Show HN: Callimachus – Local search across your AI coding-agent history

Covers 2 stories including Open VSX Registry Is Down

Discussed on Hacker News

📱Edge AI Optimization arXiv·

An Empirical Study of OpenPangu Quantization on Ascend NPUs

📱Edge AI Optimization arXiv·

Understanding Knowledge Distillation in Post-Training: When It Helps and When It Fails

🤖Unmanned Systems arXiv·

Denoising-Enhanced Coarse-to-Fine Infrared Small Target Detection with Attention Prior-Guided Knowledge Distillation

ℹ️Information Theory arXiv·

StreamKL: Fast and Memory-Efficient KL Divergence for Boosting Attention Distillation

📱Edge AI Optimization arXiv·

Efficient Network Inference via Hardware-Aware Architecture Search, Model Pruning & Quantization

📱Edge AI Optimization arXiv·

HilDA: Hierarchical Distillation with Diffusion for Advancing Self-Supervised LiDAR Pre-trainin

Covered by ai-brief.liziran.com

📱Edge AI Optimization arXiv·

PRIDE: Privileged Information-enhanced Distillation for Empathetic Dialogue Generation

📱Edge AI Optimization arXiv·

Wisdom of Committee: Diverse Distillation from Large Foundation Models and Domain Experts

🔓Open Source AI arXiv·

SVD-Surgeon: Optimal Singular-Value Surgery for Large Language Model Compression

ℹ️Information Theory arXiv·

On the Expressive Power of Weight Quantization in Large Language Models

Deltatensors – store model fine-tunes as compressed weight deltas

Automating model design for edge AI

Information-Theoretic Vector Search Is Having Its Moment

[AINews] GLM > GPT? GLM-5.2 passes vibe check; Z.ai forecasts Open Fable by December

ARIA: Adaptive Region-Based Importance Allocation for Conditional Diffusion Distillation

Qt Creator 20 and local AI

GLM-5.2: Benchmarks, Architecture and How to Run It

Lightweight Transformer Models for On-Device Fault Detection: A Benchmark Study on Resource-Constrained Deployment

Show HN: Callimachus – Local search across your AI coding-agent history

An Empirical Study of OpenPangu Quantization on Ascend NPUs

Understanding Knowledge Distillation in Post-Training: When It Helps and When It Fails

Denoising-Enhanced Coarse-to-Fine Infrared Small Target Detection with Attention Prior-Guided Knowledge Distillation

StreamKL: Fast and Memory-Efficient KL Divergence for Boosting Attention Distillation

Efficient Network Inference via Hardware-Aware Architecture Search, Model Pruning & Quantization

HilDA: Hierarchical Distillation with Diffusion for Advancing Self-Supervised LiDAR Pre-trainin

PRIDE: Privileged Information-enhanced Distillation for Empathetic Dialogue Generation

Wisdom of Committee: Diverse Distillation from Large Foundation Models and Domain Experts

SVD-Surgeon: Optimal Singular-Value Surgery for Large Language Model Compression

Learning from Own Solutions: Self-Conditioned Credit Assignment for Reinforcement Learning with Verifiable Rewards