📉 Model Quantization - miterion · Scour

EyesOff: Why Some Models Quantize Better Than Others

ym2132.github.io·2d·

Discuss: Hacker News

🏎️TensorRT

Seq2Seq2Seq: Lossless Data Compression via Discrete Latent Transformers and Reinforcement Learning

arxiv.org·18h

🎓Model Distillation

The 4 Precision Formats: How to Train AI 2× Faster with Half the Memory

pub.towardsai.net

·9h

🎯Tensor Cores

Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration

machinelearning.apple.com·23h

🎓Model Distillation

Sparse Semantic Dimension as a Generalization Certificate for LLMs

arxiv.org·18h

🎓Model Distillation

batteryphil/Primal-Discrete-LLM-Training: ComponentThe "Secret Sauce"MemoryZero-Shadow Training: Training without FP16 master weights.MathPrime-Grid LUT: Better precision-per-bit than standard INT4.StabilityVote Buffering: Making Gradient Accumulation work for discrete weights.

github.com·23h·

Discuss: Hacker News

📊Gradient Accumulation

Presentation: Building Embedding Models for Large-Scale Real-World Applications

infoq.com

·7h

🎓Model Distillation

BetaZero V2: A Diffusion Model for Setting Boulder Problems

evmojo37.substack.com·1d·

Discuss: Substack

📊Gradient Accumulation

mradermacher/Qwen3-Coder-Next-REAM-GGUF

huggingface.co·1d·

Discuss: r/LocalLLaMA

📜TorchScript

gist.github.com·2d·

Discuss: Hacker News, Hacker News

🔍Type Checkers

Image Classification with CNNs – Part 4: Dealing with Variations in Input

dev.to·1h·

Discuss: DEV

Gibbs Measures from Deep Shaped Multilayer Perceptrons

link.aps.org·1d

📊Gradient Accumulation

Show HN: A segmentation model client-side via WASM

qtoolkit.dev·1d·

Discuss: Hacker News

🧩Attention Kernels

Quantization-Aware Distillation

ternarysearch.blogspot.com·5d·

Discuss: Hacker News, ternarysearch.blogspot.com

🎓Model Distillation

Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation

chizkidd.github.io·2d·

Discuss: Hacker News

🔗Kernel Fusion

antirez/iris.c: Flux 2 image generation model pure C inference

github.com·8h

🏎️TensorRT

Running Machine Learning on Arduino Nano

hackster.io·13h

🎯Tensor Cores

Visual Introduction to PyTorch

0byte.io·10h·

Discuss: Hacker News

Addendum: Data splitting against information leakage with DataSAIL

nature.com·10h

🎓Model Distillation

Breaking the Tractability Barrier: A Generic Low-Level Solver for NP-Hard Instances (N=63) on Commodity 64-Bit Silicon

zenodo.org·13h·

Discuss: Hacker News

🎯Tensor Cores

Loading more...