Knowledge Cascade: Reverse Knowledge Distillation on Nonparametric Multivariate Functional Estimation (opens in new tab)
As machine learning models and datasets continue to grow, developing complex models has become increasingly computationally demanding. Knowledge distillation reduces deployment cost by compressing a large, well-trained teacher model into a compact student model, but it does not address settings where constructing the teacher itself is the bottleneck. Motivated by this challenge, we introduce Knowledge Cascade (KCas), a reverse knowledge distilla...
Read the original article