Model Compression, Neural Networks, Precision Reduction, Efficient Inference
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation
arxiv.org · 1d
DRIFT: Data Reduction via Informative Feature Transformation - Generalization Begins Before Deep Learning starts
arxiv.org · 10h
ML in the Home
blog.raymond.burkholder.net · 1d
Safe Pruning LoRA: Robust Distance-Guided Pruning for Safety Alignment in Adaptation of LLMs
arxiv.org · 10h