Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI