Model Optimization, Inference Engines, LLM Quantization, Privacy-focused Deployments

How LLMs See the World
blog.bytebytego.com·20h
Living with LLMs
matiasklemola.com·2h·
Discuss: Hacker News