Llama, qwen, OpenAI, Claude, Anthropic, GPUs, Ollama, Local LLMs

Caches and Abstractions
parallelprogrammer.substack.com·21h·