Cutting Our LLM Bill 65%: A Backend Engineer's Postmortem (opens in new tab)
So here's what happened: cutting Our LLM Bill 65%: A Backend Engineer's Postmortem I'll be honest — when I first looked at our monthly LLM bill last quarter, I had to close the laptop and go for a walk. Six figures a month, mostly going to GPT-4o because, well, that's just what we defaulted to. Fwiw, this is one of those situations where nobody on the team actually made a deliberate choice — we just kept using the first thing that worked, and by the time anyone noticed, the spend had metastas...
Read the original article