Changes that cut our LLM pipeline costs more than model-switching did

Discussed on Hacker News

I have been building multiple LLM systems, and for our organization the biggest cost savings did not come from prompt wordsmithing or model switching. Sharing this in case it is useful to anyone watching their token bill.
