Paying for LLM inference by the kilowatt-hour instead of per token (opens in new tab)
You might have read recently on this blog that my procurement preferences for hank.parts are basically * EU, * (self hosted) open source, * UK/CH, * Rest of the world, in this order. This article is a "rest of the world" case where my first impression of both the concept and the
Read the original article