Thinking about the cost of inference (opens in new tab)
I’m considering how will prices per token change in the next 1-2 years, which given the pace of developments, is fringe futurology. The aim is to get something basic in paper that will let me or anyone else improve the heuristic as time and data show the trend.
Read the original article