How persistent is the inference cost burden? (opens in new tab)
RL scaling might be better than it looks, and inference costs to reach a capability level fall fast
Read the original articleRL scaling might be better than it looks, and inference costs to reach a capability level fall fast
Read the original article