Nvidia Rubin's 10x Cheaper Tokens Hide a Footnote (opens in new tab)
A single number is already loose in 2026 budget decks: up to 10x lower cost per token than Blackwell. That is Nvidia's headline for the Vera Rubin NVL72, launched at CES in January and detailed at GTC in March. Per Nvidia's newsroom and developer blog, the same rack also promises up to 5x greater inference performance and a 4x cut in the GPUs needed to train a mixture-of-experts model, all measured against the current Blackwell generation. If you are signing a GPU commit this quarter, that 10...
Read the original article