Sort providers by cost, latency, or throughput on AI Gateway (opens in new tab)
**Published:** May 15, 2026 | **Authors:** Walter Korman, Jerilyn Zheng --- You can now sort the providers behind a model by cost, time to first token (TTFT), or throughput (TPS) in AI Gateway. The default provider order blends provider reliability, quality of model output, cost, and speed of response. You can now use `sort` for explicit control over ranking criteria. For models with many providers and noticeable cost or speed variation, you can use `sort` to optimize on your dimension of...
Read the original article