How AI Gateway runs on Fluid compute
vercel.com·15h
Flag this post

6 min read

Nov 6, 2025

AI Gateway is a Node.js service for connecting to hundreds of AI models through a single interface. It processes billions of tokens per day. The secret behind that scale is Fluid.

When we announced its general availability, we highlighted how AI Gateway scales efficiently, routes requests securely, and simplifies connecting to multiple AI providers.

We looked at data from the first month of its availability, and while AI Gateway handled roughly 16,000 total runtime hours, only 1,200 of those hours involved actual CPU work (processing requests, routing logic, streaming responses). The remaining 14,800 hours were spent waiting for AI providers to respond.

Traditional serverless bills you for…

Similar Posts

Loading similar posts...