Monitor LLM routing with the Kubernetes Inference Extension (opens in new tab)

Covers vLLM

Learn how to use inference-aware routing for your LLM workloads in Kubernetes, and how to monitor performance with Datadog.