OpenAI Outlines WebRTC Architecture for Low-Latency Voice AI at Scale (opens in new tab)
OpenAI recently outlined how it adapted WebRTC for low-latency voice AI at global scale. The new architecture replaced a conventional media termination model with a relay-transceiver design better suited to Kubernetes and cloud load balancers. It keeps WebRTC session state in a dedicated transceiver layer while using relays to reduce public UDP exposure and keep media routing close to users. By Eran Stiller
Read the original article