How OpenAI delivers low-latency voice AI at scale

OpenAI rearchitected its WebRTC infrastructure using a split relay and transceiver model to provide low-latency voice AI for millions of users.

Summary

OpenAI developed a custom "split relay plus transceiver" architecture to deliver high-performance, low-latency voice AI. By offloading WebRTC session management to specialized transceivers and routing media packets through a lightweight relay layer, the team avoided the complexity of exposing massive UDP port ranges from Kubernetes. This design preserves standard WebRTC compatibility while ensuring efficient global connectivity and scalable real-time performance for ChatGPT Voice and the Realtime API.

(Source: OpenAI)