OpenAI launches new voice intelligence features in its API

TechCrunch
OpenAI has introduced new voice intelligence API features, including real-time translation, advanced conversational models, and live transcription for developers.

Summary

OpenAI has expanded its API with three new voice intelligence capabilities: GPT-Realtime-2, featuring advanced reasoning; GPT-Realtime-Translate, supporting over 70 input languages; and GPT-Realtime-Whisper for live speech-to-text. These tools are designed to move voice interfaces beyond simple call-and-response, enabling apps to listen, reason, and take action during conversations. While targeted at industries like customer service and education, OpenAI has implemented safety guardrails to prevent misuse, fraud, and spam.

(Source:TechCrunch)