AI voice agents are getting closer to doing more than waiting their turn to speak. OpenAI announced Thursday that it is ...
The three are GPT-Realtime-2, a successor to the company’s existing realtime voice model with what OpenAI describes as GPT-5-class reasoning; GPT-Realtime-Translate, a live translation model with more ...
GPT‑Realtime‑Whisper is a new streaming transcription model built for low-latency speech-to-text. It transcribes audio as ...
The launch ⁠of ⁠the application programming interface (API) moves the ChatGPT-maker beyond ​transcription and chat toward ...