OpenAI releases GPT-5 with native voice and real-time reasoning
Your voice app architecture just got 3x simpler. The old STT → LLM → TTS chain (Whisper + GPT-4 + ElevenLabs) collapses into a single API call. Latency drops from ~2–3s to under 400ms. Extended thinking means you can replace custom chain-of-thought prompting with a single flag. The 30% price cut makes previously uneconomical use cases viable at scale.