OpenAI’s new demo shows developers how to build real-time voice agents with handoffs, tools and production-style ...
The OpenAI ChatGPT Realtime API, now available in public beta, is transforming how developers create low-latency, multimodal applications. By seamlessly integrating speech, text, and function calling ...
What if your next phone call with customer support didn’t feel like a frustrating maze of robotic prompts but instead like a natural, empathetic conversation? Imagine an AI that not only understands ...
Agora's Conversational AI Engine offers key enhancements to the Realtime API for more natural communication and interaction. This milestone builds on Agora's partnership with OpenAI, as the Realtime ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
The company behind ChatGPT wants you to build with its voice AI model. OpenAI announced its most advanced speech-to-speech AI model yet, GPT-Realtime. The new model, now available through OpenAI’s ...
Nearly a year after the developer preview was introduced, OpenAI released the GA version (General Availability) of the Realtime API in August 2025. The Realtime API is a multimodal interface that ...
AI thrives on data but feeding it the right data is harder than it seems. As enterprises scale their AI initiatives, they face the challenge of managing diverse data pipelines, ensuring proximity to ...
OpenAI’s new GPT-Realtime model and Realtime API updates bring lifelike voice AI, phone calling, and image input to everyday apps. If you’ve ever wished that talking to an AI felt more like chatting ...