OpenAI developing real-time conversational audio

OpenAI is building a new bidirectional (BiDi) audio model designed to make AI conversations feel far more natural. Today’s turn-based voice systems wait for you to finish speaking and can’t adjust mid-response. BiDi instead processes speech continuously, so it can adapt immediately when users interrupt or change direction.

The prototype still glitches after a few minutes, which has delayed the launch from the expected Q1 timeline to potentially Q2 or later. Even so, the upgrade could significantly improve customer support bots, voice assistants, and smart devices that handle tasks like checking email or booking reservations.

Source.

@aipost