After much anticipation, OpenAI finally plans to introduce voice mode in ChatGPT. According to sources, OpenAI will roll out its voice mode feature.
After suffering major backlash, OpenAI finally launches its ChatGPT’s advanced voice mode.This backlash is expected to have delayed the launch from May end to July end.
ChatGPT in voice mode
According to OpenAI, the Advanced Voice Mode will be limited to ChatGPT’s four preset voices. This included Juniper, Breeze, Cove and Ember which are made in collaboration with paid voice actors. As of now the alpha version will be available to a small group of ChatGPT Plus users.
OpenAI further added that the feature will gradually roll out to all Plus users in the fall of 2024. It is believed that the new voice features will utilise OpenAI’s cutting-edge AI model to directly process and understand audio inputs.
Additionally according to an official video posted on Instagram, the new voice mode aims to enable a more seamless and efficient voice interaction experience without the need for intermediate text conversion.
How can you use the update
OpenAI said in an official blog post, that the new voice capability is powered by a new text-to-speech model, capable of generating human-like audio from just text and a few seconds of sample speech. So how can you use the voice mode? Given below are the steps to enable the voice mode:
- To get started with voice, head to Settings
- Then tap on New Features on the mobile app and opt into voice conversations
- Then, tap the headphone button located in the top-right corner of the home screen
- Finally choose your preferred voice out of five different voices.
In addition to this OpenAI claimed that they also use Whisper, which is their open-source speech recognition system, to transcribe user’s spoken words into text.
Other updates
Moreover, “ChatGPT cannot impersonate other people’s voices, both individuals and public figures, and will block outputs that differ from one of these preset voices,” OpenAI spokesperson Lindsay McCallum, said in a statement.
Reportedly, when OpenAI introduced the GPT-4o’s voice feature in May, it stunned audiences with its human-like tone and rapid responses. However, the voice seemed quite similar to Scarlett Johansson’s voice for the character Samantha in the film Her. The actor later took legal action against the company. However, in spite of the allegations OpenAI denied using Johansson’s voice. It is believed that OpenAI later removed the voice shown in its demo.
Follow FE Tech Bytes on Twitter, Instagram, LinkedIn, Facebook.