ChatGPT with Enhanced Voice Mode is now available to some paid users, the company announced Tuesday. OpenAI first announced the feature during its spring event in May. Powered by its latest artificial intelligence (AI) model GPT-4o, OpenAI’s enhanced voice mode offers features like real-time feedback, natural-sounding voices, and the ability to detect user emotions. The company said all ChatGPT Plus users will benefit from the feature this fall. However, it’s unclear when the video and screen sharing features, which were also introduced at the event, will launch ChatGPT Advanced Voice Mode Rolls.
OpenAI rolls out enhanced voice mode for ChatGPT
OpenAI announced the rollout of ChatGPT’s enhanced voice capabilities in a post on X (formerly known as Twitter). The company highlighted that the new voice mode will allow users to interrupt the AI chatbot at any time and provide more natural interactions with voice modulation. A short video was also shared showing how to enable the feature once it’s live.
According to the video, the selected group of ChatGPT Plus users will see an invitation message at the bottom of the screen inviting them to try the enhanced voice mode after opening the app. Tapping on it will take users to a new page with the title “You are invited to try the enhanced voice mode” and a button to enable the feature.
The feature is currently available to a small group of Plus users, but the company has not yet specified eligibility criteria. Dubbed an alpha rollout, the feature is powered by OpenAI’s latest flagship extended language model (LLM), GPT-4o.
Explaining the reason for the delay, the AI company said: “Since we first introduced Enhanced Voice Mode, we have been working to enhance the security and quality of voice conversations as we prepare to bring this advanced technology to millions of people. »
OpenAI also emphasized that GPT-4o’s voice capabilities have been tested with more than 100 external red team members in 45 languages. Red teams are cybersecurity experts responsible for testing the security of a product or organization by simulating cyberattacks and cracking attempts. The goal of this process is to detect vulnerabilities in a system before it goes live. Currently, you can only access four preset voices after the feature is rolled out to your account. Sky, the controversial voice said to bear a resemblance to actress Scarlett Johannson, has not yet been added to ChatGPT..