OpenAI’s ChatGPT is introducing a groundbreaking feature that enables the chatbot to respond with spoken words. Users can choose from five distinct personas, such as “Juniper” and “Breeze,” for an immersive conversation experience. This feature, available to ChatGPT Plus subscribers, brings a touch of humanity to AI interactions, positioning ChatGPT as a formidable rival to personal assistants like Siri and Alexa.
Sign up for your early morning brew of the BizNews Insider to keep you up to speed with the content that matters. The newsletter will land in your inbox at 5:30am weekdays. Register here.
OpenAI Gives ChatGPT the Ability to Speak in Five Different Voices
By Rachel Metz
Artificial intelligence startup OpenAI is rolling out a feature for its ChatGPT app that lets the chatbot respond to spoken questions and commands with speech of its own.
Starting over the next two weeks, users will be able to choose a voice in the chatbot app, picking from five personas with names like “Juniper,” “Breeze” and “Ember.” ChatGPT will then produce audio of the text it generates in that voice — for example, reading an AI-generated bedtime story out loud. The feature will be available to people who subscribe to OpenAI’s $20-per-month ChatGPT Plus service and enterprise users.
OpenAI released its ChatGPT app in May, and already offers a voice-to-text capability that lets users talk to the bot. Adding an audio response feature could create a sense that people are having a more human conversation. The company hopes the new feature will encourage on-the-go uses of its mobile app, putting it in closer competition with personal assistant offerings like Google’s Assistant, Apple Inc.’s Siri or Amazon.com Inc.’s Alexa.
Requests could include asking the program to talk about the history of Disneyland while driving to the theme park, or asking for a cocktail recipe while rummaging around in the kitchen. In a test of the tool, it ably narrated a story about a starfish and a rutabaga. However, while ChatGPT can come up with lyrics for songs, the app will decline to sing.
The voices of ChatGPT sound fairly human-like (though a close listen reveals a bit of a robotic monotone). OpenAI said it worked with voice actors to build the text-to-speech AI model that underlies the feature.
The company also said than in the coming weeks paid and enterprise users will be able to access a feature for GPT-4 — one of the AI models that powers ChatGPT — to submit a picture and a related question about it. For example, it will be possible to upload a picture of pink sunglasses and ask the chatbot to suggest an outfit to go with it, or to submit a picture of a math problem and request help solving it. The feature, which OpenAI announced earlier this year when it unveiled GPT-4, is available through the ChatGPT app and website.
Read also:
- Deepfakes: Blurring reality in the age of AI
- Comparative view: South Africa’s Internet speeds and prices against the World
- Empowering the workforce by upskilling and reskilling for the AI revolution
To contact the author of this story:
Rachel Metz in San Francisco at [email protected]
© 2023 Bloomberg L.P.