OpenAI has released a new ChatGPT bot that you can talk to
๐ Abstract
The article discusses the launch of an advanced AI chatbot by OpenAI, which represents the company's push into a new generation of AI-powered voice assistants. The new ChatGPT voice bot has enhanced capabilities compared to previous voice assistants, including the ability to detect and respond to different tones of voice, handle interruptions, and convey a wide range of emotions. The voice mode is powered by OpenAI's new GPT-4o model, which combines voice, text, and vision capabilities.
๐ Q&A
[01] Overview of the new ChatGPT voice bot
1. What are the key capabilities of the new ChatGPT voice bot?
- The new ChatGPT voice bot can detect and respond to different tones of voice, handle interruptions, and reply to queries in real time
- It has been trained to sound more natural and use voices to convey a wide range of different emotions
2. What is the technology behind the new voice feature?
- The voice mode is powered by OpenAI's new GPT-4o model, which combines voice, text, and vision capabilities
3. How is the new voice feature being rolled out?
- OpenAI is initially launching the chatbot to a "small group of users" paying for ChatGPT Plus, with plans to make it available to all ChatGPT Plus subscribers this fall
- A ChatGPT Plus subscription costs $20 a month
4. What safety features have been implemented for the new voice feature?
- OpenAI has created four preset voices in collaboration with voice actors to prevent the model from being used to create audio deepfakes
- The company has also applied the same safety mechanisms used in its text-based model to GPT-4o to prevent it from breaking laws and generating harmful content
[02] Future plans for the ChatGPT voice bot
1. What additional features are planned for the ChatGPT voice bot in the future?
- OpenAI plans to include more advanced features, such as video and screen sharing, which could make the assistant more useful
- These features will not be available immediately but at an unspecified later date
2. How has OpenAI tested the voice capabilities of the model?
- OpenAI says it has tested the model's voice capabilities with more than 100 external red-teamers, who spoke a total of 45 languages and represented 29 countries
3. What controversy has surrounded the use of voices in the model?
- When OpenAI first introduced GPT-4o, the company faced a backlash over its use of a voice called "Sky," which sounded a lot like the actress Scarlett Johansson
- Johansson released a statement saying the company had reached out to her for permission to use her voice for the model, which she declined
- OpenAI has denied that the voice is Johansson's but has paused the use of Sky