
OpenAI introduced Voice Interaction and Image Recognition

OpenAI has released a significant update to its ChatGPT app, adding two features: voice conversations and the ability to answer questions about images. The update extends OpenAI's flagship product beyond typed text and broadens the ways people can use it.

Voice Interaction: ChatGPT Gets Vocal

The most noteworthy addition to ChatGPT is its ability to hold voice conversations. Users can choose from five synthetic voices and talk with ChatGPT much as they would on a phone call: spoken queries receive spoken responses in real time.

Voice interaction relies on two models in addition to ChatGPT itself. First, Whisper, OpenAI's existing speech-to-text model, converts the user's spoken words into text. This text is then fed to ChatGPT, which formulates a text response. Finally, a new text-to-speech model converts that response into spoken words. Chaining the three stages lets users hold a spoken conversation with the chatbot without typing or reading.
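The three-stage flow described above (speech-to-text, chat model, text-to-speech) can be sketched as a simple chain of functions. This is only an illustration of the data flow, not OpenAI's implementation: the `VoicePipeline` class and the `fake_*` stand-in functions are hypothetical names invented for this example, and no real model or OpenAI API is called.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Hypothetical wrapper that chains the three stages of a voice turn."""
    transcribe: Callable[[bytes], str]   # speech-to-text stage (Whisper's role)
    respond: Callable[[str], str]        # chat stage (ChatGPT's role)
    synthesize: Callable[[str], bytes]   # text-to-speech stage

    def handle_turn(self, audio_in: bytes) -> bytes:
        text_in = self.transcribe(audio_in)   # spoken query -> text
        text_out = self.respond(text_in)      # text query -> text reply
        return self.synthesize(text_out)      # text reply -> spoken reply


# Stand-in stages (assumptions for illustration only): here "audio" is just
# UTF-8 bytes, so the example runs without any model or network access.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_respond(prompt: str) -> str:
    return f"You said: {prompt}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_transcribe, fake_respond, fake_synthesize)
reply_audio = pipeline.handle_turn(b"hello")
print(reply_audio)  # b'You said: hello'
```

Because each stage is passed in as a plain function, any one of them could be swapped for a real model without changing the other two.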

During a recent demo, OpenAI showcased the range of synthetic voices available. These lifelike voices were developed by training the text-to-speech model using the voices of hired actors, with a primary focus on creating voices that users could comfortably listen to for extended periods. This innovation opens up possibilities for further voice customization, potentially allowing users to create their own voices in the future.

Image Recognition: ChatGPT Deciphers Visual Content

The second major update lets ChatGPT answer questions about images. Users can now upload an image to the app and ask about its contents. The capability was first previewed when GPT-4 was announced in March 2023, and it marks a significant expansion of what ChatGPT can do.



©  2024 Soundigit Holdings Limited. All rights reserved.
