ChatGPT running GPT-4o can now speak and sing in real time. It can even view the real world through your phone's camera and describe what it sees as it happens.
The AI race has just shifted into high gear, with US artificial intelligence pioneer OpenAI rolling out a new interface that works with audio and vision as well as text. The new model, called GPT-4o, goes beyond the familiar chatbot features and is capable of real-time, near-natural voice conversations. OpenAI will also make it available to free users.
ChatGPT was already able to talk to users, but only with long pauses while it processed the data, so it often seemed sluggish. The company explained that the feature required three separate internal steps: transcribing the user's speech to text, processing that text and generating a reply, and converting the reply back to speech. Running these steps one after another caused the delays.
We talk to computer scientist Mike Cook from the renowned King's College London about the new GPT-4o development.
#artificialintelligence #chatgpt #openai
For more news go to: http://www.dw.com/en/