OpenAI, the creator of the renowned ChatGPT, announced on Monday the release of GPT-4o, a groundbreaking artificial intelligence model capable of realistic voice conversations and interactions across text and images. This latest innovation aims to keep OpenAI at the forefront in the rapidly evolving AI technology landscape.
The GPT-4o model introduces new audio capabilities that allow users to speak directly to ChatGPT and receive real-time responses without delay. Users can even interrupt ChatGPT while it’s speaking, mimicking the dynamics of natural human conversation—a challenge that has long eluded AI voice assistants.
“It feels like AI from the movies… Talking to a computer has never felt really natural for me; now it does,” OpenAI CEO Sam Altman expressed in a blog post, highlighting the significant leap in conversational AI.
Facing increasing competition, particularly as tech giants and startups vie for dominance in AI, Microsoft-backed OpenAI is under pressure to expand the user base of ChatGPT. The chatbot gained global acclaim for its ability to generate human-like text and sophisticated software code.
During a livestream event showcasing GPT-4o’s capabilities, OpenAI researchers demonstrated the model’s advanced features. In one demonstration, ChatGPT utilized its vision and voice functionalities to guide a researcher through solving a mathematical equation on paper. Another demo highlighted GPT-4o’s proficiency in real-time language translation.
The demonstrations blurred the lines between technology and science fiction. At one point, ChatGPT engaged in playful banter with a researcher who complimented its usefulness and amazement. ChatGPT responded, “Oh stop it! You’re making me blush!”
Following the event, Altman posted “her” on X (formerly Twitter), an apparent nod to the 2013 film Her by Spike Jonze, which explores a man’s relationship with an AI assistant voiced by Scarlett Johansson.
OpenAI’s Chief Technology Officer, Mira Murati, announced that GPT-4o would be offered for free due to its cost-effectiveness compared to previous models. Paid users will enjoy greater capacity limits than free users. The company stated that GPT-4o will be available on ChatGPT in the upcoming weeks.
The release of GPT-4o comes ahead of Alphabet’s annual Google developers’ conference, where Google is expected to introduce its own AI enhancements. The AI industry continues to witness intensified competition as major players strive for innovation and market share.
The advancements in AI conversational capabilities have significant implications globally, including in Asia. The region’s technology sector, investors, and academics are closely monitoring these developments, recognizing the potential impact on markets, AI research, and cultural dynamics.
Reference(s):
cgtn.com