Google Launches Gemini: A Multimodal AI Model Outperforming OpenAI's GPT-4 : omgfin

Google Launches Gemini: A Multimodal AI Model Outperforming OpenAI's GPT-4 Print

Modified on: Thu, 7 Dec, 2023 at 12:09 AM

Google has unveiled Gemini, its latest artificial intelligence model, claiming it surpasses OpenAI's GPT-4 in intelligence and power. Gemini is a multimodal model optimized for various sizes and use cases, offering advanced capabilities in math, specialized coding, and multitask language understanding. Google's Gemini Pro has outperformed GPT-3.5 in benchmarks, making it a potent tool in the AI arms race. The model is seamlessly integrated into Google's products, including ChatGPT (Bard) and the Pixel 8 Pro smartphone.

In a significant development in the AI arms race, Google has officially launched Gemini, its latest artificial intelligence model, claiming superiority over OpenAI's GPT-4. The announcement was made by Google CEO Sundar Pichai and Google DeepMind CEO Demis Hassabis in a blog post on December 6.

Gemini: A Multimodal AI Model Beyond GPT-4

Gemini is designed as a multimodal AI model, optimized for various sizes and use cases, including Ultra, Pro, and Nano. One notable advantage Google claims over GPT-4 is Gemini's ability to handle advanced math and specialized coding tasks, setting it apart in the realm of large language models.

According to Google's chief scientist, Jeff Dean, Gemini Ultra achieves "state-of-the-art performance" across 30 out of 32 academic benchmarks used in large language model (LLM) development. It also excels in a massive multitask language understanding (MMLU) test, surpassing human expert performance with a score of 90%.

Furthermore, Gemini Pro outperforms GPT-3.5 in six out of eight benchmarks, making it a formidable presence in the AI landscape. AI expert Rowan Cheung noted that Gemini Pro stands as "the most powerful free chatbot on the market today."

Multimodal Capabilities and Advanced Programming Skills

Gemini is designed to seamlessly reason across text, images, audio, and video, providing an integrated approach to processing different types of information. Unlike GPT-4, Gemini is not limited to textual understanding but has been built from the ground up to incorporate various modalities.

The model's advanced programming skills include generating high-quality code using AlphaCode 2, an advanced code generation system. It can solve complex programming problems and collaborate effectively with developers.

Integration with Google Products and Services

Google has integrated Gemini into its products and services, with a fine-tuned version of Gemini Pro rolled out to Google's version of ChatGPT, known as Bard. This marks a significant upgrade to Bard, making it available in English in more than 170 countries. Gemini is also deployed on Google's flagship smartphone, the Pixel 8 Pro, powering features like Summarize in the Recorder app and Smart Reply in Gboard.

Google plans to expand Gemini's deployment across more products and services, including Search, Ads, and Chrome, in the coming months. The tech giant is also experimenting with using Gemini to enhance the generative experience in its web-dominant search engine.

Conclusion

Google's Gemini emerges as a powerful player in the AI landscape, positioning itself as a contender to OpenAI's GPT-4. The model's multimodal capabilities, advanced programming skills, and superior performance in benchmarks make it a significant advancement in AI technology. As Gemini is integrated into various Google products, its impact on applications like chatbots, language understanding, and programming tasks is expected to be substantial. The AI community will closely watch the ongoing developments in the AI arms race as companies vie for supremacy in the field of artificial intelligence.

(MARTIN YOUNG, COINTELEGRAPH, 2023)

Did you find it helpful? Yes No

Help Desk Software by Freshdesk