Gemini is a new multimodal artificial intelligence model from Google that can understand not just text but also images, video, and audio. It is described as capable of completing complex tasks in math, physics, and other areas, as well as understanding and generating high-quality code in various programming languages.
It is currently available through integrations with Google Bard and the Google Pixel 8 and will gradually be folded into other Google services.
"Gemini is the result of large-scale collaborative efforts by teams across Google, including our colleagues at Google Research," according to Demis Hassabis, CEO and co-founder of Google DeepMind. "It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information including text, code, audio, image, and video."