
OpenAI makes GPT-4 Turbo with Vision available to developers to unlock new AI apps

A multimodal model like GPT-4 Turbo with Vision can take AI chatbots and other AI applications to new heights by combining text and image understanding.
Written by Sabrina Ortiz, Editor

OpenAI is best known for its advanced large language models (LLMs), which power some of the most popular AI chatbots, such as ChatGPT and Copilot. Multimodal models can take chatbot capabilities to new heights by enabling a whole new range of visual applications, and OpenAI just made one available to developers. 

On Tuesday, via an X (formerly Twitter) post, OpenAI announced that GPT-4 Turbo with Vision, the latest GPT-4 Turbo model with vision capabilities, is now generally available to developers via the OpenAI API. 

Also: How to use ChatGPT

This latest model retains GPT-4 Turbo's 128,000-token context window and December 2023 knowledge cutoff. The main difference is its vision capability, which allows it to understand images and other visual content. 

Before GPT-4 Turbo with Vision was made available, developers had to call separate models for text and for images. Now they can call a single model that handles both, simplifying their pipelines and opening the door to a wide range of use cases. 
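
For a sense of what that looks like in practice, here is a minimal sketch of a combined text-and-image request using the official openai Python SDK. The model identifier "gpt-4-turbo" and the image URL are assumptions for illustration, not details confirmed in OpenAI's announcement; check the API documentation for the current model name.

# Minimal sketch: one request that mixes text and an image.
# Assumes the openai Python SDK (v1+) and an OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4-turbo",  # assumed identifier for GPT-4 Turbo with Vision
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What is shown in this image?"},
                {
                    "type": "image_url",
                    "image_url": {"url": "https://example.com/photo.jpg"},  # placeholder URL
                },
            ],
        }
    ],
    max_tokens=300,
)
print(response.choices[0].message.content)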

Also: The best AI image generators of 2024: Tested and reviewed

OpenAI shared some ways developers are already using the model, and they are pretty fascinating.

For example, Devin, an AI software-engineering assistant, leverages GPT-4 Turbo with Vision to better assist with coding. The health and fitness app Healthify uses it to scan photos of users' meals and return nutritional insights through photo recognition. And Make Real uses it to turn a user's drawing into a working website. 
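
As a rough illustration of the Healthify-style pattern (a sketch under assumed details, not the company's actual code), the snippet below base64-encodes a local meal photo into a data URI and sends it with a text prompt in a single request. The file name, prompt, and model identifier are all hypothetical.

# Illustrative sketch: send a local photo plus a text prompt in one request.
import base64

from openai import OpenAI

client = OpenAI()

# Hypothetical local photo of a meal
with open("meal.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

response = client.chat.completions.create(
    model="gpt-4-turbo",  # assumed identifier for GPT-4 Turbo with Vision
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Estimate the calories and macros in this meal."},
                {
                    "type": "image_url",
                    # Local images can be passed inline as a base64 data URI
                    "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"},
                },
            ],
        }
    ],
)
print(response.choices[0].message.content)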

While GPT-4 Turbo with Vision is not yet available inside ChatGPT or to the general public, OpenAI teased that it will soon come to ChatGPT. If you are a developer looking to build with OpenAI's GPT-4 Turbo with Vision API, you can learn how to get started here.
