The low-latency accelerator is a good fit for machine learning inference use cases like visual search, interactive speech and video recommendations, Google said. As GPUs become increasingly important for running AI workloads, public cloud providers like Google, IBM, and Oracle have been competing to offer the latest from Nvidia on their cloud infrastructure.
For graphics-intensive applications, Google is also adding support for virtual workstations with Nvidia GRID on the P4 and the P100. GCP is partnering with Teradici as well to deliver virtual workstations running on Google Compute Engine. Virtual workstations should help customers that need to run cloud-based, compute-intensive tools, such as entertainment studios producing 3D content or companies in industries like oil and gas.
GCP customers can attach one or more P4s to any machine type. P4s are now available in zones in us-central1 (Iowa), us-east4 (N. Virginia), northamerica-northeast1 (Montreal) and europe-west4 (Netherlands), with more regions coming soon.