Amazon adds new features to SageMaker

The fully-managed machine learning service is getting new algorithms and framework support, as well as new features for managing machine learning pipelines.
Written by Stephanie Condon, Senior Writer

Amazon on Wednesday introduced a series of enhancements to SageMaker, its fully-managed machine learning service. SageMaker Workflows is a series of features that makes it easier to manage machine learning pipelines. Amazon is also introducing new built-in algorithms and new framework support, as well as new compliance standards and accreditation.

Amazon rolled out SageMaker at last year's AWS re:Invent conference -- along with a slew of other new services, many powered by machine learning. Ahead of this year's conference, Amazon Web Services has announced several improvements to those services, including updates to Amazon Polly, Transcribe and Translate. Amazon also just introduced predictive scaling for EC2 instances. SageMaker itself has seen nearly 100 new features added in the past year, Amazon noted.

Also: Top cloud providers 2018: How AWS, Microsoft, Google Cloud Platform, IBM Cloud, Oracle, Alibaba stack up

With SageMaker Workflows, customers are getting new automation, orchestration, and collaboration features for machine learning pipelines. For instance, SageMaker Search lets customers quickly find relevant model training runs, right from the AWS console. This will help them more easily find the right combination of datasets, algorithms and parameters for their models.

Also: Amazon's Cloud Cam finds the right balance for home security CNET

Workflows also includes Git integration and visualization for better collaboration and version control. Additionally, customers can now use Step Functions to automate and orchestrate SageMaker steps in an end-to-end workflow. SageMaker also now integrates with Apache Airflow, a popular open source framework, to author, schedule and monitor multi-stage workflows.

Amazon is also introducing new algorithms to SageMaker for detecting suspicious IP address (IP Insights), low dimensional embeddings for high dimensional objects (Object2Vec), and unsupervised grouping (K-means clustering). These built-in algorithms are all designed to support petabyte scale datasets. Amazon has also been adding new framework support. Soon, customers will be able to run fully-managed Horovod jobs for high scale distributed training, and scikit-learn and Spark MLeap for inference.

Also: 51% of tech pros say cloud is the no. 1 most important TechRepublic

In terms of compliance and accreditation, Amazon is adding SageMaker to its System and Organizational Controls (SOC) Level 1, Level 2, and Level 3 audits.

Cloud services: 24 lesser-known web services your business needs to try

Previous and related coverage:

What a hybrid cloud is in the 'multi-cloud era,' and why you may already have one

Now that the services used by an enterprise and provided to its customers may be hosted on servers in the public cloud or on-premises, maybe "hybrid cloud" isn't an architecture any more. While that may the case, that's not stopping some in the digital transformation business from proclaiming it a way of work unto itself.

Cloud computing: Here comes a major tipping point

Application spending has moved to the cloud fastest, but other areas of IT spending are catching up.

Related stories:

Editorial standards