How resilient is your cloud service provider?

How resilient is your cloud service provider?

Summary: Given recent examples of outages, what questions should you be asking your cloud service provider about resiliency?


Recent outages from Amazon and Google have got me thinking about resliency in the cloud. When you use a cloud service, whether you are consuming an application (backup, CRM, email, etc), or just using raw compute or storage, how is that data being protected? A lot of companies assume that the provider is doing regular backups, storing data in geographically redundant locations or even have a hot site somewhere with a copy of your data. Here's a hint: ASSUME NOTHING. Your cloud provider isn't in charge of your disaster recovery plan, YOU ARE!

Yes, several cloud providers are offering a fair amount of resiliency built in, but not all of them, so it's important to ask. Even within a single provider, there are different policies depending on the service, for example, Amazon Web Services  which has different policies for EC2 (users are responsible for their own failover between zones) and S3 (data is automatically replicated betwen zones in the same geo). Here is a short list of questions I would ask your provider about their resiliency:

  • Can I audit your BC/DR plans?

    • Can I review your BC/DR planning documents?

  • Geographically, where are your recovery centers located?

    • In the event of a failure at one site, what happens to my data?
    • Can you guarantee that my data will not be moved outside of my country/region in the event of a disaster?

  • What kinds of service-levels can you guarantee during a disaster?

    • What are my expected/guaranteed recovery time objective (RTO) and recovery point objective (RPO)?

  • What method do you use to backup data (tape, disk, etc)? How often are backups occuring?

    • If I have data loss, what is the protocol for restoring from backup?
    • What is the retention policy for these backups?
    • Where are the backup copies being stored?

  • How resilient is your data center facility?

    • Is it a Tier III or IV equivalent according to the Uptime Institute? 
    • Is is SAS-70 Type II compliant?

I'm sure there's more questions that I haven't thought of, but I think this list is a good starting place. I'd love to get input from all of you, do you audit your cloud providers for resiliency? What other questions should we be asking?

Topics: Storage, Data Management

Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.


Log in or register to join the discussion
  • ElasticHosts cloud servers offer an alternative

    For all customers affected by EC2 downtime, I would like to recommend ElasticHosts as an alternative cloud service ( - we offer a 5 day free trial for our cloud servers in US or UK, which is likely enough at least to bridge the gap.
    ElasticHosts cloud servers
  • Suckers!

    There is no such a thing as a safe cloud service. Never will be. Too many variables - most obviously the Internet. Comcast had a fiber cut in Florida today taking out Interent access for 18,000 customers - what is your cloud provider going to do about that? Not a darn thing. Good luck with your Cloud plans. Hee Hee.
  • phzmfyv 82 wqa

    dliuzy,jlcnqsjw42, zfkee.