Amazon Web Services outage: 'Detailed post mortem' coming

Amazon Web Services outage: 'Detailed post mortem' coming

Summary: Amazon Web Services is operating normally for most customers and the company said it will post a detailed post mortem on what went wrong in its Northern Virginia data center.

SHARE:

Amazon Web Services is operating normally for most customers and the company said it will post a detailed post mortem on what went wrong in its Northern Virginia data center.

On its dashboard, Amazon Web Services (AWS) wrapped up a multi-day outage with the following post:

As we posted last night, EBS (Elastic Block Store) is now operating normally for all APIs and recovered EBS volumes. The vast majority of affected volumes have now been recovered. We're in the process of contacting a limited number of customers who have EBS volumes that have not yet recovered and will continue to work hard on restoring these remaining volumes...

We are digging deeply into the root causes of this event and will post a detailed post mortem.

That post mortem is going to be interesting. In the meantime, tech observers have posted a few post mortems of their own. The takeaways appear to be:

  • Architecture for failures.
  • If you're solely on AWS use multiple Availability Zones.
  • Have multiple cloud providers for infrastructure.

Items worth a read:

Previous coverage:

Amazon's N. Virginia EC2 cluster down, 'networking event' triggered problems

Topics: Amazon, Browser, Cloud, Enterprise Software, Networking, Outage, Software

Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.

Talkback

3 comments
Log in or register to join the discussion
  • RE: Amazon Web Services outage: 'Detailed post mortem' coming

    Everyone knows by now it was Skynet.
    youngmaester@...
  • my sources told me that it was sabotage

    We all know who the main suspect is by by now and hope that DOJ will get involved.
    Linux Geek
  • Backup, backup, backup

    To err is human, always plan for disasters. Even startups who chose to use Amazon should take some of that money that they're saving by using cloud technology and invest it into fail over mechanisms.

    As the article states, "If you?re solely on AWS use multiple Availability Zones."
    mikerosile