Amazon Web Services suffers partial outage
Summary: Parts of Amazon Web Services were hit by an outage in a North Virginia datacenter with many popular websites clocking off the Web for the evening.
Parts of Amazon Web Services (AWS) suffered an outage on Thursday which led to a spread of high-profile sites clocking off the Web for the evening.
Amazon was quick to update its cloud status --- its first update at 8:50 p.m. PDT --- stating the problems were due to a power outage in a Virginia datacenter.
It's the same datacenter that forced Quora, Foursquare and other major website to crumble in April 2011 as the cloud infrastructure began to fall from the sky. Since last year's outage, a detailed post-mortem noted the need for greater transparency and better communication with its customers.
As of this morning, "almost all affected EBS volumes have been brought back online" but some still report problems. It may take a few more hours for the service to fully recover.
Amazon's RDS service also fell down, but has since recovered from a multi-availability zone failure. However, a "small number of [database] instances remain unavailable" at 1.09 a.m. PDT.
Customers were quick to vent their frustration on Twitter, which thankfully isn't hosted by the AWS service.
Sites like Quora (it got hit again, bless it) and Hipchat, along with Heroku --- a division of Salesforce, and leading social movement Pinterest and file-hosting site Dropbox hit the stumbling block as a result of the outage.
It's a case of putting all of the Web's eggs all in the same basket. Or, at least in one case, all the tofu in one food truck. (I think he was kidding.)
Amazon Web Service, when it works ---and give it credit, we're talking the very vast majority of the time --- it works well. Amazon says it is "committed" to a 99.95 percent uptime, but other smaller, nimbler companies, as you might expect, offer a 99.99 percent uptime.
It doesn't mean that Amazon's cloud service will fall down on average 7 minutes a month, but it doesn't help when customers start calling to ask why their service is down, only to reassured that "most of the time it's up."
Earlier this week, Amazon announced its S3 online storage service hit the 1 trillion object milestone mark, equating to roughly 140 objects per person for everyone on the planet.
Image source: Twitter.
Related:
- Amazon's Web Services outage: End of cloud innocence?
- Amazon S3 hits one trillion objects
- Amazon Web Services lands more SAP certifications
- Amazon Web Services revamps enterprise support, tiers
Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.
Talkback
"single Availability Zone(s)" are scary.
For the affected technology companies, I would say yes.
a) to do this cloud thing "right" it will cost about 1/3 more
b) we can tolerate an SLA outage, so why spend the extra money
Choice "a" is a tough sell when everyone in the room is focused on upfront and ongoing costs. You get what you pay for.
There words "Single" and "Availability" and "Zone" sum it up
It's simply poor system design to have any meaningful application or service dependent on one data center (in AWS, your own data centers or other cloud providers). AWS makes it easy to develop multi-zone and even multi-region applications - I'm really surprised Heroku and others who were affected haven't invested the effort to operate with a zone failure.
My company has plenty of AWS hosted apps, many using RDS, and we sailed right past the failure (with no intervention required).
Reliability
There are just too many single points of failure for me to be comfortable using the cloud as my sole back end.
Live by the cloud, die by the cloud
I don't know how wide spread it was, but it certainly affected more than one business here (San Diego area). Really makes me feel comfortable about services and data storage in the cloud. (Did you detect my sarcasm?)
@Shara8
Most of these systems are highly fragile and depend on lots of complex legacy back-end infrastructure which is VERY hard to move to the cloud.
Sad, yet true
Fail to plan....and you indeed plan to fail
Everyone on Twitter, Blogs, and in this forum that rattle their sabers that this is the reason they don't trust the cloud should really consider other careers. Kind of reminds me of the guy that still owned a horse whip & buggy store saying cars will never work out.
Beware the Cloud
Zones?
Bait and Switch
The cloud providers will endure more bad PR until they price their services appropriately. If they wish to call it "cloud", then include one additional zone as part of the base costs to differentiate between what they provide versus pure IAAS from large ISPs.
Discouraging news
- Sara
http://www.hireamobileappdeveloper.com/
Outage overshadows security threat...