Microsoft's Azure cloud leap-day meltdown
Summary: Everyone makes mistakes, but for Microsoft to make a killer leap day blunder with its Azure cloud service is inexcusable.
Sometimes, Microsoft can make great programs, Windows 2008 R2 and Windows 7 SP1. And, sometimes they can blow it, Vista and, from what I've seen so far, Windows 8. But every now and again Microsoft fouls up in such a spectacular fashion that I'm left to wonder how anyone can use them for mission-critical work. There was the London Stock Exchange failure, which is one reason why almost all the world's leading stock exchanges now use Linux. Microsoft's Azure cloud collapse may prove to be a similar turning-point for Microsoft's cloud service.
In case you missed it, on the same day Microsoft fans were slapping themselves on the back for Windows 8 Consumer Preview getting out the door, Microsoft's Windows Azure Platform-as-a-Service (PaaS) cloud suffered a worldwide meltdown. For almost 36-hours, Windows Azure Service Management was down.
Even after Microsoft had a fix in, faults continued to spread across the Azure cloud in America and Northern Europe. As some areas came back up Compute functionality in the North Central US, South Central US and North Europe regions, functionality was downgraded or even turned off on a range of Azure services.
What caused Azure to fall down and go boom? Microsoft hasn't really spelled out what happened yet but, according to Bill Laing, Microsoft's Corporate VP of Server and Cloud, "Yesterday, February 28th, 2012 at 5:45 PM PST Windows Azure operations became aware of an issue impacting the compute service in a number of regions. The issue was quickly triaged and it was determined to be caused by a software bug. While final root cause analysis is in progress, this issue appears to be due to a time calculation that was incorrect for the leap year."
Well, who could blame Microsoft for that? I mean how often do we get a leap year... Oh wait, we get a leap year once every four years! Who knew? Apparently not Microsoft's developers.
This is incredible. How in the world can a company the size of Microsoft make such a simple, stupid mistake as not accounting for a leap day in its most important cloud service? How can any business trust a cloud that can go out of service because of a programming blunder that would get a failing mark in a software development 101 class? I don't know. I really don't.
I do know that businesses putting all their computing eggs into one Azure basket led to untold damages. If you want to continue to take chances with Azure, good for you. Just be ready to explain to your board of directors exactly why you thought trusting Azure was a smart move. Good luck with that.
Azure's failure, while an especially spectacular one, reminds me again just how vulnerable any business that puts its trust into the cloud model is. No cloud, not even one built on Linux or open-source cloud technologies such as Eucalyptus and OpenStack is immune to major problems. You need to carefully plan for cloud failures no matter whose cloud you use.
That said I will also say that in the open-source model, where with many eyeballs on the code all bugs are shallow, I'm sure that we'll never see a kiddie programming mistake take out a global cloud the way Azure fell apart. Clouds are dangerous enough as they are for enterprises, if you can't trust their code, how can you trust your company's business to them at all?
Related Stories:
Windows Azure suffers worldwide outage
Microsoft's Windows Azure has a meltdown
Is uptime the wrong metric for cloud service-level agreements?
Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.
Talkback
But ... but ...
People should start
How many times must they blunder or how many $$'s must be lost due to Microsoft's Ineptitude?
That is why there are so many Linux openings
It is not a question of if it will be hacked or fail, but when. :|
Clouds? There's more than MS Azure at play here. Reread the article.
Sounds like Steven is more down on Clouds than he is Microsoft.
[i]That said I will also say that in the open-source model, where with many eyeballs on the code all bugs are shallow, Im sure that well never see a kiddie programming mistake take out a global cloud the way Azure fell apart. Clouds are dangerous enough as they are for enterprises, if you cant trust their code, how can you trust your companys business to them at all?[/i]
*This* is what everyone should be fighting over. Open-source code is more bug-free than proprietary code? Really!? I wonder what percentage of the Linux fans that post here download and review the source code.
If Steven's assertion *is* true, then shouldn't the open-source Chromium browser be more bug-free than the proprietary Chrome browser? And shouldn't CynaogenMod be more bug-free than the proprietary Android on the pretty devices you buy? Since Steven doesn't wave his pom-poms for Chromium and CyanogenMod, I say [b]hypocrisy[/b].
fdsdfd
Wholesale Thermometer http://www.chinawholesaletown.com/wholesale-Tin-CD-Case/ Wholesale Bedding Wholesale lable
Wholesale Accessories http://www.chinawholesaletown.com/wholesale-Beach-Ashtray/ Wholesale Mp3 Photo Frame
Bar Caddy http://www.chinawholesaletown.com/wholesale-Perpetual-Calendar/ Stuffed Animals World Cup Products
Menu Holder http://www.chinawholesaletown.com/wholesale-Wine-Bottle-Cover/ Abacus China Wholesale
Audio Video Equipment http://www.chinawholesaletown.com/wholesale-Cake-Towels/ Reflective Safety Vest Wholesale Toys
Wholesale USB Products http://www.chinawholesaletown.com/wholesale-Jewelry-Loupe/ Promotional Items Golf Pouch
Wholesale Tellurion http://www.chinawholesaletown.com/wholesale-Jewelry-Loupe/ Wholesale Binoculars Silicone Products
Waterproof Beach Case http://www.chinawholesaletown.com/wholesale-Frosty-Beer-Mug/ Wholesale USB Flash Drive Wholesale Puzzle
Wholesale Stress Ball http://www.chinawholesaletown.com/wholesale-Key-Chain-Bottle-Opener/ Wholesale Umbrella Wholesale Swimming Products
Water Filter Bottle http://www.chinawholesaletown.com/wholesale-Survival-Card/ Digital Spoon Scale Glass Rimmers
Abacus http://www.chinawholesaletown.com/wholesale-Tangle-Puzzle/ Bar Holder Tray Vocal Concert Products
Mini DV http://www.chinawholesaletown.com/wholesale-BMI-Tape-Measure/ Wine Bottle Cove Wholesale Golf Products
Wholesale Fan http://www.chinawholesaletown.com/wholesale-Ring-Opener/ Corner Flag Wholesale Binoculars
Wholesale Clothes Rack http://www.chinawholesaletown.com/wholesale-Metal-Money-Bank/ Highlighter Pen Waterproof Hard Case
Ice Bottle http://www.chinawholesaletown.com/wholesale-Highlighter-Pen/ Gift Box Entertainment Supplies
Glass Rimmers http://www.chinawholesaletown.com/wholesale-Poncho-With-Key-Chain-Ball/ Inflatable Products Wholesale Album
Coca Cola Glass http://www.chinawholesaletown.com/wholesale-Waterproof-Hard-Case/ Mini Hockey Stick Gloves Clapper
Outdoor Leisure Products http://www.chinawholesaletown.com/wholesale-Clip-Dispenser/ Silicone Products Promotional Products
Notepad Calculator http://www.chinawholesaletown.com/wholesale-Water-Filter-Bottle/ Crystal Gifts Wholesale Towel
World Cup Products http://www.chinawholesaletown.com/wholesale-Jute-Bag/ Wholesale Pin Tube Cooler
Wholesale Wallet http://www.chinawholesaletown.com/wholesale-Rolling-Cooler-Bag/ Wholesale Watch Wholesale Scissors
Ring Mug http://www.chinawholesaletown.com/wholesale-Digital-Money-Bank/ Wholesale Mat Wholesale Cooler
Wholesale Cards http://www.chinawholesaletown.com/wholesale-Wine-Tote-Bag/ Perpetual Calendar Wholesale Belt
Wholesale Puzzle http://www.chinawholesaletown.com/wholesale-Foam--Hand/ Electroluminescent Fishing Supplies
Consumer Electronics http://www.chinawholesaletown.com/wholesale-Pepper-Spray/ Mouse Pad Bingo Bag
Christmas Gifts http://www.chinawholesaletown.com/wholesale-Golf-Counter/ Writing Instrument Sport Support Products
Gift Box http://www.chinawholesaletown.com/wholesale-Wedding-Favors/ Wholesale Bangle Water Spray Fan
Wholesale Knife http://www.chinawholesaletown.com/wholesale-Whistle-Buckle/ Manicure Set Dog Waste Bag Dispenser
Wholesale Candle http://www.chinawholesaletown.com/wholesale-UV-Pen/ Wholesale Tableware Banner Stand
Wholesale Gift Bags http://www.chinawholesaletown.com/wholesale-Electric-Heating-Mugs/ Heating Products Waterproof Beach Case
Wholesale Tag http://www.chinawholesaletown.com/wholesale-Note-Pad-Holder-Calendar/ Voice Recorder Electrical Gifts
Wholesale Mirror http://www.chinawholesaletown.com/wholesale-Leather-Tape-Measure/ Frosty Beer Mug Wholesale Bracelet
Voice Recorder http://www.chinawholesaletown.com/wholesale-Level-Ruler---Digital-Level/ Baby Products Suppliers Wholesale Shoe
Ice Players Stick http://www.chinawholesaletown.com/wholesale-Bar-Caddy/ Wholesale Coaster Corner Flag
Wow back to the tired discredited LSE FUD dead horse again?
The great thing about outsourcing...
Never mind that MS apparently failed to test for this particular contingency.
Reply to otaddy:
Then maybe customers should be more careful about patronizing big companies.
I agree, they own it, their responsibility to test and accept it.
Not trying to make excuses for MS here, they failed big time in their quality control efforts, but I understand the difficulties of working for a giant company.
Dumber than bricks
The rest of your post is pants, starting with your excusing of the LSE fiasco. Microsoft's systems were at the heart of the problems there. Microsoft have not released enough information about the problem and why it's still bouncing around the world. So, how do you think you know anything about it?
re lse i have read back at the time what the cause was and it was an
Of course he's dumber then bricks
Microsoft Hater!
A bug in Linux based appliance code
Until those routers and switches fail, then he is as quiet and secreted as the proverbial church mouse.
:|
Just like Android is Linux
Get back to him when this level of failure is reported
[i]"According to a Microsoft dashboard update at 2:30pm around 37 percent of Azure Compute services in North Europe, 6.7 percent of North Central US and 28 percent of South Central US, were affected by the problems."[/i]
And a cynic might suggest that the party in question might just be understating the problems, but that's just a cynic talking.
Agreed
It is rather embarassing
Note to SJVN: This article really belongs in your Networking blog, not "Linux and Open Source".
It's more than embarrassing...
[b]Although we discovered our error via TESTING![/b]
Biased commentary: the only bug-free process of mankind
To quote the author
But I hope my mistakes are less obvious (or disasterous) than this one.