RBS Bank joins the IT failures 'Hall of Shame'
Summary: An IT failure hits customers in the worst way possible.
Most IT failures go unnoticed by the public - backend systems break, projects go over-budget, or perhaps a IT misses an internal deadline. Such was not the case with this week's IT failure at the UK-based Royal Bank of Scotland (RBS), which, incidentally, has a balance sheet the size of the UK economy.
Also Read:
Key questions on the massive RBS / NatWest IT failure
RBS gives more detail on IT failure train wreck

Image credit: iStockPhoto
According to Reuters, a bad software upgrade made RBS' systems unable to process payments for both individual and business customers. The upgrade failure also hit systems from British bank NatWest and Ireland's Ulster Bank. The BBC describes what happened and explains the ripple effect that occurred when the RBS systems went down:
The bank says there was a failure in the computer system that carries out the overnight transfer of money between accounts.
So, even though payments had been made - such as a business paying wages to staff - this did not show up on their account balances. In turn this meant many customers could not make payments themselves, such as paying rent to their landlord.
The failure effectively caused a traffic jam in the system. It created a huge backlog in updating account balances, which the bank has been trying to clear for some days.
Customers of other banks have also been affected because some payments from RBS, NatWest and Ulster Bank customers have not come through
In what must be an IT failures understatement of the year, RBS chief executive, Stephen Hester, commented:
It shouldn't have happened and we are very sorry
The BBC's Business Editor speculated that outsourcing arrangements interfered with the bank's ability to isolate and solve the problem:
As I understand it, one reason why RBS has not given much detailed information about why its services have been so badly disrupted is that so much of the operational responsibility for IT is outsourced - so there is a sensitive issue of where to attribute blame.
In my conversations with RBS bankers, there is an implication that outsourcing contributed to the problems - though they won't say whether this is an issue of basic competence or of the complexities of coordinating a rescue when a variety of parties are involved.
Adding to the confusion, Information Age reports that the problems were entirely self-made within RBS:
An RBS spokesperson said this morning that the technical fault, to her knowledge, had taken place on RBS' own systems, and not those of a supplier or outsourcer.
Although the problem upgrade occurred on Tuesday night and was fixed by Friday, downstream effects will continue through this week. This IT failure continues to have dramatic negative impact on bank customers, many of whom do not have access to their own money.
Also read: ZDNet UK: Natwest, RBS customers hit by balance glitch BusinessWeek: RBS Systems Failure Unlikely to Be Resolved Until Monday London Evening Standard: NatWest chaos heads into weekend Irish Times: Angry customers seek answers BBC: Ulster Bank 'needs week to clear IT failure backlog' The Telegraph: NatWest computer glitch 'fixed but backlog remains'? NatWest: Helpful Banking
A comment thread, containing thousands of messages, on the bank's online customer service forum describes travelers stranded abroad, home purchase closings that did not go through, and similar stories of difficulty due to customers not having access to funds. Ireland's Minister for Social Protection said that up to 30,000 people did not receive social welfare payments, even though funds had been withdrawn from government accounts. To assist customers, RBS opened 1200 bank branches on Sunday.
A SHAMEFUL PERFORMANCE
Although IT failures happen, this impact of this one is huge by any standard and several important questions remain unanswered:
- What actually happened? Was this an outsource problem, an in-house problem, or maybe the RBS technology organization is so complex that it's impossible to isolate exact cause.
- How can one of the world's largest banks roll out an upgrade without sufficient testing? No matter how you look at it, the upgrade created unexpected issues upon deployment. That's why we test, test, and test again, even though it appears RBS did not test enough.
- Why the long and difficult recovery? Presumably, a bank this size has worked out its business continuity and recovery plans with a level of efficiency only possible for a huge organization. Or maybe not.
- What accountability does the government hold over banks that evince such apparently poor IT processes as this? IT failures do not just happen randomly or in a vacuum. I suspect process and technology issues will emerge as warning signs that management and technical workers ignored.
THE BOTTOM LINE
This IT failure is sad for RBS and far worse for its customers. I urge the UK government to conduct a detailed review of RBS management, business, and technical processes with respect to all aspects of IT. It appears that RBS did not follow important standard practices in areas such as testing and business continuity planning.
Government regulators should hold RBS to account with stiff fines and other punishments as appropriate. Regulators should treat this IT failure with the same severity and determination they would bring to operational failures in any other part of a major bank.
Kick off your day with ZDNet's daily email newsletter. It's the freshest tech news and opinion, served hot. Get it.
Talkback
Who are the outsourcers involved?
LOL - i bet.
Maybe now...
Doubt it - it'll take at least dozen of failures like this i'm afraid.
Taxpayer picks up tab
It's a good job we can afford to pay the high salaries needed to attract world class managers. I hate to think what could have happened. (Irony ends)
Hester is still at RBS
Surely he should have been off like a flash to foreign shores, or have we called the bankers' bluff?
Or play the long game?
I think the lesson here is that if you offshore jobs to make a buck thats fine if you can maintain quality, however at the salaries they were offering in India (low even by Indias standards) you get what you pay for
Outsourcing
Couldn't someone in India do Hester's job for a fraction of the price?
You're right
With wage inflation running at what it is India and China are becoming less attractive (it'll be Brazil and Chile next) but still viable, I don't like outsourcing but see why businesses do it. Whats unspeakably dumb is paying so little that you don't get reasonably talented staff in those countries and thats where you get the horror stories like RBS.
For a fuller explanation
What puzzles me is why it took three days to restore the corrupted scheduler files.
The other lesson learned is that it is not enough to test the software, you have to test the installation too. You can get the software working in a test environment, but if you don't test the steps to get it onto production then you end up with the wrong software or configuration on the live system.
If you pay bottom rate you get poor people
Batch skills on z/OS are not exactly a popular skill nowadays so finding cheap people in India who will be able to manage roll backs and know how to restore corrupted files from backups on a TSO session won't be easy to come by
Agree completely on testing and always recommend to people that they have acceptance testing on a identical system before deployment to production
Thanks for the link
What about disaster recover and backups
software-RBS failure
RBS SW failure
I don't see how offshore is to be blamed. All the testing and rollout decisions are taken at the corporate level.
Tip of the ice berg
Why Blame India / Outsourcing ?
It's all too easy for errors to slip through
By Monday lunch time clients started calling in questions. By 3 PM we took the system off line. The final problem, when the program was compiled for production one uncompressed database definition was used in error. So the program was writing to the wrong places in the DB. Oops.
The problem was caused because I was a newbie who didn't know that "quirk" in the system, and my analyst was away. His replacement also didn't know that quirk.
Yes, management had the "one computer programmer is the same as another" frame of mind.
Same company, years later, a long term contractor was let go. No big deal. Except that she had over 10 years experience in our company. She was an excelent, knowlegable analyst. Why was she let go? Personality conflicts with her new project manager!
As long as we as consumers and shareholders allow management (and boredes of directors> to manage for quarterly profits and annual BONUSes they are paid we will continue to have this sort of mismanagement for short term profit at the expense of long term growth and stability.
Same company was on a "down sizing" jag. They cut a bunch of people. One "20-year" man who was pensioned off was a 1-man department. Not very visible, not significant to the day to day operations of the front line. He managed disk space usage on the mainframe. Space that we rented from an outsourcer.
Within a few months, we "ran out" of unused space and had to pay more than his salary to expand the storage space we were renting. Simply because Sr. management that was "optimizing" employees didn't know exactly what he did and that no one was trained to replace him. The significant amount of disk of space that he "reclaimed" in his daily work, by forcing people to delete unneeded files, no longer was freed up.
con't
Outsource management
Red Bottoms Shoes
http://www.discountchristianlouboutinblog.com
http://www.christianlouboutincheapc.com