Measurement vs. Perception

By | July 24, 2010, 12:15am PDT

Summary: Sometimes an IT performance problem has nothing to do with computers or applications - but the general rule holds: if you can’t measure it, you don’t understand it and you can’t manage it.

The old adage that you can’t manage what you don’t measure has an obvious corolary: to improve management, first improve measurement.

I was reminded of this at both the public policy and personal history levels this week - on the public policy level there was the widely heralded claim that this year, realistically the coldest in decades, was the hottest on record and on the personal level via a call from a former colleague facing an improbably complex performance problem.

The temperature claims turned out to be largely the result of cost cutting meeting a converging agenda: pilots need to know surface air temperatures near runways to gauge lift, and so the pressure to cut costs affecting various national weather services has led those agencies to preferentially reduce the number of reporting stations not on or near runways - thus significantly raising reported average temperatures.

The other problem was much harder to understand: basically the key servers are AIX (so no dTrace); the networking is Cisco (so largely unfathomable); there’s a large collection of mostly Wintel clients and the rack mount applications that go along with those (so no good performance measures there either); and, of course, there’s been no long term focus on performance that might have generated some kind of proxy measures.

He does have some records: the help desk software, helpfully chosen by some long gone predecessor to maximize positive reporting, offers a bit more than three years of “data”, the SAP error logs go back almost twelve years, and the AIX system logs go back to the latest machine replacement - about two years ago.

He’s held the job for about six months and from day one has heard other executives complain that things used to work but now don’t - a complaint that seems to refer mostly to a perceived significant increase in system response time over the last eighteen months or so but is rapidly becoming the aura through which the company sees him.

So the obvious question is, of course, “what changed?” but the less obvious, and as it turned out more important, question is “how do they know?”

He’d done the obvious stuff: looked for network bottlenecks, reviewed the DB logs, had people examine specific corporate PC applications (particularly AD and Exchange) for performance, upgraded a few racks, talked to both IBM and SAP about it, even going so far as to load and try to run some specific analytic software there guys recommended.

Great, except that he found a few minor problems but nothing sufficient to affect user perception - in fact what the logs really show is that the company now does about 10% less business, and thus fewer transactions, than it did during this quarter during 2008.

So what to do? First, ask the right question: if IT can’t objectively track performance and nobody has numbers on user response, how do the users know that performance has been degrading?

The answer turned out to be that the layoffs affected IT too - in particular there was an DB/AIX guy the users treated as their go-to guy for IT who left just before the last round of lay-offs when about a third of the “user facing” staff followed him out the door rather less voluntarily.

Thus users thought computer response had slowed when, in reality, what had happened was that a lot of the time saving face to face shortcuts around IT standards and procedures that had grown up during the earlier period of relative staffing stability in IT had disappeared - leaving users with the perception that IT had become less responsive and user management complaining about overall system response in terms the IT people heard as complaints about computer and application response.

Basically what happened to my friend was that his predecessor had laid off the wrong people to leave only those most likely to work to rule in place - a classic case, apparently, of retaining in his own image.

I suggested he sit down with the other executives to discuss the issue while launching both cross training and a replacement program aimed at replacing the worst of his deadwood with more motivated people - but the general lesson here for the rest of us is the same as that from the hottest summer nonsense: if you hope to manage well next year, you’d better measure, and measure the right things, this year.

Kick off your day with ZDNet's daily e-mail newsletter. It's the freshest tech news and opinion, served hot. Get it.

Topics

Paul Murphy (a pseudonym) is an IT consultant specializing in Unix and related technologies.

Disclosure

Paul Murphy

I do not work for, or otherwise receive anything from, any of the companies I write about. I have some money in a number of funds that bet on the markets, including the technology market, but have no direct control over how these funds are administered or what investments are made. I use Sun and Apple technology both at home and at work.

Biography

Paul Murphy

Originally a Math/Physics graduate who couldn't cut it in his own field, Paul Murphy (a pseudonym) became an IT consultant specializing in Unix and related technologies after a stint working for a DARPA contractor programming in Fortran and APL. Since then he's worked in both systems management and consulting for a range of employers including KPMG, the government of Alberta, and his own firm. In those roles he's "been there and done that" for just about every aspect of systems management and operation.

8
Comments

Join the conversation!

Just In

I read a great article
Roger Ramjet 5th Aug 2010
@Peter Perry someone did a study on confidence and ability. Turns out the the people with the least ability have the highest confidence. If you never question your ability - YOU'RE FIRED!
0 Votes
+ -
The missing piece you couldn't measure - until now
francis.carden@... 24th Jul 2010
You hit the nail on the head. So much of the above would have been solved if you could have added the ability to track the usage and performance of the 100's's of applications in use on the enterprise desktops. OpenSpan.com has introduced it's User Process Monitoring to do just that. Whether you measure how many text fields are changed in a given app (win/web/legacy) for a given transaction or how long the response time is (really) on the user desktop for each app, you now have visibility 24x7x365 for comparisons and validation of what's going on - at the coal face. The final level of visibility is key to the success of you BI strategy.

OpenSpan's injection technology (http://gurl.im/ace1dH ) allows any app running on the windows desktop to start emitting events of what it and it's users are doing.

Your opening;

"The old adage that you can?t manage what you don?t measure has an obvious corolary: to improve management, first improve measurement".

compelled me to write happy
0 Votes
+ -
You're not measuring perception
Roger Ramjet 5th Aug 2010
@francis.carden@... and that is the key area. PERCEPTION of performance is an abstraction away from the nuts and bolts of metrics. This may be a useful app, but it doesn't (single-handedly) solve the overall issue.
0 Votes
+ -
Perhaps that isn't exactly why this is an issue...
Peter Perry Updated - 24th Jul 2010
I think you're right in some ways but a large portion of this country is Good Ole Boy Networks which force the use of Unions to eliminate their bias towards their friends...

In other words, sometimes these companies are keeping dead wood because they're a friend to the manager and they're almost never the most talented people on the team... In fact, it is usually just the opposite, they're the least talented people on the team and they know their only hope for being retained is Brown Nosing.

Basically, until somebody starts holding these managers accountable then this really won't ever change.
0 Votes
+ -
I read a great article
Roger Ramjet 5th Aug 2010
@Peter Perry someone did a study on confidence and ability. Turns out the the people with the least ability have the highest confidence. If you never question your ability - YOU'RE FIRED!
0 Votes
+ -
Do you mean like lines of code???
wackoae 24th Jul 2010
I love it when management wants to measure efficiency by the number of lines of code in a project. To some of them, the more the lines of code an application has, the more efficiently time was used. When in fact most of the time, the contrary is more correct.

The more lines of code in small amount of time usually mean A HACK, something that was created on the fly, wasn't design correctly and most likely is poorly written. A person who writes 500 lines of code in a day, is not better than a person that writes 50 lines of code in the same period of time and provide the same exact functionality. One spend the day hacking out the functionality, while the other spent (most or some of) the same time figuring out how to solve the problem efficiently.
0 Votes
+ -
No, but yes I Agree
murph_z 24th Jul 2010
@wackoae
No: this isn't a development shop, lines of code per day was never a metric for them.

yes: lines of code per day was/is about as dumb a metric as I've seen. Function points, provided you can define them, are much better happy
0 Votes
+ -
RE: Measurement vs. Perception
scotth_z 24th Jul 2010
1st..Measurement is only half the story. Getting SLA agreements is the real work in the measurement vs. perception debate.

2nd...While Murph wants to focus on application performance, that is only one measurement... Good shops also measure availability and network performance. SLAs put good PROFESSIONAL pressure on the IT staff to do the work that they are paid to do. IT is more than writing good code and walking away.

3) While the IT'er may look at his server as 'HIS/HER', it really isn't. Someone else is paying to have that server in place to do work... and usually it is the same someone who is paying someone to use that machine or getting money from someone to use that machine.

If you have 1000 end users, each getting paid $15/hour(we are not talking about a big system)... and you the IT'er decide that you will reboot/IPL the machine in the middle of the day because it will 'only take an hour' to put in a change... YOU just made a $15K decision... not counting any impact to the business that SOMEONE else has to pay for. If you have 3 or 4 of these outages per year.. you are talking about another techie's salary. Businesses who are left in the dark on these 'decisions' are the ones who get ANGRY at the IT department. $60K + your salary could get another techie in place, who they could work with a bit better.
This is where change control comes into play and where the professionalism comes into play. Informing the business units removes any ANGER from the situation, after all they are the ones paying for the machine and either directly or indirectly paying your salary. If the business units dictate that you can't take a server down in this 24x7x365 world, during the business week, then weekend work is part of your business. Change control is as much for the business units to minimize impact, but it is also there to protect the techie from the political fallout.

Understanding this from either a techies or a techies manager point of view, goes a long way between the ITer being a part of the business or being just a cost.
0 Votes
+ -
while i don't want to appear..
thx-1138_@... 30th Jul 2010
@scotth_z .. as splitting hairs, to quote you:

"...Change control is as much for the business units to minimize impact, ..."

Although I have to agree with most of what you've posted, I'm going to have to disagree with you on this point (albeit minor and a little pedantic on my part). Any minimizing of impact falls, more accurately, under the field of risk management. While you are technically correct in your assertion that change control is factorial in the process (holistically speaking), the primary function has to be risk management.

Having said all that, I agree with you wholeheartedly that any situation that can minimize operational impacts - in regards I.T/I.S. management - can only be good for the I.T. personnel involved in the long run.

Regards

Join the conversation!

Formatting +
BB Codes - Note: HTML is not supported in forums
  • [b] Bold [/b]
  • [i] Italic [/i]
  • [u] Underline [/u]
  • [s] Strikethrough [/s]
  • [q] "Quote" [/q]
  • [ol][*] 1. Ordered List [/ol]
  • [ul][*] · Unordered List [/ul]
  • [pre] Preformat [/pre]
  • [quote] "Blockquote" [/quote]
ie8 fix

The best of ZDNet, delivered

ZDNet Newsletters

Get the best of ZDNet delivered straight to your inbox

Facebook Activity

White Papers, Webcasts, & Resources
ie8 fix