|
Skep's Pick: The IT Skeptic Awards for 2008 This link is here because...(hover) Shit happens, or how I learned to love the incident
Blog entry submitted by skeptic
on Mon, 2010-02-08 09:43. [nid:1799] Complex systems are by definition broken. They will always break and sometimes they will break when everybody did what they are supposed to. Fixing the problem won't necessarily reduce the risk of another incident.
I wrote recently about how the paper How Complex Systems Fail rocked me. Word for word, it is one of the most powerful bits of theory impacting service management that I have ever seen. As a fitting complement to that paper, read Malcolm Gladwell's "Blowup". To summarise the main message I took from both: Someone said on LinkedIn recently that problem management is about making incident management obsolete. Not only is such thinking wrong but it is also dangerous. We can't stop incidents, including major ones. And reducing the risk of a major incident recurrence through fixing a problem does not necessarily reduce the odds of a major incident happening again. We can't ever significantly reduce our need for support. We need to be ready and rehearsed to deal with major incidents because they come like earthquakes. And when it is over there may well be nobody to blame. This seems a reversal of some things I have said in the past about the need for change control. I said that "shit happens" is not an excuse any more. I still believe that. Just because some incidents will remain unpreventable doesn't mean that many others can't be prevented. Just because fixing a problem in one place means higher risks will be taken elsewhere doesn't mean we shouldn't fix the problems. And just because complex systems are impossible to stop breaking doesn't mean that there isn't negligence behind some breakages. We need to be more embracing of incidents as part of normal operations, not as aberrations that can be eliminated. Incidents aren't deviations from some idyllic norm: they are the norm. Buy your books here to support this blog: |
Blog





















Got a tricky question about ITIL?
Made in New Zealand 
Comments
Shit happens!
Skep, if the term "Shit happens!" makes it into ITIL, can I claim the first use from 11 July 2008? http://thinkingproblemmanagement.blogspot.com/2008/07/shit-happens.html