A Need for a Honest Look at How We Do Incident Management

Compared with other fields ours is still young and we haven’t figured out all the things just yet. The natural tight connection between academics, open source software and the improvements we’ve already seen can make it easy to think we’re already doing all of the hard work. All of which has been on specific technical challenges and very little on how we as an industry should improve how we work.

Consider the difference between how much attention we place on data we collect from the servers and services we support compared with what we have available for our entire field. We love dashboards and metrics to the point that they’re used to drive businesses and/or teams. Why haven’t we done the same thing at the macro level to help improve and guide our profession?

Our Resistance to Change has Sent Us the Long Way Round to Where We’re Going

Eight years after Mark Mayo wrote about how cloud computing would change our profession pretty much everything still applies and people are still trying to adapt to the changes he highlighted. The post is primarily about the changes that individual systems administrators would be facing, but it also describes how the way we work would need to adapt.

The problem is that on the operations and systems side of web development our overall reaction to this was slow and I think we confused what the actual goal should be with something that’s was only temporary.

What I'm Wishing For in 2016

Five Fucks to Give

First thing is that I’m only going to have five fucks to give in 2016. One for my wife Lane, three for our pets Chai, Hanna and Olive and the fifth for myself. This means keeping a low tolerance on the amount of stress and nonsense we’re allowing into our lives.

The Important, the Urgent and remembering it’s the people that Matter

Large scale outages at major service providers isn’t something new. In 2007 both Rackspace and the 365 Main data center in San Francisco suffered major outages that took down part of what made up Web 2.0. Amazon Web Services had major outages in 2011, 2013, and again last week each causing outages for several services.

The Concepts of John Boyd are much more than just the OODA Loop

Ever since John Boyd’s OODA Loop was introduced to the web community at Velocity, we’ve had an oversimplified and incomplete view of his ideas. One that has been reinforced by reusing a version of the same basic diagram that he never drew.