After Amazon’s Stumble Anti-Cloud Rhetoric Is Expected, But Is It Warranted?
I like Phil Wainewright of ZDNet’s take on the Amazon incident last week. I like it partially because his post-chaos “seven lessons to learn from Amazon’s outage” reads oddly similar to my recent Transparency in the Cloud series. Mostly, though, I appreciate Phil’s approach because all of his lessons are targeted at the cloud services consumer.
Without a doubt, the blame for last week’s outage falls squarely on Amazon’s shoulders, but the responsibility to understand risk and adjust appropriately is in the hands of those that choose to use cloud-based services either to consume services, or to provide services to other companies.
One thing about centralizing services in the cloud is that it makes everything extremely visible. What was traditionally hidden behind private firewalls is now out in the open for everyone to see. I have a hunch that the aggregate downtime and subsequent hours spent by private IT departments on a regular basis far exceeds even the four days it took Amazon to clean up after this incident.
Which brings me to my point: There are two things that cost money during outages – 1) lost business, and 2) resolution efforts.
Lost business is a sunk cost, whether the outage is with your website hosting company, your cloud services provider, or inside your own private datacenter. The mere fact that you are in an outage situation means you are losing business. The difference is that in a private outage situation, you are also compounding the loss by shelling out the dollars to fix the problem. In this case, it was Amazon’s software (and/or hardware) that failed, let them foot the cleanup bill (and they happily will)!
Regarding the anti-cloud sentiment that seems to be pouring from mainstream outlets, I’ll borrow from CNN’s sensationalistic metaphor. Did we stop building ocean liners after the Titanic sank? Did everybody start crossing the Atlantic in tiny dinghies because they didn’t trust larger ships?
Smart people who specialized in the things that went wrong with the Titanic figured out how to make those things better and prevent disaster from happening again. Given that we all know there’s inherent value in cloud services, instead of throwing around anti-cloud sentiment, let’s all focus our energy on helping cloud providers like Amazon get better at it. We can do that by learning and applying our own lessons, and contributing our own technologies and talents.






The Playstation Network outage is another recent example to keep in mind. not only did Sony loose business and money, but the developers who rely on the service to sell their products are going to have a hard time convincing people to trust them again.
Being breached by an outside source is a more frighting proposition then simply having some servers go down… and the concentration of data in one place makes large network services like that very juicy targets for data theft.
Mistakes are the way we learn. If you were to make no mistakes you would learn nothing in life. Unfortunatly, all things in life are the same way, you make mistakes and you learn and grow. Often times the mistakes made lead to some amazing discoveries. Thanks for the post.
I was personally affected by the outage, and I took it as par for the course in these situations. New technology has some hiccups. Wainewright acknowledged the idea that the majority of customers won’t think less of Amazon for it. I know I don’t. It is unfortunate but doesn’t have to bring down a business. I like Symform’s take on cloud storage, offering storage for an exchange of excess space on local drives. That idea may really pan out for them.