Menu
AWS glitch strikes Netflix and Tinder, offering a wake-up call for others

AWS glitch strikes Netflix and Tinder, offering a wake-up call for others

'Design your cloud apps for failure," one analyst urges

Netflix, Tinder and other major websites were affected for a time Sunday by glitches in Amazon Web Services' Northern Virginia facility, offering a cautionary lesson to other companies that rely on the cloud service for mission-critical capabilities.

The problem manifested itself primarily in the form of higher-than-normal error rates. Sites affected reportedly also included IMDb and Amazon's Instant Video and Books websites.

At the heart of the snafu were issues with AWS's DynamoDB database, but it spread to include other services such as EC2, the mobile-focused Cognito service and the CloudWatch monitoring service, according to the AWS Service Health Dashboard.

"The root cause began with a portion of our metadata service within DynamoDB," AWS explained in a dashboard update posted at 4:52 a.m. PDT on Sunday. "This is an internal sub-service which manages table and partition information. Our recovery efforts are now focused on restoring metadata operations. We will be throttling APIs as we work on recovery."

After beginning at 3:00 a.m. PDT Sunday, the DynamoDB issues were fixed by 9:12 a.m. All the other services were restored by noon.

AWS declined to comment for this story.

"This really shouldn’t be happening," said Rob Enderle, principal analyst with Enderle Group. "A service that is sold for mission-critical systems should have massive redundancies, and there should be isolation between different customers' implementations so a failure on one shouldn’t bring everyone down."

If similar incidents happen in the future, AWS could start to lose customers, Enderle said.

It's "a cautionary tale for any AWS customer," he said. "In the end, Amazon does not have adequate failover protection, which means its customers need to make sure they do."

Netflix, in fact, apparently experienced minimal disruption because of its own redundancy approach.

"We were able to quickly redirect traffic from the impacted AWS region to one that was fully operational," the company said via email.

Other Amazon customers running mission-critical systems on AWS would do well to emulate Netflix's approach, Enderle suggested.

In the meantime, the event could benefit IBM, which "has a far more robust offering in SoftLayer," as well as firms such as BMC that incorporate AWS and have strong failover capability, Enderle said.

Of course, virtually any outage is a significant one for a cloud provider given the heavy emphasis customers place on uptime, said Stephen O'Grady, co-founder and principal analyst with RedMonk.

"Undoubtedly AWS will be having 'less than fun' with customers today," he said.

That said, however, "all providers have outages," O'Grady noted, "and thus far they have not appeared to have any lasting impact on the trajectory of businesses like Amazon's."

Indeed, "the fix was applied quickly, AWS owned it and recovery started almost immediately," agreed Dave Bartoletti, a principal analyst with Forrester. "In my experience, AWS can handle one or two of these a year without significantly scaring customers."

More than anything, he added, "it's a wake-up call to design your cloud apps for failure."

Follow Us

Join the New Zealand Reseller News newsletter!

Error: Please check your email address.

Tags amazon.com

Featured

Slideshows

Examining the changing job scene in the Kiwi channel

Examining the changing job scene in the Kiwi channel

Typically, the New Year brings new opportunities for personnel within the Kiwi channel. 2017 started no differently, with a host of appointments, departures and reshuffles across vendor, distributor and reseller businesses. As a result, the job scene across New Zealand has changed - here’s a run down of who is working where in the year ahead…

Examining the changing job scene in the Kiwi channel
​What are the top 10 tech trends for New Zealand in 2017?

​What are the top 10 tech trends for New Zealand in 2017?

Digital Transformation (DX) has been a critical topic for business over the last few years and IDC is now predicting a step change as DX reaches macroeconomic levels. By 2020 a DX economy will emerge and it will become the core of what New Zealand industries focus on. From the board level through to the C-Suite, Kiwi organisations must be prepared to think and act digital when the DX economy emerges in 2017.

​What are the top 10 tech trends for New Zealand in 2017?
Top 15 Kiwi tech storylines to follow in 2017

Top 15 Kiwi tech storylines to follow in 2017

​The New Year brings the usual new round of humdrum technology predictions, glaringly general, unashamedly safe and perpetually predictable. But while the industry no longer sees value in “cloud is now the norm” type projections, value can be found in following developments of the year previous, analysing behaviours and patterns to formulate a plan for the 12 months ahead. Consequently, here’s the top Kiwi tech storylines to follow in 2017...

Top 15 Kiwi tech storylines to follow in 2017
Show Comments