Menu
Outage caused by single admin mortifies cloud provider Joyent

Outage caused by single admin mortifies cloud provider Joyent

Joyent is looking at how to improve software and operational procedures to prevent a reoccurrence

Cloud provider Joyent suffered an outage on Tuesday after an administrator was able to simultaneously reboot all virtual servers hosted in the company's US-East-1 data center.

"It should go without saying that we're mortified by this. While the immediate cause was operator error, there are broader systemic issues that allowed a fat finger to take down a datacenter," said Bryan Cantrill, CTO at Joyent, in a post on Hacker News.

The company first noticed something had gone wrong when it started seeing transient availability issues.

"Due to an operator error, all compute nodes in US-East-1 were simultaneously rebooted.  Some compute nodes are already back up, but due to very high load on the control plane, this is taking some time," Joyent said in an initial update on the issue.

About an hour later after first reporting the problem, the company said that all compute nodes and virtual machines were back online.

Joyent didn't say how many customers or servers were affected by the reboot. However, an error of this magnitude shouldn't be allowed to happen, and highlights the importance of processes that balance the need for effective management and protecting users against these kinds of issues.

"As soon as we reasonably can, we will be providing a full postmortem of this: how this was architecturally possible, what exactly happened, how the system recovered, and what improvements we are and will be making," Cantrill wrote.

The company is looking at how it can improve software and operational procedures to ensure that this doesn't happen in the future, and also how the recovery after a failure can be made smoother, according to Cantrill.

Just like any IT system, cloud-based services and servers can suffer from outages, but because the large number uses consequences are usually larger.

This week some Amazon Web Services users were hit by a power outage. Servers in one of the US-West-1 region's availability zones were affected, and it took almost three hours for Amazon to recover all instances. Amazon didn't elaborate on what caused the power failure.

Recently, Twitter also suffered an outage after a change to one of its core services went wrong, and HBO angered users of its Go service twice after it was overwhelmed by the number of people that wanted to watch the season premiere of "Game of Thrones" and the finale of "True Detective."

Send news tips and comments to mikael_ricknas@idg.com


Follow Us

Join the New Zealand Reseller News newsletter!

Error: Please check your email address.

Tags softwarecloud computinginternetJoyentsystem management

Featured

Slideshows

Sizing up the NZ security spectrum - Where's the channel sweet spot?

Sizing up the NZ security spectrum - Where's the channel sweet spot?

From new extortion schemes, outside threats and rising cyber attacks, the art of securing the enterprise has seldom been so complex or challenging. With distance no longer a viable defence, Kiwi businesses are fighting to stay ahead of the security curve. In total, 28 per cent of local businesses faced a cyber attack last year, with the number in New Zealand set to rise in 2017. Yet amidst the sensationalism, media headlines and ongoing high profile breaches, confusion floods the channel, as partners seek strategic methods to combat rising sophistication from attackers. In sizing up the security spectrum, this Reseller News roundtable - in association with F5 Networks, Kaspersky Lab, Tech Data, Sophos and SonicWall - assessed where the channel sweet spot is within the New Zealand channel. Photos by Maria Stefina.

Sizing up the NZ security spectrum - Where's the channel sweet spot?
Kiwi channel comes together for another round of After Hours

Kiwi channel comes together for another round of After Hours

The channel came together for another round of After Hours, with a bumper crowd of distributors, vendors and partners descending on The Jefferson in Auckland. Photos by Maria Stefina.​

Kiwi channel comes together for another round of After Hours
Consegna comes to town with AWS cloud offerings launch in Auckland

Consegna comes to town with AWS cloud offerings launch in Auckland

Emerging start-up Consegna has officially launched its cloud offerings in the New Zealand market, through a kick-off event held at Seafarers Building in Auckland.​ Founded in June 2016, the Auckland-based business is backed by AWS and supported by a global team of cloud specialists, leveraging global managed services partnerships with Rackspace locally.

Consegna comes to town with AWS cloud offerings launch in Auckland
Show Comments