Northeast blackout software failure modes

The northeast blackout of 2003 was a widespread power outage that occurred throughout parts of the northeastern and midwestern united states and the. One of them was buried in a massive piece of software compiled from. The task force responsible for investigating the cause of the aug. By thinkreliability staff on august 14, 2003, over 50 million people in the u. Software failure synonyms, software failure antonyms. The safety relay was set to trip if other protective equipment deeper within the ontario hydro system failed to operate properly.

Its the system, not a shortage of power plants that is the problem. Ten years to the day after the great northeast blackout of 2003 caused new york city to go dark. Aug 14, 20 ten years ago today, large portions of the midwest and northeast united states and into canada went dark. Et, 21 power plants shut down in just three minutes.

According to the official analysis of the blackout prepared by the us and. Many past blackouts, such as northeast blackouts in 1965 and 2003, new york city blackout in 1977, two wecc blackouts in 1996, were caused by cascading failures 5, 6table 1 lists some of the. Pataki reluctantly recalled one of the two major blackouts of the last 40 years in the northeast the 1965 power failure, which left an 80,000squaremile stretch of the united states and. The northeast blackout of 1965 was a significant disruption in the supply of electricity on tuesday, november 9, 1965, affecting parts of ontario in canada and connecticut, massachusetts, new hampshire, new jersey, new york, rhode island, pennsylvania, and vermont in the united states. For more public eye satellite imagery, see the picture of the week gallery of satellite and other imagery of places in the news. Northeast blackout of 2003 your expert root cause analysis. A collection of wellknown software failures software systems are pervasive in all aspects of society. I bring up the usual examples three mile island, challenger, and the hyatt regency walkway collapse but id like to use an example or two thats more specific to ee rather than a mechanical general design problem. A narration of the events that lead to the great northeast blackout on august 14th, 2003. It sent a surge of power south, causing power overloads and automatic system shutdowns. I was the first journalist to report on a bug in a ge energy management system that resulted in an alarm system failure at firstenergys.

The cause of the failure was the setting of a protective relay on one of the transmission lines from the sir adam beck hydroelectric power station no. In this page, i collect a list of wellknown software failures. Final report on the august 14, 2003 blackout in the united. System blackout causes and cures by damir novosel 106.

The startup circuits at most of the power plants were designed to be powered from the functioning power grid. Transmission operations and maintenance conference instructed by. A study of the impact of peak demand on increasing. The multiple failures make the problem worse and worse, and a large area ends up in the dark.

Software bug contributed to blackout securityfocus. As a matter of fact, programming bugs can irritate, however, the defective programming can likewise be costly, humiliating, ruinous and savage. Ultimately, this blinded miso to the real time failures the grid was experiencing. A major outage knocked out power across the eastern united states and parts of canada on august 14, 2003. When overloaded transmission lines hit untrimmed trees, the alarm did not sound to warn maintenance workers. Information primarily taken from nercs technical analysis of the system collapse. Taking after are 6 famous software disasters in as beneath. Software failure failure modes in medical device software. The blackouts proximate cause was a software bug in the alarm system at the control room of firstenergy, an akron, ohiobased company, which rendered operators unaware of the need to redistribute load after overloaded transmission lines drooped into foliage. The northeast blackout of 2003 in august 2003, a massive power outage occurred in northeastern us and eastern canada. The blackout was a telling example of how vulnerable modern infrastructure can be to disruption and a reminder that our sophisticated failure prevention systems are only as effective as the.

The august 14 blackout affected the northeastern portion of the eastern. A previouslyunknown software flaw in a widelydeployed general electric energy management system contributed to the devastating scope of the august 14th northeastern u. Even if the tripped lines and firstenergys failure to detect and report them prove to be the exclusive initiating cause of the 2003 blackout, it still will be necessary to account for the failure. The reason it took so long to get everything back up again is.

Todd myers, technical advisor, epri transmission research, epri tim mcjunkin, power systems engineer project manager, idaho national laboratory energy systems lab tim vice, transmission maintenance analyst, westar energy eileen lockhart, uas program lead, xcel energy. A software bug stalled firstenergys alarm system for more than an hour, leaving. The beckwith electric center for learning offers a variety of classroom seminars covering diverse and complex subjects relating to the electric power field, ranging from industry theory to product specific applications and operation. Most did not get their power back until two days later.

This loss of electricity was the largest blackout in north american history. Technical analysis of the august 14, 2003, blackout nerc. Software bug contributed to blackout kevin poulsen, securityfocus 20040211. The blackout that exposed the flaws in the grid the new. Investigators found several factorsalarm system issues, software bugs, and an overloaded. As the august 14th blackout began, a flaw buried in widelydeployed. Fifteen years ago, a software bug resulted in over 50 million people losing. The northeast blackout of 2003 was a widespread power outage that occurred throughout parts of the northeastern and midwestern united states and the canadian province of ontario on thursday. Canada power system outage task force g august 14th blackout. View notes lecture 0 from computer s etc at university of malaysia, sarawak. Cascading blackouts are, by their nature, highly correlated events. We examined the public health effects of the northeast blackout of august 2003 and the emergency response to the blackout by the new york city department of health and mental hygiene dohmh. The failures occurred when multiple systems trying to access the same information at once got the equivalent of busy signals, he said.

From electronic voting to online shopping, a significant part of our daily life is mediated by software. New york news coverage of the northeast blackout of 2003. Final report on the august 14th blackout in the united states and canada. In addition to status monitoring, they also incorporate sophisticated models to predict the proper response to fault conditions. System blackout causes and cures by damir novosel 1062003 northeast blackout energy pulse energy central media coverage index library geni global energy network institute. The report makes clear that this blackout could have been prevented and that immediate actions must be taken in both the united states and canada to ensure that our electric system is more reliable. The massive blackout prompts some companies to turn to providers of disaster recovery services, but it operations at. Breakthrough computer system reliability and nuclear war. As has been taught in ce370, engineering disasters may often be underestimated due to the fact that the different components of the failure seem initially to be orthogonal and thus, uncorrelated. Software failure cited in august blackout investigation computerworld.

I will start with a study of economic cost of software bugs. Part one of a twopart series on the northeast blackout of 2003. Last saturdays power outage in new york city affected fewer than 100,000 people in a city of 8. Cause, software bug in the alarm system in the control room of firstenergy. This causes a transient flashover event as current carried by the line is. Computer system reliability and nuclear war alan borning. Take a look at the major power outages over many years, and see that the problems we face are not because we arent building enough power plants. Feb 11, 2004 turns out after all the various speculation on reasons, the big northeast blackout from last summer was the result of a software bug that was only just discovered and not as many had speculated. Great northeast power blackout of 2003 satellite imagery. Notice any opinions, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect the views of the applied technology council atc, the department of homeland security. The whole system may have gone down for the reason given in. Probabilistic risk assessment is a methodology to assess the probability of failure or success of a mission. Every so often i teach an electrical engineering class, and towards the end we discuss engineering ethics. The 2003 northeast blackout five years later tougher regulatory measures are in place, but were still a long way from a smart power grid by jr minkel on august, 2008.

Nov 06, 2011 the blackout was determined to have been caused by a faulty relay at sir adam beck station on the ontario side of niagara falls. Bornings work with computer professionals for social responsibility resulted in a definitive analysis of the role of computer failure in accidental nuclear war. The cascading event, which started shortly after 4. Blackout threat unmitigated a decade after the northeast. In nearly every major blackout, the situation is the same. Software failure cited in august blackout investigation. A software bug at firstenergy corporation in ohio caused this power outage.

A softwarerelated failure mode taxonomy bin li,1 ming li,1 ken chen,2 and carol smidts1. The 2003 northeast blackoutfive years later scientific american. What should have been a manageable local blackout cascaded into the collapse of the entire northeast region. A case study of the 2003 north american power outage key questions what were the main vulnerabilities and threats related to the electricity subsector of the energy sector at the time of the blackout. The event started at the ontario new york border, near niagara falls. The northeast blackout of 2003 was a widespread power outage throughout parts of the. Power system reliability analysis considering protection failures.

One piece of the system fails, and then the pieces near it cannot handle the increased load caused by the failure, so they fail. Prevention remains difficult without methods to calculate the risk of cascading power failures. Lecture 0 software failure failure modes in medical device. More than 35 million people were hit by the northeast blackout in 1965, one of the largest outages in north american history. The northeastern blackout 2003 the blackout report. The great northeast blackout of november 9, 1965 began at 5. The cause of the power failure is explained in depth by the suburban emergency management project. The northeast blackout of 2003 was a widespread power outage throughout parts of the northeastern and midwestern united states, and the canadian province of ontario on august 1428, 2003, beginning just after 4. The northeast blackout struck 15 years ago today extremetech. Fifty million people were affected, including residents of new york, cleveland and detroit, as well as toronto and ottawa, canada. Similarly, firstenergy fe had a number of software failures in their energy.

The blackout that occurred on august 14, 2003 in the midwest regions of the united states and the province of ontario in canada was the worst in u. The 2003 northeast blackoutfive years later scientific. Is our power grid more reliable one year after the blackout. Northeast blackout caused by a software bug techdirt. The software should have given one system precedent. After an extensive analysis, we submitted a report to the task force that. What will be key challenges for the resilience of the electricity subsector in the future. Results provided by the risk assessment methodology are used to make decisions. Ten years later, we are still grappling with concerns over the vulnerability of the power grid. Associate professor of computer science, university of washington, seattle.