Incident: 911 System Outages Due to Malicious Twitter Cyberattack

Published Date: 2017-04-01

Postmortem Analysis
Timeline 1. The software failure incident involving a malicious Twitter post causing cellphones to repeatedly call 911 occurred in October [50444]. **Estimation:** Step 1: The article mentions the incident happened in October. Step 2: The article was published on 2017-04-01. Step 3: The incident occurred in October of the previous year, which would be October 2016.
System 1. Antiquated 911 emergency-response systems 2. Faulty phone software 3. Aging 911 systems 4. Internet-based technology for emergency call centers 5. Next-generation Internet-based technology 6. Washington state's 911 service based on Internet connectivity
Responsible Organization 1. Malicious hackers who programmed multiple phones to crash emergency networks by sharing a malicious Twitter post targeting faulty phone software [50444].
Impacted Organization 1. AT&T Wireless customers nationwide [50444] 2. Callers in Dallas, particularly T-Mobile customers [50444] 3. Emergency call centers in at least a dozen states from California to Florida [50444]
Software Causes 1. A service issue with AT&T Wireless customers nationwide preventing them from dialing 911 [50444]. 2. Malicious Twitter post with a link targeting faulty phone software causing cellphones to repeatedly call 911 in cities across the country, which was the largest cyberattack on the emergency-response system [50444]. 3. Aging 911 systems being vulnerable to malicious hackers who deliberately program multiple phones to crash emergency networks [50444].
Non-software Causes 1. A shortage of people to handle calls in Dallas [Article 50444] 2. Calls from T-Mobile customers overwhelming the system in Dallas [Article 50444]
Impacts 1. AT&T Wireless customers nationwide couldn't dial 911, prompting local emergency officials in several states to advise calling an alternate number or texting authorities in case of emergency [50444]. 2. In Dallas, callers were unable to reach 911 during spikes in calls, leading to hundreds of people being put on hold due to a combination of calls from T-Mobile customers and a shortage of staff to handle calls [50444]. 3. A malicious Twitter post with a link targeting faulty phone software caused cellphones to repeatedly call 911 in cities across the country, overwhelming emergency call centers in at least a dozen states for 12 hours over two days [50444].
Preventions 1. Upgrading to the newest "next-generation" Internet-based technology that uses digital routing instead of old-fashioned phone lines with switches could have prevented the software failure incident [50444]. 2. Implementing defensive tools in call centers to automatically flag and divert calls thought to be malicious or repetitive could have helped prevent the incident [50444]. 3. Putting safeguards in place on iPhones to prevent similar incidents, as mentioned by Apple, could have also prevented the software failure incident [50444].
Fixes 1. Upgrading to the newest "next-generation" Internet-based technology that uses digital routing instead of old-fashioned phone lines with switches [Article 50444]. 2. Implementing Internet-based systems that are better capable of handling cellphone traffic and are more secure against malicious misuse [Article 50444]. 3. Utilizing defensive tools in call centers to automatically flag and divert calls thought to be malicious or repetitive, and detect their legitimacy using techniques like requiring clicks or voice commands [Article 50444].
References 1. Federal Communications Commission 2. City officials in Dallas 3. Wall Street Journal 4. Ben-Gurion University's Cyber Security Research Center in Israel 5. Apple 6. National Emergency Number Association 7. Monica Million, operations manager at the Grand Junction Regional Communication Center in Colorado 8. U.S. Department of Transportation 9. U.S. Sen. Bill Nelson (D-Fla.) 10. Steve Souder, former director of public safety communications in Fairfax County 11. Andy Leneweaver, Washington state's deputy state 911 coordinator

Software Taxonomy of Faults

Category Option Rationale
Recurring one_organization, multiple_organization The software failure incident related to the 911 system disruptions has happened again at multiple organizations. The incident involved a malicious Twitter post with a link targeting faulty phone software that caused cellphones to repeatedly call 911 in cities across the country, which was reported as the largest cyberattack on the country's emergency-response system [Article 50444]. This incident demonstrated how aging 911 systems are vulnerable to malicious hackers who may deliberately program multiple phones to crash emergency networks. Additionally, the incident highlighted the need for states and localities to switch to the newest "next-generation" Internet-based technology to handle cellphone traffic more effectively and prevent such attacks.
Phase (Design/Operation) design, operation (a) The software failure incident related to the design phase can be seen in the article where a malicious Twitter post with a link targeting faulty phone software caused people's cellphones to repeatedly call 911, overwhelming emergency call centers in at least a dozen states [50444]. This incident was a result of a flaw in the design of the phone software that allowed for such an attack to occur. (b) The software failure incident related to the operation phase is evident in the article where in Dallas, callers were unable to reach 911 during spikes in calls that put hundreds of people on hold. City officials blamed a combination of calls from T-Mobile customers and a shortage of people to handle calls [50444]. This failure was due to contributing factors introduced by the operation or misuse of the system, highlighting the challenges faced during the operation of the emergency response system.
Boundary (Internal/External) within_system, outside_system (a) within_system: The software failure incident related to the 911 emergency-response systems being disrupted was primarily due to factors originating from within the system. The incidents were caused by a combination of issues such as spikes in calls, shortage of staff to handle calls, and a feature in T-Mobile phones that repeatedly called 911 if the initial call failed [50444]. Additionally, a malicious Twitter post with a link targeting faulty phone software caused cellphones to repeatedly call 911, overwhelming emergency call centers in multiple states [50444]. These issues highlight vulnerabilities within the existing 911 systems that need to be addressed through system upgrades and improved technology to prevent such failures in the future.
Nature (Human/Non-human) non-human_actions, human_actions (a) The software failure incident occurring due to non-human actions: The incident where a malicious Twitter post with a link targeting faulty phone software caused cellphones to repeatedly call 911 was a result of a cyberattack on the country's emergency-response system. This attack overwhelmed emergency call centers in at least a dozen states and was considered the largest cyberattack on the emergency system [50444]. (b) The software failure incident occurring due to human actions: In Dallas, callers were unable to reach 911 due to spikes in calls that put hundreds of people on hold. City officials attributed this issue to a combination of calls from T-Mobile customers and a shortage of staff to handle the calls. To address the problem, officials dedicated $2 million to upgrades and increased staffing, and they requested T-Mobile to disable a feature that calls 911 repeatedly if an initial call fails to go through [50444].
Dimension (Hardware/Software) hardware, software (a) The software failure incident occurring due to hardware: - The incident where a malicious Twitter post with a link targeting faulty phone software caused cellphones to repeatedly call 911 was a result of a cyberattack on the country's emergency-response system, which overwhelmed emergency call centers in at least a dozen states [50444]. - Aging 911 systems were shown to be vulnerable to malicious hackers who could deliberately program multiple phones to crash emergency networks, either by infecting phones with malware or by buying a few thousand phones [50444]. (b) The software failure incident occurring due to software: - The incident where a malicious Twitter post caused cellphones to repeatedly call 911 was attributed to faulty phone software, indicating a software-related issue [50444]. - The need for states and localities to switch to next-generation Internet-based technology that uses digital routing instead of old-fashioned phone lines with switches highlights the importance of upgrading software systems to handle cellphone traffic more effectively [50444].
Objective (Malicious/Non-malicious) malicious, non-malicious (a) The software failure incident related to the malicious objective was the cyberattack on the country's emergency-response system caused by a malicious Twitter post with a link targeting faulty phone software. This incident led to cellphones repeatedly calling 911 in cities across the country, overwhelming emergency call centers in at least a dozen states [50444]. (b) The software failure incidents related to non-malicious objectives included disruptions in antiquated 911 emergency-response systems due to service issues, spikes in calls overwhelming call centers, and a shortage of staff to handle calls. These incidents were not intentional attacks but rather resulted from system limitations and lack of resources [50444].
Intent (Poor/Accidental Decisions) poor_decisions (a) The software failure incident related to the 911 emergency-response system disruptions was primarily due to poor decisions made by individuals. For example, in one incident, a malicious Twitter post with a link targeting faulty phone software caused cellphones to repeatedly call 911, overwhelming emergency call centers in multiple states [50444]. This incident was described as the largest cyberattack on the country's emergency-response system, and it was later revealed that a Washington state teen shared the link as a prank, leading to the disruption. Additionally, the incident highlighted the vulnerability of aging 911 systems to malicious hackers who can deliberately program multiple phones to crash emergency networks, either through malware or by manipulating phones to make repeated calls [50444]. The lack of proactive measures and the failure to anticipate such attacks demonstrate poor decisions that contributed to the software failure incident.
Capability (Incompetence/Accidental) development_incompetence, accidental (a) The software failure incident related to development incompetence is evident in the article as it mentions how the 911 systems were vulnerable to malicious hackers who could deliberately program multiple phones to crash emergency networks. This vulnerability was highlighted in a 2016 paper on U.S. call center security by Ben-Gurion University's Cyber Security Research Center in Israel [50444]. Additionally, the incident where a Twitter post with a link targeting faulty phone software caused cellphones to repeatedly call 911 across the country demonstrates the impact of development incompetence in creating software vulnerabilities that can be exploited for cyberattacks. (b) The software failure incident related to accidental factors is seen in the article when it mentions the October cyberattack where a Washington state teen shared a malicious Twitter post as a prank, causing cellphones to repeatedly call 911 in cities across the country [50444]. This incident was not intentional but rather accidental, resulting in a significant disruption to the emergency-response system.
Duration temporary The software failure incident related to the 911 emergency-response system disruptions reported in the articles was temporary. The incidents mentioned in the articles, such as the inability to reach 911 during spikes in calls, the malicious Twitter post causing cellphones to repeatedly call 911, and the issues faced by call centers in various states, were all temporary disruptions caused by specific circumstances like high call volumes, cyberattacks, and software vulnerabilities [50444].
Behaviour crash, omission, other (a) crash: The software failure incident mentioned in the articles involved a crash where people's cellphones repeatedly called 911 due to a malicious Twitter post with a link targeting faulty phone software, causing emergency call centers in at least a dozen states to be overwhelmed for 12 hours over two days [50444]. (b) omission: The incident also involved an omission where callers in Dallas were unable to reach 911 during spikes in calls, leading to hundreds of people being put on hold. This was attributed to a combination of calls from T-Mobile customers and a shortage of staff to handle the calls [50444]. (c) timing: There is no specific mention of a timing-related failure in the software incident described in the articles. (d) value: The incident did not involve a failure due to the system performing its intended functions incorrectly. (e) byzantine: The incident did not involve a byzantine failure where the system behaves erroneously with inconsistent responses and interactions. (f) other: The incident also highlighted the vulnerability of aging 911 systems to malicious hackers who may deliberately program multiple phones to crash emergency networks, either by infecting phones with malware or by buying a few thousand phones [50444].

IoT System Layer

Layer Option Rationale
Perception None None
Communication None None
Application None None

Other Details

Category Option Rationale
Consequence delay, theoretical_consequence (a) death: People lost their lives due to the software failure - The article mentions that a malicious Twitter post targeting faulty phone software caused cellphones to repeatedly call 911, resulting in emergency call centers being overwhelmed for 12 hours over two days. This incident was described as the largest cyberattack on the country's emergency-response system, and it led to the arrest of a Washington state teen who shared the link as a prank [50444]. (b) harm: People were physically harmed due to the software failure - The articles do not specifically mention physical harm to individuals due to the software failure. (c) basic: People's access to food or shelter was impacted because of the software failure - The articles do not mention any impact on people's access to food or shelter due to the software failure. (d) property: People's material goods, money, or data was impacted due to the software failure - The articles do not mention any direct impact on people's material goods, money, or data due to the software failure. (e) delay: People had to postpone an activity due to the software failure - The articles mention that in Dallas, callers were unable to reach 911 during spikes in calls, which put hundreds of people on hold. This situation led to delays in accessing emergency services [50444]. (f) non-human: Non-human entities were impacted due to the software failure - The articles do not mention any impact on non-human entities due to the software failure. (g) no_consequence: There were no real observed consequences of the software failure - The articles clearly describe various consequences of the software failure incidents, including overwhelming emergency call centers, arrests, financial implications for upgrades, and the need for new technology. (h) theoretical_consequence: There were potential consequences discussed of the software failure that did not occur - The articles discuss the potential consequences of aging 911 systems being vulnerable to malicious hackers, the need for states and localities to switch to newer technology, and the financial challenges associated with upgrading systems. These are discussed as potential consequences that could occur if the necessary actions are not taken. (i) other: Was there consequence(s) of the software failure not described in the (a to h) options? What is the other consequence(s)? - The articles primarily focus on the consequences related to emergency response systems, cyberattacks, and the need for technological upgrades. No other specific consequences beyond those mentioned in options (a) to (h) are highlighted.
Domain information, finance, government (a) The failed system was related to the industry of information, specifically the emergency-response system for 911 calls. The incidents reported in the articles highlight the vulnerabilities and failures in the antiquated 911 emergency-response systems, which are crucial for providing timely and accurate information during emergencies [50444]. (h) The failure incident also has implications for the finance industry, as upgrading and maintaining the emergency-response systems require significant financial resources. The article mentions concerns about the financial means of states and localities to pay for the necessary technology upgrades to enhance the security and efficiency of the 911 systems [50444]. (l) Additionally, the government sector is directly impacted by the software failure incident, as the emergency-response systems are a critical component of public services and safety. The article discusses the challenges faced by government agencies in funding the transition to next-generation technology for 911 systems to ensure better emergency response capabilities [50444].

Sources

Back to List