Incident: RoboSimian Lithium-Ion Battery Explosion at NASA's Jet Propulsion Laboratory

Published Date: 2016-10-28

Postmortem Analysis
Timeline 1. The software failure incident involving the RoboSimian at Nasa's Jet Propulsion Laboratory happened on June 14th [48582].
System unknown
Responsible Organization unknown
Impacted Organization 1. Nasa [48582]
Software Causes unknown
Non-software Causes 1. The failure incident was caused by a lithium-ion battery undergoing thermal runaway, resulting in the destruction of the RoboSimian robot at Nasa's Jet Propulsion Laboratory [48582]. 2. One cell of the battery was damaged and sent misleading information to the monitoring equipment, causing individual cells to become overcharged [48582]. 3. The battery explosion led to the release of smoke containing hydrogen fluoride, carbon monoxide, and carbon dioxide [48582]. 4. The initial battery explosion required water to extinguish the fire, as CO2 fire extinguishers were not effective [48582]. 5. The incident highlighted risks associated with lithium-ion batteries, including incomplete or inaccurate manufacturer specifications and challenges in safety regulations keeping pace with battery innovations [48582].
Impacts 1. The RoboSimian robot at Nasa's Jet Propulsion Laboratory exploded due to a lithium-ion battery failure during charging, resulting in the destruction of the robot [48582]. 2. The explosion released harmful chemicals like hydrogen fluoride, carbon monoxide, and carbon dioxide into the lab environment, posing a health risk to the researchers present [48582]. 3. The battery failure incident required immediate action from interns and firefighters to extinguish the fire using a CO2 fire extinguisher and water, highlighting the potential dangers and challenges in handling such incidents [48582]. 4. The incident led to a need for better monitoring and safety measures for lithium-ion batteries, as highlighted by the analysis that indicated misleading information from a damaged cell led to overcharging and subsequent explosion [48582].
Preventions 1. Implementing more robust battery monitoring systems to detect and prevent individual cell damage and overcharging [48582]. 2. Enhancing safety protocols for handling and charging lithium-ion batteries, including proper storage and charging procedures [48582]. 3. Conducting regular maintenance and inspections on battery packs to identify any potential issues before they escalate into failures [48582]. 4. Providing comprehensive training to personnel on how to respond to battery-related incidents, including the appropriate use of fire extinguishers and contacting emergency services promptly [48582].
Fixes 1. Implement more robust battery monitoring systems to detect and prevent individual cell damage and overcharging, which can lead to thermal runaway and explosions [48582]. 2. Enhance safety protocols for handling lithium-ion batteries, including proper storage, charging, and usage practices to mitigate risks associated with battery failures [48582]. 3. Conduct thorough testing and validation of battery packs to ensure accurate manufacturer specifications and compliance with safety regulations [48582].
References 1. NASA - The articles gather information about the software failure incident from NASA, specifically the Jet Propulsion Laboratory where the RoboSimian explosion occurred [48582].

Software Taxonomy of Faults

Category Option Rationale
Recurring unknown The articles do not mention any software failure incidents related to either one_organization or multiple_organization.
Phase (Design/Operation) design, operation (a) The software failure incident related to the design phase can be seen in the article where it mentions that based on initial analysis, one cell of the battery was damaged and sent misleading information to the monitoring equipment. As a result, individual cells became overcharged, leading to the eventual explosion of the RoboSimian robot at Nasa's Jet Propulsion Laboratory [48582]. (b) The software failure incident related to the operation phase is evident in the article when it describes how an intern from the next lab over climbed through a window and sprayed RoboSimian with a CO2 fire extinguisher after the initial battery explosion. However, this method did not extinguish the fire, and water was needed to dissipate the heat. Eventually, firefighters in breathing masks rolled RoboSimian outside and extinguished the fire with water for 30 seconds, highlighting the operational challenges faced during the incident [48582].
Boundary (Internal/External) unknown The articles do not provide information about a software failure incident related to the boundary of the failure incident.
Nature (Human/Non-human) non-human_actions, human_actions (a) The software failure incident occurring due to non-human actions: The incident involving the RoboSimian robot at Nasa's Jet Propulsion Laboratory was caused by a lithium-ion battery undergoing thermal runaway, resulting in the destruction of the robot. The failure was attributed to one cell of the battery being damaged and sending misleading information to the monitoring equipment, causing individual cells to become overcharged. This non-human action of the damaged cell led to a cascading effect within the battery pack, ultimately resulting in the explosion and fire [48582]. (b) The software failure incident occurring due to human actions: Human actions played a role in responding to the software failure incident. After the initial battery explosion, an intern attempted to extinguish the fire with a CO2 fire extinguisher, which was not effective. It was later determined that water was needed to dissipate the heat from the fire. Another intern called the fire department, and firefighters eventually rolled the robot outside and extinguished the fire with water. Additionally, researchers were seen working on the robot, swapping out one of its batteries, indicating human involvement in the maintenance and handling of the robot's power source [48582].
Dimension (Hardware/Software) hardware (a) The software failure incident occurring due to hardware: - The incident involving the RoboSimian robot at Nasa's Jet Propulsion Laboratory on June 14th was primarily caused by a lithium-ion battery explosion [48582]. - The explosion was triggered by one cell of the battery being damaged and sending misleading information to the monitoring equipment, causing individual cells to become overcharged [48582]. - The battery explosion led to the release of smoke containing hydrogen fluoride, carbon monoxide, and carbon dioxide [48582]. - The interns initially used a CO2 fire extinguisher to try to put out the fire, but water was ultimately needed to extinguish it, highlighting the importance of proper fire suppression methods for such incidents [48582]. (b) The software failure incident occurring due to software: - The incident was not directly attributed to a software failure but rather to a hardware issue related to the lithium-ion battery explosion [48582]. - There is no mention in the article of any software-related contributing factors that originated in the software itself leading to the incident.
Objective (Malicious/Non-malicious) non-malicious (a) The incident described in the article is not related to a malicious software failure. It was a non-malicious failure caused by a lithium-ion battery undergoing thermal runaway, resulting in the destruction of the RoboSimian robot at Nasa's Jet Propulsion Laboratory [48582]. The failure was attributed to one cell of the battery being damaged and sending misleading information to the monitoring equipment, causing individual cells to become overcharged. This non-malicious failure was a result of a technical issue with the battery rather than any intentional harm to the system.
Intent (Poor/Accidental Decisions) unknown The articles do not mention any software failure incident related to poor_decisions or accidental_decisions.
Capability (Incompetence/Accidental) accidental (a) The articles do not mention any software failure incident related to development incompetence. (b) The incident involving the RoboSimian robot at Nasa's Jet Propulsion Laboratory was not due to development incompetence but rather an accidental failure caused by a lithium-ion battery explosion during charging. The explosion occurred due to thermal runaway in the battery, where one cell was damaged and sent misleading information to the monitoring equipment, causing individual cells to become overcharged. This accidental failure led to the robot catching fire and required the intervention of interns and firefighters to extinguish the flames [48582].
Duration unknown The articles do not mention any software failure incident related to either permanent or temporary duration. Therefore, the duration of the software failure incident in this case is unknown.
Behaviour crash, omission, value, other (a) crash: The incident involving the RoboSimian at NASA's Jet Propulsion Laboratory on June 14th resulted in an explosion of the robot while it was being charged. The explosion was powerful, with the force of a stick of dynamite, and caused significant damage. The battery failure led to the destruction of the robot, indicating a crash in the system's functionality as it lost its state and was unable to perform its intended functions [48582]. (b) omission: The battery failure in the RoboSimian incident was attributed to one cell of the battery being damaged and sending misleading information to the monitoring equipment. This led to individual cells becoming overcharged, ultimately resulting in the explosion. The omission of correct information from the damaged cell contributed to the failure of the battery system [48582]. (c) timing: The incident involving the RoboSimian's battery failure did not specifically mention any timing-related failures where the system performed its intended functions either too late or too early. The focus was more on the battery failure leading to the explosion rather than a timing issue [48582]. (d) value: The battery failure in the RoboSimian incident resulted in the system performing its intended functions incorrectly, leading to the explosion and subsequent damage to the robot. The damaged cell providing misleading information contributed to the incorrect functioning of the battery system [48582]. (e) byzantine: The articles did not mention any behavior indicative of a byzantine failure where the system behaves erroneously with inconsistent responses and interactions. The focus was more on the battery failure and subsequent explosion of the RoboSimian robot [48582]. (f) other: The behavior of the software failure incident involving the RoboSimian at NASA's Jet Propulsion Laboratory can be categorized as a catastrophic failure due to the battery explosion. The incident showcased the potential dangers of lithium-ion batteries and the risks associated with their failure, leading to significant damage and the need for emergency response measures [48582].

IoT System Layer

Layer Option Rationale
Perception None None
Communication None None
Application None None

Other Details

Category Option Rationale
Consequence no_consequence Based on the provided articles, there is no mention of a software failure incident resulting in any of the consequences listed in the question. Therefore, the answer is 'no_consequence'.
Domain unknown Unknown

Sources

Back to List