Incident: Smart Heating App Glitch Causes Temperature Soar to 32C

Published Date: 2016-02-27

Postmortem Analysis
Timeline 1. The software failure incident with the Hive smart heating app happened in November 2015 [40412].
System 1. Hive smart heating app owned by British Gas [40412]
Responsible Organization 1. Hive, a hi-tech service owned by British Gas [40412]
Impacted Organization 1. Dozens of users of Hive, a hi-tech service owned by British Gas [40412] 2. Customer Luke George from Bradford [40412]
Software Causes 1. Software glitch causing the heating to be turned up to the maximum level by the iPhone app without user instruction [40412] 2. Failure of the system to connect to the thermostat or the boiler, leading to shutdowns and loss of control over heating and hot water [40412] 3. Loss of schedules and inability to add new ones due to software issues [40412]
Non-software Causes 1. The heating system sending temperatures skyrocketing [40412] 2. System failing to connect to the thermostat or boiler, shutting down and turning off heating and hot water [40412] 3. Loss of schedules and inability to add new ones [40412] 4. Faulty hub replacement and denial of promised upgrades [40412]
Impacts 1. Users experienced their heating systems being turned up to the maximum level without their instruction, leading to temperatures soaring to 32 degrees Celsius, causing discomfort and potential safety hazards [40412]. 2. Concerns were raised about the potential increase in energy costs for affected users due to the software glitch [40412]. 3. The software failure incident resulted in users losing control over their heating systems, with some experiencing failures in connecting to the thermostat or boiler, leading to disruptions in heating and hot water supply [40412]. 4. The glitch raised concerns about the safety and security of user data, as a previous issue with the system exposed unencrypted data that could have been exploited by hackers to determine when properties were unoccupied, potentially aiding burglaries [40412].
Preventions 1. Implementing thorough software testing procedures before releasing updates or new features could have potentially prevented the software glitch that caused the heating to soar to 32 degrees [40412]. 2. Conducting regular security audits and ensuring proper encryption of user data could have prevented previous issues where user data was being sent without encryption, making it vulnerable to hackers [40412].
Fixes 1. Turning the thermostat down using the app, web dashboard, or the thermostat itself [40412].
References 1. Customers who experienced the software glitch with the Hive smart heating app, such as Luke George from Bradford and a 35-year-old man from North-West London [40412]. 2. Hive, the hi-tech service owned by British Gas, which issued an apology for the glitch and provided information on the incident [40412].

Software Taxonomy of Faults

Category Option Rationale
Recurring one_organization (a) The software failure incident has happened again at one_organization: The article mentions that the Hive smart heating app, owned by British Gas, experienced a software glitch that caused temperatures to soar to 32 degrees Celsius for some users. This incident is not the first problem the system has faced. In August the previous year, British Gas had to address a security issue where user data was being sent without encryption, potentially allowing hackers to access customers' information [40412]. (b) The software failure incident has happened again at multiple_organization: There is no specific mention in the article about the same software failure incident occurring at other organizations or with their products and services.
Phase (Design/Operation) design, operation (a) The software failure incident in the article can be attributed to the design phase. The article mentions that the heating app experienced a software glitch that caused temperatures to soar to 32C without user instruction. Initially, the company suggested that users must have been hacked, but engineers are now working on a solution to the software issue [40412]. (b) Additionally, the software failure incident can also be linked to the operation phase. Users reported problems with the heating system, such as temperatures skyrocketing, the system failing to connect to the thermostat or boiler, and losing control of heating schedules. These issues were experienced during the operation of the system, indicating operational challenges and failures [40412].
Boundary (Internal/External) within_system, outside_system (a) The software failure incident related to the Hive smart heating app was primarily within the system. The incident was caused by a software glitch within the app that led to temperatures soaring to 32 degrees Celsius without user instruction [40412]. The glitch was acknowledged by Hive, and they mentioned that a certain sequence of commands in the iOS app could cause the thermostat temperature to rise to 32 degrees [40412]. (b) Additionally, there were concerns raised about the security of the system, indicating an outside system factor. In a previous incident, it was reported that user data was being sent without encryption, potentially allowing hackers to tap into customers' Wi-Fi and steal their information [40412]. This external security vulnerability was addressed by British Gas by agreeing to start encrypting the data to reassure customers about the safety of their personal information [40412].
Nature (Human/Non-human) non-human_actions, human_actions (a) The software failure incident in the article was primarily due to non-human actions, specifically a software glitch in the Hive iOS app that caused the thermostat temperature to rise to 32 degrees without user instruction. This glitch affected dozens of users, leading to concerns about increased energy costs and potential fire risks [40412]. (b) Human actions also played a role in the incident as some users were initially told by Hive bosses that their systems must have been hacked when they reported the issue. Additionally, there were complaints from customers about the system failing to connect to the thermostat or boiler, leading to heating and hot water shutdowns, which required manual resets by the company [40412].
Dimension (Hardware/Software) hardware, software (a) The software failure incident occurring due to hardware: - The article mentions that a customer was initially told by Hive that the issue with the heating system was due to a problem with his boiler, indicating a potential hardware-related contributing factor to the failure [40412]. - Additionally, there are complaints about the system failing to connect to the thermostat or the boiler, leading to shutdowns and issues with heating and hot water, which could be hardware-related problems [40412]. (b) The software failure incident occurring due to software: - The main cause of the incident is identified as a software glitch in the Hive iOS app, which causes the thermostat temperature to rise to 32 degrees without user instruction [40412]. - The article mentions that engineers are working on a solution to the software issue, indicating that the root cause of the problem lies in the software of the app [40412]. - The spokesman for Hive apologizes for the 'glitch' and mentions that a software update is being worked on to address the issue, further confirming that the failure originated in the software [40412].
Objective (Malicious/Non-malicious) non-malicious (a) The software failure incident described in the articles is non-malicious. The incident was caused by a software glitch in the Hive iOS app that led to the thermostat temperature rising to 32 degrees for some users. This glitch was not intentional and was not caused by any malicious activity. The company, Hive, acknowledged the issue as a temporary glitch affecting a very small number of customers and stated that it was working on a software update to address the problem [40412].
Intent (Poor/Accidental Decisions) poor_decisions, accidental_decisions (a) The software failure incident related to the Hive smart heating app was primarily due to poor decisions made by the developers and operators of the app. Initially, when users complained about the heating being turned up to 32 degrees Celsius without their instruction, Hive bosses wrongly suggested that users must have been hacked [40412]. This initial response indicates a lack of understanding or acknowledgment of the software glitch within the system. Additionally, the incident highlights poor decisions in terms of data security, as it was mentioned that user data was being sent without encryption, potentially allowing hackers to tap into customers' Wi-Fi and steal their information [40412]. (b) The software failure incident also involved accidental decisions or unintended consequences. For example, one user mentioned that whenever they complained to Hive about issues with the system, Hive would reset it, and it would work for a couple of weeks before failing again [40412]. This cycle of temporary fixes followed by recurring problems suggests a pattern of accidental decisions or unintended consequences in the troubleshooting and maintenance of the system.
Capability (Incompetence/Accidental) development_incompetence (a) The software failure incident in the article can be attributed to development incompetence. The article mentions that the heating app experienced a software glitch that caused temperatures to soar to 32 degrees Celsius without user instruction. Initially, Hive bosses wrongly suggested that users must have been hacked, indicating a lack of understanding of the root cause. Engineers are now working on a solution to the software issue, highlighting a need for professional competence in addressing the problem [40412]. (b) The software failure incident can also be considered accidental. The article describes how the glitch in the Hive iOS app caused the thermostat temperature to rise to 32 degrees due to a specific sequence of commands. This unintended consequence led to concerns about energy costs, fire risks, and inconvenience for users. The Hive spokesperson acknowledged the glitch as temporary and mentioned that a software update is being worked on to address the issue, indicating that the incident was accidental in nature [40412].
Duration temporary (a) The software failure incident in the article is temporary. The article mentions that the glitch affecting a very small number of customers causes the thermostat temperature to rise to 32 degrees. The company is aware of this temporary glitch and is working on a software update to fix it. Customers can easily fix the issue by turning the thermostat down using the app, web dashboard, or the thermostat itself. The spokesman for Hive mentioned that the rest of the app works as normal despite this specific issue [40412].
Behaviour crash, omission, value, other (a) crash: The software failure incident in the article can be categorized as a crash as users reported that the heating system controlled by the Hive app was turning up the temperature to 32 degrees Celsius without their instruction, indicating a failure of the system to perform its intended function correctly [40412]. (b) omission: The software failure incident can also be categorized as an omission as users mentioned instances where the system failed to connect to the thermostat or the boiler, resulting in the shutdown of the heating and hot water, omitting to perform its intended functions [40412]. (c) timing: There is no specific mention of a timing-related failure in the articles provided. (d) value: The software failure incident can be categorized as a value failure as users experienced the system setting the temperature to 32 degrees when it should have been between 20 and 22 degrees, indicating incorrect performance of its intended function [40412]. (e) byzantine: The software failure incident does not exhibit characteristics of a byzantine failure where the system behaves erroneously with inconsistent responses and interactions. (f) other: The other behavior observed in the software failure incident is the system wrongly suggesting that users must have been hacked when the glitch first emerged, indicating a misdiagnosis of the issue [40412].

IoT System Layer

Layer Option Rationale
Perception sensor, embedded_software (a) sensor: The software failure incident related to the Hive smart heating app was primarily due to a glitch in the sensor or thermostat readings. Users reported that the temperature in their homes soared to 32 degrees Celsius without their instruction, indicating a sensor error [40412]. (b) actuator: There is no specific mention of the software failure incident being related to an actuator error in the articles. (c) processing_unit: The software failure incident was not directly attributed to a processing unit error but rather to a glitch in the software that controlled the heating system [40412]. (d) network_communication: The software failure incident did not stem from network communication errors but rather from a software glitch within the Hive app [40412]. (e) embedded_software: The failure was related to embedded software error within the Hive smart heating app, as users experienced issues with the app controlling the boilers and thermostat, causing temperatures to rise unexpectedly [40412].
Communication unknown The software failure incident reported in the articles does not specifically mention whether the failure was related to the communication layer of the cyber physical system that failed. The focus of the incident was on a software glitch in the Hive iOS app that caused the thermostat temperature to rise to 32 degrees for some users, leading to complaints and concerns about potential risks such as increased energy costs and fire hazards. The articles do not provide details indicating whether the failure was specifically at the link level (wired or wireless physical layer) or the connectivity level (network or transport layer) of the cyber physical system.
Application TRUE The software failure incident related to the Hive smart heating app, as reported in Article 40412, was indeed related to the application layer of the cyber physical system. The incident was caused by a software glitch in the Hive iOS app that led to the thermostat temperature rising to 32 degrees without user instruction. This glitch was described as a sequence of commands in the app causing the issue, indicating a problem within the application itself [40412].

Other Details

Category Option Rationale
Consequence property, non-human, theoretical_consequence, other (a) death: There is no mention of any deaths resulting from the software failure incident in the articles. [40412] (b) harm: There is no direct mention of physical harm to individuals due to the software failure incident. However, there are concerns raised about the potential fire risk and dangers of overheating caused by the glitch. [40412] (c) basic: There is no indication that people's access to food or shelter was impacted by the software failure incident. [40412] (d) property: The software failure incident impacted people's material goods and comfort as their heating systems were turned up to 32 degrees without their instruction, potentially leading to increased energy costs and concerns about fire risks. Additionally, there was a previous incident where user data was being sent without encryption, raising concerns about the security of personal information. [40412] (e) delay: There is no mention of any activities being postponed due to the software failure incident. [40412] (f) non-human: Non-human entities, such as the heating systems controlled by the software, were directly impacted by the glitch as temperatures soared to 32 degrees without user input. [40412] (g) no_consequence: The software failure incident did have observed consequences, such as users experiencing overheating issues and concerns about potential risks and increased energy costs. [40412] (h) theoretical_consequence: There were potential consequences discussed, such as the risk of fire due to the elevated temperatures caused by the software glitch, but there is no mention of any actual fires occurring as a result. [40412] (i) other: The software failure incident led to users losing control of their heating systems, experiencing discomfort, potential safety hazards, and concerns about data security, which are additional consequences not explicitly covered in the options provided. [40412]
Domain utilities (a) The software failure incident reported in the articles is related to the industry of utilities, specifically in the context of power and heating control systems for households. The incident involved the Hive smart heating app owned by British Gas, which is used by customers to control their boilers and heating systems [40412].

Sources

Back to List