Incident: NHS App Outage Impacts Travelers' Vaccination Status Verification System

Published Date: 2021-10-13

Postmortem Analysis
Timeline 1. The NHS app for England failed on a Wednesday, as reported in the article [119723]. 2. The article was published on 2021-10-13, which was itself a Wednesday. 3. Estimation: The incident therefore most likely occurred on October 13, 2021, the day of publication.
System 1. NHS app for England [119723]
Responsible Organization 1. The NHS England app [119723]
Impacted Organization 1. Users of the NHS app for England [119723] 2. People traveling across Europe, particularly Britons [119723]
Software Causes 1. An unspecified malfunction in the NHS app for England took the service offline for around four hours, leaving users unable to prove their Covid vaccination status at airport check-ins and other venues [119723].
Non-software Causes 1. Overreliance on a single centralized system for a critical function, proving Covid vaccination status [119723]. 2. Users misunderstanding the rules and being unaware of vaccination proof requirements [119723]. 3. Problems implementing mandatory vaccine checks in different regions, which caused confusion and technical difficulties [119723].
Impacts 1. Users could not prove their Covid vaccination status at airport check-ins, so many were unable to board flights and others were turned away from venues that require evidence of vaccination [119723]. 2. At venues in Wales and Scotland with mandatory vaccine checks, users who could not prove their vaccination status ended up in arguments with door staff, disrupting entry to nightclubs [119723]. 3. The outage of the NHS app in England exposed the reliance on a single centralized system for essential services, affecting international travel and access to venues requiring vaccination proof [119723]. 4. People traveling across Europe faced challenges as countries demanded evidence of double vaccination to enter various establishments, and Britons faced particular difficulty because UK vaccination certificates were not compatible with the EU-wide scheme [119723].
Preventions 1. Implementing redundancy and failover systems to ensure continuous operation even if the main system fails [119723]. 2. Conducting thorough testing and quality assurance processes before deploying updates or changes to the software system [119723]. 3. Providing clear and accessible guidance to users on alternative methods to access and store their vaccination certificates, such as saving them to Apple Wallet or carrying paper copies [119723]. 4. Collaborating with other countries and international organizations to ensure interoperability of digital vaccination certificates across different systems and regions [119723].
Fixes 1. Implementing a decentralized system rather than relying on a single centralized system could help prevent similar incidents in the future [119723]. 2. Providing clear guidance and support for users on alternative methods of accessing and storing vaccination certificates, such as saving them to Apple Wallet or carrying paper copies, could mitigate the impact of technical issues [119723]. 3. Ensuring seamless integration of the British data into the EU Digital Covid Certificate scheme to facilitate smoother verification of vaccination status across different countries [119723].
References 1. Users affected by the NHS app outage in England, Wales, and Scotland [119723] 2. Door staff at nightclubs in Wales [119723] 3. First Minister Nicola Sturgeon of Scotland [119723] 4. Boris Johnson and his decision on vaccine passports in England [119723] 5. Attendees at the Labour party's annual conference in Brighton [119723] 6. Apple iPhone users and their ability to save NHS England vaccination certificates to Apple Wallet [119723] 7. Britons traveling across Europe and facing challenges with proof-of-vaccination QR codes [119723] 8. Talks about integrating British data into the EU Digital Covid Certificate scheme [119723]
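The failover and offline-copy ideas listed under Preventions and Fixes can be illustrated with a minimal sketch. It assumes a client that tries the live service first and falls back to the last copy it saved locally. Every name here (`Certificate`, `CertificateStore`, `ServiceUnavailable`, the `fetch_live` callable) is hypothetical and chosen for illustration; nothing in the article describes how the NHS app is actually built.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Tuple


class ServiceUnavailable(Exception):
    """Raised by a (hypothetical) certificate backend that is offline."""


@dataclass
class Certificate:
    holder: str
    qr_payload: str  # opaque string a venue or airline would scan


class CertificateStore:
    """Tries the live service first, then falls back to the last cached copy."""

    def __init__(self, fetch_live: Callable[[str], Certificate]) -> None:
        self._fetch_live = fetch_live              # assumed remote lookup
        self._cache: Dict[str, Certificate] = {}   # stand-in for on-device storage

    def get(self, holder: str) -> Tuple[Optional[Certificate], str]:
        """Return (certificate, source); source is 'live', 'cached' or 'unavailable'."""
        try:
            cert = self._fetch_live(holder)
        except ServiceUnavailable:
            cached = self._cache.get(holder)
            if cached is not None:
                return cached, "cached"
            return None, "unavailable"
        # Refresh the offline copy on every successful fetch.
        self._cache[holder] = cert
        return cert, "live"
```

In this sketch a user who opened the app at least once before the outage would still have a usable copy during it, while a first-time user would not. That is the same trade-off as saving the certificate to a phone wallet or carrying a paper copy.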

Software Taxonomy of Faults

Category Option Rationale
Recurring multiple_organization (a) The NHS app for England suffered an outage that left users unable to prove their Covid vaccination status at airport check-ins [119723]. The incident showed the problems that can arise from relying on a single centralized system for critical services: it affected users' ability to travel internationally and to access venues that required proof of vaccination. (b) The article also reports that Scotland, which developed its own standalone app for mandatory vaccine checks, had technology problems as well. The first minister of Scotland apologized for an "initial backlog" of users unable to access their health records, which indicates a similar software failure at a different organization [119723].
Phase (Design/Operation) design, operation (a) Design: The outage of the NHS app for England left users unable to prove their Covid vaccination status at airport check-ins and other venues. The article attributes the scale of the disruption to the design choice of placing a single centralised system at the heart of international travel and mandatory venue checks [119723]. (b) Operation: Users in Wales and Scotland had trouble proving their vaccination status at nightclubs and other venues. The article reports running arguments with door staff, with users claiming their phone batteries had run out, struggling to download their vaccination proof, or being unable to access their health records. These operational problems disrupted access to services and venues that required proof of vaccination [119723].
Boundary (Internal/External) within_system, outside_system (a) Within the system: The outage originated in the NHS app for England itself. The article reports that it left frustrated users unable to prove their Covid vaccination status at airport check-ins and other venues [119723], which points to a failure of the app's own functionality. (b) Outside the system: The article also describes factors external to the app that made the impact worse, namely the requirement for proof of vaccination in various European countries and the lack of integration with the EU-wide vaccine passport scheme, both of which caused difficulties for people traveling across Europe [119723].
Nature (Human/Non-human) non-human_actions (a) Non-human actions: The failure of the NHS app for England is described as a technical issue or malfunction in the centralised system. The outage left users unable to prove their Covid vaccination status, which affected their ability to board flights and enter venues that required vaccination proof. The article does not attribute the outage to human actions, only to a system failure within the app [119723]. (b) Human actions: The article does not mention the failure being directly caused by human actions. Its focus is on the technical issues and system failures behind the outage [119723].
Dimension (Hardware/Software) hardware, software (a) Hardware: The only device-related point in the article is that many Apple iPhone users were unaware they could save a copy of their NHS England vaccination certificate to Apple Wallet, keeping it available offline and avoiding the risk of the service crashing [119723]. (b) Software: The outage of the NHS app for England left frustrated users unable to prove their Covid vaccination status at airport check-ins, so many could not board flights and others were turned away from venues that require evidence of vaccination [119723]. Technology problems were also reported in Scotland, where the first minister apologized for an "initial backlog" of users unable to access their health records [119723].
Objective (Malicious/Non-malicious) non-malicious (a) The software failure incident related to the NHS app outage for England was non-malicious. The outage was not caused by malicious intent but rather by technical issues within the centralized system. Users were frustrated and unable to prove their Covid vaccination status at airport check-ins and other venues, leading to disruptions in travel and access to certain establishments [119723].
Intent (Poor/Accidental Decisions) poor_decisions (a) The software failure incident related to the NHS app outage for England can be attributed to poor decisions. The article mentions that the outage highlighted the problems that can arise from relying on a single centralized system for essential services like proving Covid vaccination status. It points out that in an era where people expect instant functionality from online accounts, a single government-run app going offline can effectively disrupt international travel for a significant portion of the population [119723]. This indicates that the decision to centralize the vaccination status verification system in a single app led to significant consequences when the app experienced technical issues.
Capability (Incompetence/Accidental) development_incompetence, accidental (a) The software failure incident related to development incompetence is evident in the article as it mentions issues with the NHS app for England, which left frustrated users unable to prove their Covid vaccination status at airport check-ins and other venues [119723]. The outage of the app caused significant disruptions, highlighting the problems that can arise when relying on a single centralized system for critical functions. Additionally, the article discusses technical issues faced by users in Wales and Scotland due to mandatory checks for vaccination status, indicating potential shortcomings in the development and implementation of the technology. (b) The software failure incident related to accidental factors is also apparent in the article, particularly when it mentions the outage of the NHS England app being restored after around four hours [119723]. This restoration implies that the initial failure was not intentional but rather an accidental disruption in the service. Furthermore, the article discusses issues faced by users in Scotland with accessing their health records, which could be attributed to accidental technical glitches rather than deliberate actions.
Duration temporary (a) The software failure incident related to the NHS app for England was temporary. The article mentions that the app outage lasted for around four hours before it was restored, causing frustration among users who were unable to prove their Covid vaccination status during that time [119723].
Behaviour crash, omission, other (a) crash: The NHS app for England went offline, so frustrated users could not prove their Covid vaccination status at airport check-ins, were unable to board flights, and were turned away from venues [119723]. (b) omission: The system also failed to deliver its intended function when users could not access their health records because of technical issues, which caused problems with proving vaccination status at venues and events [119723]. (c) timing: The article does not mention the system performing its intended functions too late or too early. (d) value: The article does not mention the system performing its intended functions incorrectly. (e) byzantine: The article does not describe the system behaving erroneously with inconsistent responses and interactions. (f) other: The incident highlighted the problems that can arise from relying on a single centralized system for critical functions, affecting international travel and access to venues [119723].

IoT System Layer

Layer Option Rationale
Perception None None
Communication None None
Application None None

Other Details

Category Option Rationale
Consequence property, delay, theoretical_consequence (a) death: There is no mention of any deaths resulting from the software failure incident in the provided article [119723]. (b) harm: The article does not mention any physical harm caused to individuals due to the software failure incident [119723]. (c) basic: The software failure incident did not impact people's access to food or shelter [119723]. (d) property: People's material goods, money, or data were impacted due to the software failure incident. Specifically, individuals were unable to prove their Covid vaccination status at airport check-ins, leading to some being unable to board flights and others being turned away from venues that require evidence of vaccination [119723]. (e) delay: People had to postpone activities due to the software failure incident. The outage of the NHS app for England left frustrated users unable to prove their Covid vaccination status, causing delays in boarding flights and entry to venues [119723]. (f) non-human: There is no mention of non-human entities being impacted by the software failure incident in the provided article [119723]. (g) no_consequence: The software failure incident had real observed consequences, as users were affected in their ability to travel and access certain venues [119723]. (h) theoretical_consequence: The article discusses potential consequences of the software failure incident that did not occur, such as big events being left without customers if people were unable to prove their vaccination status, and the impact on Britons traveling in Europe due to Brexit and the lack of integration with the EU-wide vaccine passport scheme [119723]. (i) other: The article does not mention any other specific consequences of the software failure incident beyond those discussed in options (d) and (e) [119723].
Domain information, health (a) The incident relates to the information industry, specifically the production and distribution of information. The failure of the NHS app for England left users unable to prove their Covid vaccination status at airport check-ins, which caused difficulties in boarding flights and entering venues that require vaccination evidence [119723]. (j) The incident also relates to the health industry. The NHS app outage affected individuals trying to access their health records and to enter venues that required proof of vaccination. It highlighted the reliance on digital systems in the healthcare sector and the challenges that arise when such systems fail [119723].
