Incident: USA Visa Website Crash Causing Flight Disruptions and Financial Loss

Published Date: 2019-11-26

Postmortem Analysis
Timeline 1. The software failure incident happened in November 2019. [Article 91793]
System The system that failed in the software failure incident reported in Article 91793 is: 1. USA's ESTA website - The Electronic System for Travel Authorization website crashed after officials decided to 'modernise' the website, leading to travellers being unable to process their applications [91793].
Responsible Organization 1. U.S. Customs and Border Protection officials were responsible for causing the software failure incident by introducing new features to the Electronic System for Travel Authorization website, leading to the crash [91793].
Impacted Organization 1. British passengers traveling to the United States were impacted by the software failure incident [Article 91793].
Software Causes 1. The USA visa website crashed due to a system update intended to 'modernize' the website, causing IT problems and preventing travellers from processing their ESTA applications [Article 91793].
Non-software Causes 1. Increased demand due to the upcoming Thanksgiving holiday, leading to a surge in ESTA applications [91793] 2. Changes in visa regulations and requirements catching travelers off guard [91793]
Impacts 1. British passengers were stopped from boarding flights to the United States, leading to missed flights and travel disruptions for individuals like Beverley Coult and Rebecca Downes [Article 91793]. 2. Some travelers incurred financial losses due to non-refundable flights and accommodation costs, such as the Downes couple who wasted £1,500 on their trip to New York [Article 91793]. 3. Emotional distress was caused to travelers who had planned important trips, like Jess Lipson who had to cancel her visit to a friend in Los Angeles for Thanksgiving [Article 91793]. 4. The software failure incident led to frustration and anxiety among travelers, as seen on social media platforms where individuals expressed concerns about missing flights and the reliability of third-party ESTA services [Article 91793]. 5. The incident affected travelers globally, not just from the UK, as there are 36 other countries signed up to America's ESTA scheme who may have faced similar issues [Article 91793].
Preventions 1. Thorough Testing: Conducting comprehensive testing, including load testing and user acceptance testing, could have helped identify any issues with the system update before it was rolled out [91793]. 2. Incremental Updates: Implementing smaller, incremental updates instead of a major overhaul could have minimized the risk of a widespread failure like the crash of the USA visa website [91793]. 3. Effective Communication: Ensuring clear communication with users about the system update, potential downtime, and alternative application methods could have helped manage expectations and prevent travelers from missing their flights [91793]. 4. Backup Systems: Having backup systems or contingency plans in place to handle unexpected failures or downtime could have mitigated the impact of the software failure incident [91793].
Fixes 1. Implement thorough testing procedures before deploying system updates to ensure the stability and functionality of the website [91793]. 2. Provide better communication channels for users experiencing issues, such as a dedicated helpline or live chat support [91793]. 3. Enhance the website's error handling mechanisms to provide more informative error messages and guidance for users encountering problems [91793]. 4. Consider staggering the implementation of updates to minimize the impact on users and allow for timely issue resolution [91793]. 5. Ensure that critical functionalities like the issuance of verification codes are thoroughly tested and monitored to prevent disruptions in user experience [91793].
References 1. Social media posts from affected travelers [Article 91793] 2. Interviews with impacted individuals such as Beverley Coult, Rebecca Downes, Will Hall, Jess Lipson [Article 91793] 3. Statements from US Customs and Border Protection spokesperson [Article 91793]

Software Taxonomy of Faults

Category Option Rationale
Recurring one_organization, multiple_organization (a) The software failure incident related to the USA visa website crashing due to a system update has happened before within the same organization. The incident occurred after officials decided to 'modernize' the website, causing travelers to be unable to process their applications [91793]. (b) The software failure incident related to the USA visa website crashing due to a system update has also affected multiple organizations or individuals. For example, Rebecca Downes and her husband were turned away at the airport due to their ESTA not coming through in time, resulting in wasted money on non-refundable flights and accommodation [91793]. Additionally, other travelers like Will Hall and Jess Lipson also faced issues with their flights and had to pay extra fees due to the website crash [91793].
Phase (Design/Operation) design, operation (a) The software failure incident in the article was primarily due to the system update that was intended to modernize the USA visa website. The update was aimed at allowing travelers to save their applications halfway through and to include Poland in the scheme. However, after the update, the website crashed, leaving travelers unable to process their ESTA applications, resulting in missed flights and significant inconvenience for many passengers [91793]. (b) The software failure incident also had operational implications as travelers were unable to complete their ESTA applications due to the crash of the website. This operational failure led to individuals missing their flights, incurring financial losses, and experiencing emotional distress. The inability to operate the system effectively, especially during a critical time when travelers needed to process their applications, resulted in significant disruptions for the affected passengers [91793].
Boundary (Internal/External) within_system (a) within_system: The software failure incident related to the USA visa website crashing was primarily due to an internal factor, specifically a system update that was intended to modernize the website [91793]. The update introduced new features but led to issues where some ESTA applicants were unable to receive the necessary verification code, causing disruptions in the application process. The failure originated from within the system as a result of the update, impacting travelers' ability to process their applications and leading to missed flights for some individuals.
Nature (Human/Non-human) non-human_actions, human_actions (a) The software failure incident occurred due to non-human actions, specifically a system update that led to the crash of the USA visa website. The update was intended to modernize the website, allowing travelers to save their applications halfway through and enabling Poles to apply after Poland was included in the scheme. The crash resulted in travelers being unable to process their applications, leading to missed flights and significant inconvenience for many individuals [91793]. (b) The software failure incident also involved human actions as officials decided to implement the system update that ultimately led to the crash of the USA visa website. The decision to 'modernize' the website without proper testing or contingency planning resulted in travelers facing issues with their ESTA applications, causing missed flights and financial losses for individuals like Beverley Coult, Rebecca Downes, and others [91793].
Dimension (Hardware/Software) software (a) The software failure incident occurring due to hardware: - The incident reported in the articles does not mention any hardware-related issues contributing to the software failure. It primarily focuses on the crash of the USA visa website after a system update, which was attributed to software issues like the inability to process applications, error messages, and website maintenance [Article 91793]. (b) The software failure incident occurring due to software: - The software failure incident, in this case, was primarily due to contributing factors originating in software. The crash of the USA visa website was a result of officials deciding to 'modernize' the website, which led to issues such as the inability to process applications, error messages, and ongoing website maintenance causing travelers to miss flights [Article 91793].
Objective (Malicious/Non-malicious) non-malicious (a) The software failure incident in this case was non-malicious. The failure occurred due to a system update intended to modernize the USA visa website, which led to the website crashing and causing issues for travelers trying to process their ESTA applications [91793]. The update was aimed at improving user experience by allowing travelers to save their applications halfway through and enabling Poles to apply after Poland was included in the scheme. However, the unintended consequence of the update was the website crashing, preventing travelers from completing their applications on time. The failure was not caused by any malicious intent but rather by the impact of the system update on the functionality of the website.
Intent (Poor/Accidental Decisions) poor_decisions (a) The intent of the software failure incident was due to poor decisions made by officials to 'modernise' the USA visa website. The update was intended to allow travellers to save their applications halfway through and to allow Poles to apply after Poland was included in the scheme. However, this update led to the website crashing, leaving travellers unable to process their applications, resulting in missed flights and significant inconvenience for many travelers [91793].
Capability (Incompetence/Accidental) development_incompetence, accidental (a) The software failure incident occurred due to development incompetence. The USA visa website crashed after officials decided to 'modernise' the website, which led to IT problems and travelers being unable to process their applications [Article 91793]. The update was intended to allow travelers to save their applications halfway through and to allow Poles to apply after Poland was included in the scheme. However, the update resulted in crashes and errors on the website, causing significant inconvenience to travelers, some of whom missed their flights as a result. (b) The software failure incident can also be attributed to accidental factors. The update introduced on the USA visa website was intended to improve user experience by allowing travelers to save their applications halfway through and to accommodate applicants from newly included countries like Poland [Article 91793]. However, the unintended consequence of this update was the crashing of the website, leading to errors and preventing travelers from processing their applications in a timely manner. This accidental outcome resulted in missed flights, financial losses, and emotional distress for affected individuals.
Duration temporary (a) The software failure incident in this case was temporary. The USA visa website crashed after a system update intended to modernize the website, causing travelers to be unable to process their ESTA applications. The issue arose after the update, and travelers were met with error messages and were unable to complete their applications in time, leading to missed flights and disruptions in travel plans. The incident was eventually addressed by US Customs and Border Protection, restoring functionality to the ESTA website [Article 91793].
Behaviour crash, omission, timing, other (a) crash: The software failure incident in this case was a crash. The USA visa website crashed after officials decided to 'modernise' the website, leaving travellers unable to process their applications. Passengers were stopped from boarding flights to the United States due to the crash [Article 91793]. (b) omission: The software failure incident also involved omission. Passengers missed flights because the ESTA didn't come through in time, leading to wasted money on non-refundable flights and accommodation. The failure to process the ESTA applications in a timely manner resulted in omission of the system's intended function [Article 91793]. (c) timing: The timing of the software failure incident was also a significant factor. Passengers were unable to get their ESTA in time, causing them to miss their flights. The system was performing its intended functions but at the wrong time, leading to missed opportunities for travellers [Article 91793]. (d) value: There is no specific mention of the software failure incident being related to the system performing its intended functions incorrectly (value). (e) byzantine: The software failure incident did not exhibit behavior related to the system behaving erroneously with inconsistent responses and interactions (byzantine). (f) other: The other behavior observed in this software failure incident was frustration and inconvenience caused to travellers due to the crash. Passengers faced challenges in renewing their ESTA, missed flights, wasted money on non-refundable bookings, and had to rebook their travel plans, leading to emotional distress and financial losses [Article 91793].

IoT System Layer

Layer Option Rationale
Perception None None
Communication None None
Application None None

Other Details

Category Option Rationale
Consequence property, delay Delay: The software failure incident caused significant delays and disruptions for travelers trying to apply for ESTA permits to visit the United States. Passengers missed flights, had to rebook flights for later dates, and incurred additional costs for changing travel plans due to the crash of the USA visa website after a system update [Article 91793].
Domain transportation, finance, government (a) The failed system was related to the transportation industry as it affected British passengers trying to board flights to the United States due to the crash of the USA visa website [91793]. (h) The incident also impacted the finance industry indirectly as travelers like Beverley Coult had to pay £150 for changing their flights due to the system failure [91793]. (l) Additionally, the government sector was involved as the USA's ESTA website, which crashed, is a part of the U.S. Customs and Border Protection's Electronic System for Travel Authorization [91793].

Sources

Back to List