Incident: Microsoft Windows Phone 7 Update Crashes Samsung Handsets, Causes Outages

Published Date: 2011-02-23

Postmortem Analysis
Timeline 1. The software failure incident of Microsoft withdrawing the first Windows Phone 7 software update after it crashed handsets happened in February 2011 as per the article published on February 23, 2011 [4124].
System 1. Windows Phone 7 system 2. Samsung phones, specifically the Omnia 7 model [4124]
Responsible Organization 1. Microsoft [4124]
Impacted Organization 1. Owners of Samsung phones [4124]
Software Causes 1. The software causes of the failure incident were related to a technical issue with the Windows Phone update process that impacted a small number of phones, specifically Samsung phones [4124].
Non-software Causes 1. Hardware issues with Samsung phones, as the problems were confined to users of Samsung handsets [4124]. 2. Manual restart attempts by users after the update did not go through properly [4124].
Impacts 1. Handsets crashed and turned off or failed to start up after downloading the Windows Phone 7 update, particularly affecting Samsung phones [4124]. 2. Users experienced bricked phones, rendering them unusable even after hard reset attempts [4124]. 3. The incident led to a firestorm of criticism on blogs and forums, with users expressing frustration over their expensive but useless phones [4124]. 4. Microsoft had to withdraw the first Windows Phone 7 update temporarily, causing inconvenience to users and portraying the company in a bad light [4124].
Preventions 1. Thorough testing and quality assurance procedures before releasing the software update could have potentially prevented the incident [4124]. 2. Implementing a more robust and reliable update mechanism to ensure that updates are installed correctly without causing issues on the devices [4124]. 3. Providing clear and detailed instructions to users on how to safely install updates and what to do in case of any issues during the update process [4124].
Fixes 1. Microsoft issuing a new update to correct the issue [4124]
References 1. Microsoft's Answers site [4124] 2. Twitter posts [4124] 3. Microsoft's support forums [4124] 4. Official Windows Phone Support Twitter account [4124]

Software Taxonomy of Faults

Category Option Rationale
Recurring one_organization (a) The software failure incident having happened again at one_organization: The article mentions that Microsoft has had a history of glitches with its products, especially its virus-ridden browser Internet Explorer. This incident with the Windows Phone 7 update adds to the list of problems faced by Microsoft products [4124]. (b) The software failure incident having happened again at multiple_organization: The article does not provide specific information about similar incidents happening at other organizations or with their products and services. Therefore, it is unknown if this particular software failure incident has occurred at multiple organizations.
Phase (Design/Operation) design, operation (a) The software failure incident in Article 4124 occurred during the development phase, specifically related to system updates. Microsoft had to withdraw the first update for its Windows Phone 7 system after it crashed handsets. Users experienced problems when their update did not go through properly, and then they manually tried to restart the phone, leading to the phones becoming unusable [4124]. (b) The software failure incident in Article 4124 also involved operation-related factors. Users found their mobiles turned off or failed to start up at all after they had downloaded the update. This indicates that the failure was also influenced by the operation or misuse of the system by the users [4124].
Boundary (Internal/External) within_system (a) within_system: The software failure incident with the Windows Phone 7 update crashing handsets was primarily within the system. The article mentions that users experienced problems if their update did not go through properly and then they manually tried to restart the phone, indicating an issue within the software update process itself [4124]. Additionally, Microsoft acknowledged a technical issue with the Windows Phone update process that impacted a small number of phones, further pointing to an internal system issue [4124].
Nature (Human/Non-human) non-human_actions, human_actions (a) The software failure incident in Article 4124 was primarily due to non-human actions. The Windows Phone 7 software update crashed handsets, causing them to turn off or fail to start up after users downloaded the program. This issue affected Samsung phones specifically, leading to complaints of phones being 'bricked' or rendered unusable [4124]. (b) Human actions also played a role in exacerbating the software failure incident. Users who manually tried to restart their phones after the update did not go through properly experienced problems, contributing to the overall issue. Additionally, the delay in issuing a new update and the lack of basic functionalities in Windows phones, as pointed out by commentators, could be attributed to human decisions and actions within Microsoft [4124].
Dimension (Hardware/Software) hardware, software (a) The software failure incident occurring due to hardware: - The article reports that the first update for Windows Phone 7 caused crashes on Samsung handsets, leading to phones being turned off or failing to start up [4124]. - Users of Samsung phones experienced problems with the update, such as their phones becoming 'bricked' or unusable, even after hard reset [4124]. (b) The software failure incident occurring due to software: - The article mentions that Microsoft identified a technical issue with the Windows Phone update process that impacted a small number of phones, leading to the temporary withdrawal of the update for Samsung phones [4124]. - Users experienced problems if their update did not go through properly and then manually tried to restart the phone, indicating a software-related issue in the update process [4124].
Objective (Malicious/Non-malicious) non-malicious (a) The software failure incident reported in Article 4124 was non-malicious. The failure occurred as a result of a technical issue with the Windows Phone update process, impacting a small number of phones. Microsoft identified this issue and temporarily took down the latest software update for Samsung phones to correct the problem and redistribute the update [4124].
Intent (Poor/Accidental Decisions) poor_decisions (a) The software failure incident related to the withdrawal of the first Windows Phone 7 update by Microsoft was primarily due to poor_decisions. Microsoft had to pull the update after it crashed handsets, leading to phones being turned off or failing to start up after the program was downloaded. Users of Samsung phones were particularly affected by this issue, with many experiencing their phones becoming 'bricked' or unusable. The incident sparked criticism and complaints from users, highlighting the negative impact of the decision to release the faulty update [4124]. Additionally, commentators mentioned that Microsoft's entry into the smartphone market with Windows phones was perceived as too late, lacking basic functionalities users expected, which could be seen as a poor decision in terms of market timing and product features.
Capability (Incompetence/Accidental) development_incompetence, accidental (a) The software failure incident related to development incompetence is evident in the article as Microsoft had to withdraw the first update for its Windows Phone 7 system after it crashed handsets. Users of Samsung phones experienced issues where their mobiles turned off or failed to start up after downloading the program. This incident reflects a lack of professional competence in the development process, leading to a faulty update being released to users [4124]. (b) The accidental nature of the software failure incident is highlighted by the fact that users experienced problems if their update did not go through properly and then they manually tried to restart the phone. This accidental action exacerbated the issues faced by users, indicating that the failure was not intentional but rather a result of unintended actions taken by users after encountering initial problems with the update [4124].
Duration temporary (a) The software failure incident in this case was temporary. Microsoft had to withdraw the first Windows Phone 7 software update after it crashed handsets, specifically Samsung phones. Users found their mobiles turned off or failed to start up after downloading the program. Microsoft acknowledged the issue and stated that they would be issuing a new update to correct the problem [4124]. (b) The software failure incident was temporary as it impacted a specific set of circumstances, in this case, users of Samsung phones who downloaded the update. The issue was not a permanent failure affecting all devices but rather a specific problem related to the update process on Samsung handsets [4124].
Behaviour crash (a) crash: The software failure incident in Article 4124 can be categorized as a crash. The Windows Phone 7 update caused handsets, particularly Samsung phones, to crash, leading to the devices being turned off or failing to start up at all after the update was downloaded. Users experienced their phones becoming 'bricked,' rendering them unusable [4124].

IoT System Layer

Layer Option Rationale
Perception None None
Communication None None
Application None None

Other Details

Category Option Rationale
Consequence property, delay (a) death: People lost their lives due to the software failure (b) harm: People were physically harmed due to the software failure (c) basic: People's access to food or shelter was impacted because of the software failure (d) property: People's material goods, money, or data was impacted due to the software failure (e) delay: People had to postpone an activity due to the software failure (f) non-human: Non-human entities were impacted due to the software failure (g) no_consequence: There were no real observed consequences of the software failure (h) theoretical_consequence: There were potential consequences discussed of the software failure that did not occur (i) other: Was there consequence(s) of the software failure not described in the (a to h) options? What is the other consequence(s)? The consequence of the software failure incident: The incident involving the Windows Phone 7 software update withdrawal did not result in any reported deaths, physical harm, impact on basic needs, or harm to non-human entities. Users experienced inconvenience, frustration, and financial loss due to their phones becoming unusable or "bricked" after the update, leading to delays in communication and work [4124].
Domain information (a) The failed system in this incident was related to the information industry, specifically the smartphone market where Microsoft's Windows Phone 7 system update caused crashes on Samsung handsets, leading to phones being 'bricked' and rendered unusable [4124]. The incident highlighted Microsoft's struggle to compete in the smartphone market against rivals like Apple's iPhone and Google's Android, emphasizing the importance of software reliability in the information industry.

Sources

Back to List