Exchange Root Cause Analysis


Over two years ago (in March 2002), Product Support Services (for Exchange) began detailed analysis on Exchange support issues.  The goal of this process was (and, has been) to improve customer satisfaction with Exchange Server by improving the supportability of the product and related content.


 


The analysis of each support incident is done by root cause.  Meaning, if SMTP relay configuration causes mail flow problems, the root cause is SMTP relay configuration.  Doing the analysis by root cause allows the Exchange development team to take better action to improve the product. 


 


Moreover, this process allows us to analyze issues where no root cause was determined.  Understanding issues with no root cause helps us to identify opportunities for better logging and diagnostics capabilities.


 


Exchange 2003


During the development of Exchange 2003, PSS met regularly with the Exchange development team to discuss the top root causes of issues for Exchange 2000.  From those meetings, our conversations mainly focused on areas, like: Disaster Recovery, Mail flow problems and Deployment problems.  There were many other issues discussed, but too many to list here.


 


Understanding our top support issues is fairly easy.  Getting these issues turned into workitems for Exchange 2003, and prioritized accordingly, was another thing.  The good thing about our root cause analysis is that we have quantitative data showing how much our top support issues are costing Microsoft (and our partners) to support.  This helped solidify priority.


 


By the time Exchange 2003 released, we got 22 of the top 35 issues either fixed or partially fixed in the product (or, in Exchange 2000 SP3).  Some of those features or improvements to the product came in the form of:


 


·         Recovery Storage Group


o        KB 824126 - How to Use Recovery Storage Groups in Exchange Server 2003


o        Using Exchange Server 2003 Recovery Storage Groups


·         Internet Connection Wizard


o        Exchange Server 2003 Transport and Routing Guide


·         Deployment Tool


o        Exchange Deployment Tools


o        KB 812593 - Exchange Server 2003 Deployment Tools Overview


o        KB 822942 – Considerations When You Upgrade to Exchange Server 2003


o        Exchange 2003 Deployment Guide


·         Improved Logging/Diagnostics


o        Improved Transport logging


o        Improved Store Startup logging


·         Miscellaneous Improvements


o        KB 828070 - Exchange Server Mailbox Store Does Not Mount When the Mailbox Store Database Reaches the 16-Gig Limit


o        Most Common Exchange Tools Web Release


o        Zombie User Improvements


§         KB 814075 - XADM: Information Store Stops Responding After You Upgrade the Access Control List


§         KB 812963 - Using the Ignore Zombie Users Registry Key


o        Improved Content for Events Linked to the Web


§         KB 830183 - Overview of the Web Documentation That Is Linked to Exchange Server


 


Future Root Cause Analysis


We continue to do Root Cause Analysis on Exchange 2000 Server issues, and have added: Root Cause Analysis for Exchange Server 2003 and all Exchange high severity support incidents.  From this breadth of information, we can continue to build strong stories for supportability improvements in future versions of Exchange Server.


 


At the same time, we are collaborating with other product groups doing similar Root Cause Analysis (i.e. Windows, Outlook, etc…).  From this data, we can collectively identify cross-product areas for improvement.


 


- Jim Lucey


 

Comments (1)
  1. Shredder says:

    no wonder things keep getting better … :)

Comments are closed.

Skip to main content