Clustered Mailbox Server will not come online on one node of cluster


In this case, we had an Exchange Server 2007 Continuous Cluster Replication (CCR) setup, and if we either failed on the first node, or attempted to move the Clustered Mailbox Server (CMS) to the other node, the Information Store resource and all of the resources for the Storage Groups would hang in an Online Pending state on the second node.

Taking a look at the cluster log showed the following when the Exchange Information Store resource was attempting to come online on the destination node:

00000bdc.00001050::2009/11/19-17:20:26.478 WARN Microsoft Exchange Information Store <Exchange Information Store Instance (CMSNAME)>: [EXRES] DwWaitServiceStateChangeToTargetState: service 'MSExchangeIS' current state is: running.
00000bdc.00001050::2009/11/19-17:20:26.478 WARN Microsoft Exchange Information Store <Exchange Information Store Instance (CMSNAME)>: [EXRES] DwStartService: service 'MSExchangeIS' was started.

…………………………………..

00000bdc.00001580::2009/11/19-18:48:53.859 WARN Microsoft Exchange Information Store <Exchange Information Store Instance (CMSNAME)>: [EXRES] Reporting state 129 (OnlinePending).
00000bdc.00001580::2009/11/19-18:50:23.858 WARN Microsoft Exchange Information Store <Exchange Information Store Instance (CMSNAME)>: [EXRES] Reporting state 129 (OnlinePending).
…………………………………..

00000bdc.00001580::2009/11/19-18:57:53.856 WARN Microsoft Exchange Information Store <Exchange Information Store Instance (CMSNAME)>: [EXRES] Reporting state 129 (OnlinePending).
00000888.00000898::2009/11/19-18:58:16.371 WARN [INIT] The cluster service is shutting down.
00000888.00000898::2009/11/19-18:58:16.371 WARN [FM] Shutdown: Failover Manager requested to shutdown groups.
00000bdc.00001050::2009/11/19-18:58:16.528 WARN Microsoft Exchange Information Store <Exchange Information Store Instance (CMSNAME)>: [EXRES] EcStoreStartServer() returned 1
00000bdc.00001050::2009/11/19-18:58:16.528 ERR  Microsoft Exchange Information Store <Exchange Information Store Instance (CMSNAME)>: [EXRES] EventLogging: Exchange Information Store Instance (TNS-MAIL-AP): The RPC call to the service to bring the cluster resource online failed.  Error Code: 1.

I searched through our vast repository of previous cases for Exchange and cluster and OnlinePending and resource online failed, and found a prior case with the same symptoms. In that case, and it turned out to be the same cause in ours, the McAfee Groupshield service was stuck in a ‘Starting’ state. When Exchange-aware Antivirus in enabled on a server, if the anitvirus does not start, it will block the Information Store from being available. In the case of our Exchange cluster, this in turn kept the Information Store resource from reaching an ‘Online’ state.

To confirm this was the problem, we set the following registry value to 0 (zero) on the node we were failing on:

HKEY_LOCAL_MACHINE\System\CurrentControlSet\Services\MSExchangeIS\VirusScan\Enabled

This tells the Exchange Information Store to disable virus scanning, effectively disabling Groupshield in this case. Once we did this, the Information Store resource and storage group resources were able to come online on this node.

The customer then worked with Groupshield to fix the problem that was causing Groupshield not to come online successfully.


Skip to main content