Cluster – Win2008R2 cluster not functioning even after several reboot attempts

 

 

SYMPTOM
=========================
Excessive 1146 events are logged on cluster nodes

Event ID: 1146
Source: Microsoft-Windows-FailoverClustering
Description: The cluster resource host subsystem (RHS) stopped unexpectedly. An attempt will be made to restart it. This is usually due to a problem in a resource DLL. Please determine which resource DLL is causing the issue and report the problem to the resource vendor.

RESOLUTION
==========================
Please call Microsoft CSS to obtain this hot fix "The Resource Hosting Subsystem (Rhs.exe) process stops unexpectedly when you start a cluster resource in Windows Server 2008 R2"
https://support.microsoft.com/kb/978527/en-us

Troubleshooting methods
==========================
Please configure Windows Error Reporting so that we can capture some data when problem recurs
Configuring the "Windows Error Reporting" to generate crash dump when RHS.exe crashes, we need to manually create following register values:

Note: You may need to backup the key HKEY_LOCAL_MACHINESOFTWAREMicrosoftWindowsWindows Error Reporting or the whole registry key before this change.

Key Name: HKEY_LOCAL_MACHINESOFTWAREMicrosoftWindowsWindows Error ReportingLocalDumpsrhs.exe

  Name: DumpFolder
  Type: REG_EXPAND_SZ
  Data: C:rhs_dumps

  Name: DumpType
  Type: REG_DWORD
  Data: 0x2

  Name: DumpCount
  Type: REG_DWORD
  Data: 0xa

When problem happens again, the dump file will be generated in the directory C:rhs_dumps.
When the issue raises again, please collect the dump file and collect a new MPS report pack and send to me for further analysis.