Application Management – Clustering SQL Guest Instances in R2 – Part 2

In a previous post, I described how one might use Service Templates to leverage new features in System Center Virtual Machine Manager 2012 R2 to build a guest cluster with a Failover Cluster Instance installation of SQL Server.

In this post, I’ll be providing a downloadable sample that you can tailor to your environment and discussing resource preparation, template import, configuration, and deployment. With the information already provided in my previous post, you should have a clear understanding of deploying a SQL Server Failover Cluster Instance install in your environment using Service Templates. If you would like to use this template to deploy new SQL Server clusters as part of Bruno Saille’s SQL Server Self-Service Kit, some instructions are posted here.

The sample download is available here:

 Download


 

The process of deploying this template involves a few steps:

 


Media Preparation

The downloadable sample does not contain the media necessary for deployment. Before deploying, you’ll need two disk images to be present in you library.

Operating System Preparation

This template expects the Operating System of the VM to be Windows Server 2012 R2 Datacenter.

Note: The expected image is a full installation. If you instead choose to use a Server Core installation, check the SQL Server documentation for components that are not supported on Server Core. Additionally, you will need to edit the template to remove the Failover Clustering Management Tools (leaving only Failover Clustering and Failover Cluster Module for Windows Powershell). Additionally, you may need to pre-install the .NET 3.5 features on the disk image.

If you already have a disk image in your library that meets this requirement, you can use it. If not, you will need to create one. The basic process of creating such an image is reasonably straight-forward.

  1. On a Hyper-V host, create a new virtual machine.
  2. Start the virtual machine, and install the operating system.
  3. After the installation has completed, open an elevated shell.
  4. In the shell, run $env:windir\System32\Sysprep\sysprep.exe /mode:VM /oobe /generalize /shutdown
  5. Once the virtual machine has shutdown, you can copy the VHDX disk image into your VMM library and use it to deploy Service Templates.

SQL Server Preparation

The installation scripts included with the sample reference D:\Setup.exe. The expectation is to include a VHDX file containing the SQL Server installation media. First, create an empty VHDX image of approximately 10GB. This can easily be done on a machine containing the Hyper-V Module for Windows PowerShell:

001 002 003 004 005 006 007 008 009 $driveLetter = "V"; New-VHD -Path SQL.vhdx -SizeBytes 10GB -Dynamic; Mount-VHD SQL.vhdx; $disk = Get-Disk | ?{ $_.FriendlyName -eq "Microsoft Virtual Disk" }; Initialize-Disk -Number $disk.Number -PartitionStyle MBR; New-Partition $disk.Number -UseMaximumSize -DriveLetter $driveLetter; Format-Volume -DriveLetter $driveLetter -FileSystem NTFS -Confirm:$false; copy -Recurse C:\Setup\* V:; Dismount-VHD SQL.vhdx;

This is assuming the C:\Setup path contains SQL Server installation media (with Setup.exe), your V: drive is unused, and you don’t have any other disk images mounted. You may need to tailor this to your environment and may choose entirely different means to produce the necessary disk image (diskpart, diskmgmt.msc, etc). As long as the Setup.exe is at the base directory of the disk image, it should function as needed. After this, you can copy the SQL.vhdx disk image to your VMM library share.

 


Account Preparation

The scripts included in the sample will need to run as an account with particular credentials. Two accounts will be needed: a domain account and a local administrator account.

Local Administrator

The local administrator account is necessary for preparation of the Operating System and serves as a means of setting the local administrator password. You may choose to create this account from within the VMM UI, or from PowerShell. In PowerShell, you can create an account by running:

001 New-SCRunAsAccount -Credential $(Get-Credential) –Name "Local Admin";

After issuing this command, you’ll be prompted to enter credentials.

image

The username should just be ‘administrator’ and the password will be the password for the local administrator account on the VMs, which has not been set at deployment time. You can also create the credential object from the shell:

001 002 003 004 $user = 'administrator'; $pass = ConvertTo-SecureString 'password!!123' -AsPlainText -Force; $cred = New-Object System.Management.Automation.PSCredential($user,$pass); New-SCRunAsAccount -Credential $cred -Name "Local Admin";

Keep in mind that running this from the shell will keep password in plain text in the shell history.

Domain Admin

For simplicity, this sample uses a Domain Administrator account (a domain account that is a member of the Domain Admins Security Group in AD). If you wish to use another domain account with lesser privileges, you’ll need to add a script that gives local administrator permissions to the VMs (this is not included in the sample). You can follow the instructions above or create the Run As account from within the VMM UI.

image

In the Settings Pane, click the Create Run As Account button on the Home ribbon to open a dialog box to guide you through account creation.

image

Simply enter the required information into the dialog box to create the account. If this domain account belongs to a domain that is in an isolated VM Network to which your VMM server does not belong, you’ll want to uncheck the Validate domain credentials checkbox.

Once the accounts have been created, you can use them to deploy the sample.

image

 


Sample Preparation

The sample download is available here.

Extract the Sample

Once downloaded, extract the files to a location reachable by your VMM server. The sample contains the following:

Filename

Description

README.docx Instructions for using the sample content
Cluster\Cluster Demo.<TYPE>.xml Sample template for deploying a Failover Cluster Instance of SQL Server
Cluster\Cluster Demo.<TYPE>\1.vhdx Empty disk image used by cluster
Cluster\Cluster Demo.<TYPE>\2.vhdx Empty disk image used by cluster
Cluster\Cluster Demo.<TYPE>\GuestClusterDemo.cr Custom resource folder

The <TYPE> in the table above is either DHCP or Static IPs. You should choose a deployment method appropriate for your environment. Note that deploying a the static IP version of the template will require the appropriate static IP pools to be present for your VM Network. The provided sample is built with IPv4 and would require some changes to support IPv6.

Move the Content to the VMM Library Share

The disk images and custom resource folder should be copied to the VMM library share. Once the library has been refreshed, the resources will be available for deployment. The default refresh interval is hourly, but a refresh can be run manually by executing Get-LibraryShare | Refresh-LibraryShare on the VMM server.

The GuestClusterDemo.cr folder contains the following files:

Filename

Description

*.cmd Start the PowerShell counterparts
Cluster\first.ps1 Allocation of disks and creation of cluster
Cluster\rest.ps1 Addition of subsequent cluster nodes
SQL\first.ps1 Install SQL Server as a Failover Cluster Instance and add firewall openings
SQL\rest.ps1 Add node to SQL Server Failover Cluster Instance and add firewall openings
SQL\first\SQL.ini Base configuration for first SQL Server Failover Cluster Instance node
SQL\rest\SQL.ini Base configuration for additional SQL Server Failover Cluster Instance nodes

For more information about the context of these scripts, you can refer to my previous post.

Template Import

Once the resources are in place on the VMM Library Share, you’re ready to import the template. The simplest import mechanism is to use the VMM UI. This will show the mapping of resources in an simplistic format.

VMM will map any components that can be automatically discovered and ask you to specify the others. The following table depicts the necessary components:

Name

Resource Type

Description

Hyper-V Capability Profile Default Hyper-V Capability Profile
GuestClusterDemo.cr Custom Resource GuestClusterDemo.cr folder copied into Library
SQL.vhdx Disk Image SQL Server Installation Media (Not included)
Server2012R2.vhdx Disk Image Operating System disk, sysprep’d (Not included)
2.vhdx Disk Image Blank disk, cluster volume for SQL Server installation
1.vhdx Disk Image Blank disk, cluster witness
Domain Admin Run As Account Domain account with privilege to create cluster to AD
Local Admin Run As Account Local admin account (username ‘administrator’)

To import the template, click the Import Template button on the Home ribbon in the Library pane of VMM. This will open a wizard to guide you through the process. Click the Browse… button to locate the template file that fits your environment (either DHCP or Static IP). Ensure the Import sensitive template settings box is checked and click Next to continue.

 

image

On the next page scroll through the reference mappings and populate any missing data by clicking on the pencil next to any component that lists None as its mapping:

image

After populating all reference mappings, click Next, and then Import on the next page.

Note: In this post, I’m importing the Static IP template. There are a few additional fields in the Static IP template and I will specify where the DHCP template differs.

After import, right-click the template and select Properties. On the Service Settings page, you’ll see a list of service settings. You can double-click any setting to see its properties, as listed in the table below:

Name

Default Value

Description

Cluster Name DEMOCLUSTER DNS Name of Windows Failover Cluster Instance
*Cluster IP   IP address of Cluster Name
Domain   Domain VMs will join
VM Network   VM Network to attach VM’s NIC to
Product Key   Windows Server Product Key
Cluster Instance Name SQLDEMOCLUSTER DNS Name of SQL Server Failover Cluster Instance
SQL Instance Name DEMO SQL Server Instance ID
*SQL Cluster IP   IP address of Cluster Instance Name
*SQL Cluster CIDR   Subnet mask for SQL Cluster IP
SQL User Name   Domain account for SQL Server User
SQL User Password   Password for SQL User Name

* = Setting only exists in Static IP template

All settings are mandatory, and the SQL User Password setting is encrypted.

Feel free to open the Service Template Designer (right-click the template and choose Open Designer) to see its configuration:

 

image

Now, you’re ready to deploy!

 


Deployment

You can deploy the template by either clicking the Configure Deployment button in the Service Template Designer or right-clicking the template and selecting Configure Deployment from the menu:

image

This will bring up a window to supply some information:

image

Provide a name for the Service Instance and click the ellipses button to select a VM Network, and then VMM will prepare an instance for deployment. In the Deploy Service window, add any necessary values for service settings, and click Refresh Preview to let VMM run its placement algorithms.

image

When you’re ready, click the Deploy Service button. The deployment may take a significant amount of time, depending on your environment.

Results

After the template has been deployed, you can switch to the VMs and Services pane in VMM, select the appropriate host group, and select Services from the Home ribbon to view the deployed service:

image

You can open a console to the first VM and view its logs in C:\ScriptLogs. You’ll see two sets of files: one for cluster setup and one for SQL Server setup. The stdout will be captured in *-out.txt and the stderr will be in *-err.txt. The error output should be empty on a successful deployment.

By opening the Failover Cluster Manager, you should see your cluster with a role for the SQL Server Failover Cluster Instance:

image

Both of the deployed VMs should appear in the Nodes view:

image

You should also see two disks in under Storage –> Disks: one cluster witness and one for the SQL Server instance:

image

Going back to the Roles view, you can view the Resources tab at the bottom (after selecting the SQL Server role) and see its assigned resources:

image

If you feel the need to test failover, you might unceremoniously turn off the active SQL Server node from Hyper-V:

image

Looking at the Failover Cluster Manager on the VM that wasn’t turned off, you should see the Failover Cluster Instance has moved to the other node:

image

The cluster log will also show a critical error:

image

And the status of the service is visible within VMM:

image

I deployed my VMs to a 3-node cluster, so I might want to scale-out the service to include another VM:

image

Note that this sample does not include any scripts for scaling-in the service. If you wished to add such scripts, you could add two scripts: one that removes a node from the cluster and one that deletes the cluster. The former would attach to Deletion: VMs Before Last and the latter to Deletion: Last VM.

That’s all for this sample. Enjoy!

 


Troubleshooting Tips

There are a few issues you might hit during deployment, especially as you start to customize this sample. I’ve outlined a few below to assist you.

Settings

You’ll want to be certain of the feasibility of the default values. For instance, if a machine already exists in your environment with the name SQLDEMOCLUSTER, the deployment will fail. VMM does not provide control logic to validate service settings, so you’ll need to do this yourself.

IP Issues in Placement

If you are trying to deploy the Static IP template, you will need to have the proper static IP pools configured in your network. If the network to which you’re deploying does not contain sufficient IP addresses, you may see placement errors:

image

Ensure that the VM network has sufficient IP addresses (if you’re using IPv6 and IPv4, you’ll need a static IP pool for each). If you’re deploying to an isolated VM network, ensure that you have the static IP pools for both the logical network and the VM network (in a dual-stack scenario, this will mean four total static IP pools).

Failover Issues

It’s important to mention that SQL Server does not support the AddNode action in parallel. This means that if you want to have a four-node cluster, you will need to serialize your deployment. That is, first deploy the two-node cluster, and then scale-out the VMs one at a time. If all four VMs are deployed in a single tier, VMM will deploy the first VM, and then deploy the other three in parallel, which will deploy successfully, but may cause the Failover Cluster Instance to inexplicably be unable to failover to some nodes.

If you wish to extend this template to configure a SQL Server Availability Group on top of the SQL Server Failover Cluster Instance, you should be aware that a SQL Server Failover Cluster Instance will not allow automatic failover of the corresponding SQL Server Availability Group. This can be a great way to provide disaster recovery, but you should be aware that moving the Availability Group to a secondary replica in the event of failure will require manual intervention.

General Troubleshooting

If deployment fails, there are a few steps you can follow to find the root cause. Troubleshooting of placement issues or file copy issues is beyond the scope of this particular post, but I’ll try to lead you through the steps to investigate issues on the assumption that the VMM server was able to create the VM.

Failure Before the Scripts Begin

If the failure occurs before the application scripts begin, you should investigate the VMM logs. Since VMM uses an answer file to complete Sysprep configuration, that should be your first starting point. Generally, you can find the answer file at $env:windir\Panther\unattend.xml and you can view logs at $env:windir\Panther\UnattendGC\*.log. This could give you insight into the post-configuration steps that failed.

Script Failures

I’ve tried to construct the scripts in a way that is easy to troubleshoot. The C:\ScriptLogs directory should contain necessary information on why any particular step might have failed. The next step would be to run the script manually. When a VM is deployed, the payload for the script is copied to $env:SystemDrive\ProgramData\VirtualMachineManagerData\TempResources\<GUID> , so you can try running the scripts from there to troubleshoot.

DNS Resolution Errors

In some environments, there may be an AD or DNS propagation delay. If this is the case, you might want to adjust the scripts to wait for DNS resolution before proceeding with joining subsequent nodes to the cluster.