Planning a high availability model Validate and understanding support policies Understanding Live...

56
Failover Clustering and Hyper-V: Planning Your Highly-Available Virtualization Environment VIR304 Elden Christensen Principal PM Lead Microsoft Symon Perriman Technical Evangelist Microsoft

Transcript of Planning a high availability model Validate and understanding support policies Understanding Live...

Page 1: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Failover Clustering and Hyper-V: Planning Your Highly-Available Virtualization EnvironmentVIR304

Elden ChristensenPrincipal PM LeadMicrosoft

Symon PerrimanTechnical EvangelistMicrosoft

Page 2: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Agenda

Planning a high availability model

Validate and understanding support policies

Understanding Live Migration

Deployment Planning

VM Failover Policies

Datacenter Manageability

Page 3: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Planning your Availability Model

Page 4: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Failover Clustering & Hyper-V for Availability

Foundation of your Private CloudVM mobilityIncrease VM Availability

Hardware health detectionHost OS health detectionVM health detectionApplication/service health detectionAutomatic recovery

Deployment flexibilityResilient to planned and unplanned downtime

Page 5: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Host vs. Guest Clustering

Cluster service runs inside (physical) host and manages VMsVMs move between cluster nodes

Guest Clustering

Cluster service runs inside a VMApps and services inside the VM are managed by the clusterApps move between clustered VMs

SAN

Cluster

Host Clustering

iSCSI

Cluster

Page 6: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

What Host Clustering Delivers

Avoids a single point of failure when consolidating“Do not put all your eggs in 1 basket”

Survive Host CrashesVMs restarted on another node

Restart VM CrashesVM OS restarted on same node

Recover VM HangsVM OS restarted on same node

Zero Downtime Maintenance & PatchingLive migrate VMs to other hosts

Mobility & Load DistributionLive migrate VMs to different servers to load balance

Page 7: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

What Guest Clustering Delivers

iSCSI

Cluster

• Application or service within VM crashes or hangs then moves to another VM

Application Health Monitoring

• Apps or services moves to another VM for maintenance or patching of guest OS

Application Mobility

Page 8: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Guest vs. Host: Health Detection

Fault Host Cluster Guest Cluster

Host Hardware Failure

Parent Partition Failure

VM Failure

Guest OS Failure

Application Failure

Page 9: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Host vs. Guest Clustering Summary

Host Clustering

• VM’s move from server to server

• Zero downtime to move a VM

• Works with any application or guest OS

Guest Clustering

• Apps move from VM to VM• Traditionally downtime

when moving applications• Requires “cluster aware”

applications running on Windows Server

• Requires double the resources – 2 VM’s for single workload

Page 10: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Combining Host & Guest Clustering

Best of both worlds for flexibility and protectionVM high-availability & mobility between physical nodesApplication & service high-availability & mobility between VMs

Cluster-on-a-cluster does increase complexity

CLUSTER CLUSTERiSCSI

Guest Cluster

SAN SAN

Page 11: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Mixing Physical and Virtual in the Same Cluster

Mixing physical & virtual nodes is supportedMust still pass “Validate”

Requires iSCSI storageScenarios:

Spare node is a VM in a farmConsolidated Spare

iSCSI

Page 12: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Guest Clustering on top of VMware

VMware Support Policy

Guest Clustering on top of VMware has additional considerations See VMware’s guide: http://www.vmware.com/pdf/vsphere4/r41/vsp_41_mscs.pdf

Microsoft Support Policy

Solution must still be logo’d or pass ValidateSee this blog for additional information: http://blogs.msdn.com/b/clustering/archive/2010/07/27/10042799.aspx   ESX 3.5 or 

earliervSphere 4.0 vSphere 4.1

Windows NT Server 4.0

No No No

Windows 2000 Server

No No No

Windows Server 2003

No Yes (limited hardware configurations)

No

Windows Server 2008

No Yes (See VMware restrictions)

Yes (See VMware restrictions)

Windows Server 2008 R2

No Yes (See VMware restrictions)

Yes (See VMware restrictions)

Page 13: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Planning for Workloads in a Guest Cluster

SQLHost and guest clustering supported for SQL 2005 and 2008Supports guest live and quick migrationSupport policy: http://support.microsoft.com/?id=956893

File ServerFully SupportedLive migration is a great solution for moving the file server to a different physical system without breaking client TCP/IP connections

ExchangeExchange 2007 SP1 HA solutions are supported for guest clustering Does not support mixing Guest and Host ClusteringDoes not support mobile VMsSupport Policy: http://technet.microsoft.com/en-us/library/cc794548.aspx

Other Server Products: http://support.microsoft.com/kb/957006

Page 14: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

demo

Configuring a highly available VM

Page 15: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Validate and Support Policies

Page 16: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Failover Cluster Support PolicyFlexible cluster hardware support policyYou can use any hardware configuration if1. Each component has a Windows Server 2008 R2 logo

Servers, Storage, HBAs, MPIO, DSMs, etc…

2. It passes Validate

It’s that simple!Commodity hardware… no special list of proprietary hardwareConnect your Windows Server 2008 R2 logo’d hardwarePass every test in Validate

It is now supported!

If you make a change, just re-run Validate

Details: http://go.microsoft.com/fwlink/?LinkID=119949

Page 17: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Validating a Cluster

Functional test tool built into the product that verifies interoperabilityRun during configuration or after deployment

Best practices analyzed if run on configured cluster

Series of end-to-end tests on all cluster components

Configuration info for support and documentationNetworking issuesTroubleshoot in-production clusters

More information http://go.microsoft.com/fwlink/?LinkID=119949

Page 18: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

demo

Cluster Validation

Page 19: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

PowerShell Support

Improved ManageabilityRun Validate

Easily Create Clusters & HA Roles

Generate Dependency Reports

Built-in Help (Get-Help Cluster)As well as online here

Hyper-V IntegrationReplaces cluster.exe as the CLI tool

Action CMDlet

Make VMs Highly-Available Add-ClusterVirtualMachineRole

Quick Migration Move-ClusterGroup

Live Migration Move-ClusterVirtualMachineRole

Add a Disk to CSV Add-ClusterSharedVolume

Move CSV Disk Move-ClusterSharedVolume

Update VM Configuration Update-ClusterVirtualMachineConfiguration

Page 20: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Understanding Live Migration

Page 21: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Live Migration - Initiate Migration

Client accessing VM

VHD

Live Migrate this VM to another

physical machine

SAN

IT Admin initiates a Live Migration to move a VM from one host to another

Page 22: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Live Migration - Memory Copy: Full Copy

Memory content is copied to new server

VHD

VM pre-staged

SAN

The first initial copy is of all in memory content

Page 23: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Live Migration - Memory Copy: Dirty Pages

VHD

Pages are being dirtied

Client continues accessing VM

SAN

Client continues to access VM, which results in memory being modified

Page 24: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Live Migration - Memory Copy: Incremental

VHD

Smaller set of changes

Recopy of changes

SANHyper-V tracks changed data, and re-copies over incremental changesSubsequent passes get faster as data set is smaller

Page 25: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Live Migration - Final Transition

VHD

Partition State copied

VM Paused

SAN

Window is very small and within TCP connection timeout

Page 26: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Live Migration - Post-Transition: Clean-up

VHD

Old VM deleted once migration is verified

successful

Client directed to new host

SANARP issued to have routing devices update their tablesSince session state is maintained, no reconnections necessary

Page 27: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Deployment Planning

Page 28: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Host OS is Free

Choosing a Host OS SKU

Hyper-V Server

Windows Server

Enterprise

Windows Server

Datacenter

Licensed per Server Licensed per CPU

No guest OS licenses 4 guest OS licenses Unlimited guest licenses

All include Hyper-V, 16 node Failover Clustering, and CSV

Page 29: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Planning Server Hardware

Ensure processor compatibility for Live MigrationProcessors should be from the same manufacturer in all nodes

Cannot mix Intel and AMD in the same cluster

Virtual Machine Migration Test Wizard can be used to verify compatibility

http://archive.msdn.microsoft.com/VMMTestWizard

‘Processor Compatibility Mode’ can also be used if you have processors not compatible with each other for live migrating (all Intel or all AMD)

Page 30: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Planning Network Configuration

Minimum is 2 networks:Internal & Live MigrationPublic & VM Guest Management

Best SolutionPublic network for client access to VMsInternal network for intra-cluster communication & CSVHyper-V: Live MigrationHyper-V: VM Guest ManagementStorage: iSCSI SAN network

Use ‘Network Prioritization’ to configure your networks

Page 31: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Guest vs. Host: Storage Planning

Storage Host Cluster Guest Cluster

Fibre Channel (FC)

Serial Attached SCSI (SAS)

Fibre Channel over Ethernet (FCoE)

iSCSI

3rd party replication can also be used

Page 32: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Planning Virtual Machine Density

1,000 VMs per Cluster supportedDeploy them all across any number of nodes

Recommended to allocate spare resources for 1 node failure

384 VM per node limit512 VP per node limit

12:1 virtual processors per logical(# processors) * (# cores) * (# threads per core) * 12 = total

Up to 16 nodes in a clusterPlanning Considerations:

Hardware LimitsHyper-V LimitsReserve Capacity

Page 33: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Cluster Shared Volumes (CSV)

Allows multiple servers simultaneous access to a common NTFS volumeSimplifies storage managementIncreases resiliency

SAN

Page 34: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Planning Number of VMs per CSV

There is no maximum number of VMs on a CSV volumePerformance considerations of the storage array

Large number of servers, all hitting 1 LUNTalk to your storage vendor for their guidance

How many IOPS can your storage array handle?

Page 35: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Active Directory Planning

All nodes must be members of a domainNodes must be in the same domainNeed an accessible writable DCDCs can be run on nodes, but use 2+ nodes (KB 281662)Do not virtualize all domain controllers

DC needed for authentication and starting cluster serviceLeave at least 1 domain controller on bare metal

Page 36: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

VM Failover Policies

Page 37: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Keeping VMs off the Same Host

Scenarios:Keep all VMs in a Guest Cluster off the same hostKeep all domain controllers off the same hostKeep tenets separated

AntiAffinityClassNames Groups with same AntiAffinityClassNames value try to avoid being hosted on same nodehttp://msdn.microsoft.com/en-us/library/aa369651(VS.85).aspx

Cluster

Page 38: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Disabling Failover for Low Priority VMs‘Auto Start’ setting configures if a VM should be automatically started on failover

Group propertyDisabling mark groups as lower priorityEnabled by default

Disabled VMs needs manual restart to recover after a crash

Page 39: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Starting VMs on Preferred Hosts‘Persistent Mode’ will attempt to place VMs back on the last node they were hosted on during start

Only takes affect when complete cluster is started upPrevents overloading the first nodes that startup with large numbers of VMs

Better VM distribution after cold startEnabled by default for VM groups

Page 40: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

CLUSTER

Enabling VM Health Monitoring

Enable VM heartbeat settingRequires Integration Components (ICs) installed in VM

Health check for VM OS from hostUser-Mode HangsSystem Crashes

SAN

Page 41: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Configuring Thresholds for Guest Clusters

Configure heartbeat thresholds when leveraging Guest Clustering

Tolerance for network responsiveness during live migration

SameSubnetThreshold & SameSubnetDelay

SameSubnetDelay (default = 1 second)Frequency heartbeats are sent

SameSubnetThreshold (default = 5 heartbeats)Missed heartbeats before an interface is considered down

Page 42: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Dynamic Memory

New feature in Windows Server 2008 R2 Service Pack 1Upgrade the Guest Integration Components

Higher VM density across all nodesMemory allocated to VMs is dynamically adjusted in real time

“Ballooning” makes memory pages non-accessible to the VM, until they are neededDoes not impact Task Scheduler or other memory-monitoring utilities

Memory Priority Value is configurable per VMHigher priority for those with higher performance requirements

Ensure you have enough free memory on other nodes for failure recovery

Page 43: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Root Memory ReserveRoot memory reserve behavior changed in Service Pack 1Windows Server 2008 R2 RTM

The cluster property, RootMemoryReserved, watches host memory reserve level during VM startup

Prevent crashes and failovers if too much memory is being committed during VM startup

Sets the Hyper-V registry setting, RootMemoryReserve (no ‘d’) across all nodes

Cluster default: 512 MB, max: 4 GB

PS > (get-cluster <cluster name>).RootMemoryReserved=1024

Windows Server 2008 R2 Service Pack 1Hyper-V will use a new memory reservation setting for the parent partition, MemoryReserve

Based on “memory pressure” algorithm

Admin can also configure a static reserve value

The cluster nodes will use this new value for the parent partition

Configuring RootMemoryReserved in the cluster does nothing

Page 44: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Refreshing the VM Configuration

Make configuration changes through Failover Cluster Manager or SCVMM

Hyper-V Manager is not cluster aware, changes will be lost

“Refresh virtual machine configuration”Looks for any changes to VM or Cluster configuration

PS > Update- ClusterVirtualMachineConfiguration

StorageEnsures VM on correct CSV disk with updated paths

NetworkChecks live migration compatibility

Several other checks performed

Page 45: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Datacenter Manageability

Page 46: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

SCVMM: Quick Storage Migration

Ability to migrate VM storage to new locationMinimizes downtime during transferSimple single-click operation

Page 47: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

SCVMM: Intelligent Placement

Capacity planning improves resource utilizationSpreads VMs across nodes“Star-Rated” results for easy decision makingCustomizable algorithm

Page 48: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

New Cluster Features in SCVMM 2012

Configure SCVMM for highly available on a Failover ClusterSimplified setup & deployment

Cluster setup / deployment from bare metal

Easy automated maintenance Cluster patch orchestration

Dynamic OptimizationLoad balance VMs across the cluster

Power OptimizationTurns off nodes when they are under utilized

Page 49: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Summary

High availability and virtualization go hand-in-handHyper-V and Failover Clustering are tightly integratedHost clustering enables you to achieve zero downtime for planned maintenanceHighly scalable to large numbers of VMs and datacenter management with System Center Virtual Machine Manager

Page 50: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Passion for High-Availability?

Are You Up For a Challenge?

Become a Cluster MVP!

Contact: [email protected]

Page 51: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Related Failover Cluster ContentBreakout Sessions

VIR303 – Failover Clustering and Hyper-V: Multi-Site Disaster Recovery VIR304 – Failover Clustering and Hyper-V: Planning Your Highly-Available Virtualization EnvironmentWSV203 – Failover Clustering 101: Get Highly Available Now! WSV308 – Failover Clustering in 2008 R2: What's New in the Top High-Availability SolutionWSV309 – Failover Clustering: Pro Troubleshooting in Windows Server 2008 R2SIM357 – Microsoft System Center Virtual Machine Manager 2012: Server Fabric Lifecycle, Part 3 - Cluster Creation, Update ManagementDBI302 – Microsoft SQL Server Code-Name "Denali" AlwaysOn Series, Part 1: Introducing the Next Generation High Availability Solution DBI404 – Microsoft SQL Server Code-Name "Denali" AlwaysOn Series, Part 2: Building a Mission-Critical High Availability Solution Using AlwaysOn EXL312 – Designing Microsoft Exchange 2010 Mailbox High Availability for Failure Domains

Interactive SessionsWSV373-INT – Failover Clustering Pro Workshop: Everything You Wanted to Know, But Were Afraid to Ask!VIR471-INT – Virtualization FAQ, Tips and Tricks

Hands-on LabsWSV273-HOL – Failover Clustering Introduction with Windows Server 2008 R2DBI393-HOL – Microsoft SQL Server 2008 R2 - Implementing Clustering

Visit the Cluster

Team in the TLC!

We will be there every

hour it is open!

Page 53: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Resources

www.microsoft.com/teched

Sessions On-Demand & Community Microsoft Certification & Training Resources

Resources for IT Professionals Resources for Developers

www.microsoft.com/learning

http://microsoft.com/technet   http://microsoft.com/msdn

Learning

http://northamerica.msteched.com

Connect. Share. Discuss.

Page 54: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

Complete an evaluation on CommNet and enter to win!

Page 55: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.

© 2011 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to

be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS

PRESENTATION.

Page 56: Planning a high availability model Validate and understanding support policies Understanding Live Migration Deployment Planning VM Failover Policies Datacenter.