How to Implement a Cluster of Clusters

Atiq Adamjee, Senior Architect, Novell, Inc.

Brad Rupp, Software Engineer, Novell, Inc.

© March 9, 2004 Novell Inc.

one Net: Information without boundaries…where the right people are connected with the right information at the right time to make the right decisions.

The one Net vision

Novell exteNd™

Novell Nsure™

Novell Nterprise™

Novell Ngage℠

The one Net vision

Novell Nterprise is an innovative family of products which gives you the power to enable and manage the constant interaction of people with your business systems — regardless of who they are or where they are.

Novell Nterprise™

Agenda

Introductions

Assumptions

Components of a Disaster Recovery Plan

Data Replication

Factors Affecting Data Replication

Distance Consideration

Latency Consideration

Bandwidth Consideration

Cluster of Clusters Implementation

Q & A

Introductions

Atiq Adamjee, Senior Architect, Novell Consulting

[email protected]

Brad Rupp, Software Engineer, Novell Inc.

[email protected]

Assumptions

Knowledge of Novell Cluster Services™ (NCS)

Have already implemented NCS or are in the process of implementing NCS

Need to implement a disaster recovery plan

This is not a full disaster recovery plan – but a large component of it

Components of a Disaster Recovery Plan

A complete disaster recovery plan must include more than just “servers”

People

Facilities

Information Technology

Much more...

Data Replication

Data may be replicated in many different ways

Host-based

Disk-based

Asynchronous

Synchronous

Host-based Replication

[Diagram: Data Center and Recovery Site linked by FC, WDM, SONET, ATM, IP, etc.]

Disk-based Replication

[Diagram: Data Center and Recovery Site linked by FC, WDM, SONET, ATM, IP, etc.]

Asynchronous Replication

[Diagram: write, write, replicate]

Latency of I/O to remote disk does not slow down host I/O

Data on the remote disk is only as current as the last replication session

Synchronous Replication (Mirroring)

Latency of I/O to remote disk will slow down host I/O

Data on the remote disk is always current

[Diagram: write, write, write; latency labels 2 ms and 15 ms]
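
The trade-off between asynchronous and synchronous replication can be put in numbers. The sketch below is a simplified model, not a description of any particular SAN: it assumes local and remote writes are issued in parallel and that a synchronous write completes only when the slower copy acknowledges. The 2 ms and 15 ms inputs are borrowed from the diagram labels on the assumption that they denote local and remote write latency, which the transcript does not confirm. Function names are illustrative.

```python
def host_write_latency_ms(local_ms: float, remote_ms: float, synchronous: bool) -> float:
    """Latency the host sees for one write under the simplified model."""
    if synchronous:
        # Mirroring: the write is not complete until both copies acknowledge,
        # so the slower (usually remote) disk sets the pace.
        return max(local_ms, remote_ms)
    # Asynchronous: the host waits only for the local disk; the remote copy
    # is updated later and may lag behind.
    return local_ms


def max_serial_writes_per_second(latency_ms: float) -> float:
    """Upper bound on back-to-back writes when each waits for the previous one."""
    return 1000.0 / latency_ms


if __name__ == "__main__":
    for mode, sync in (("asynchronous", False), ("synchronous", True)):
        latency = host_write_latency_ms(2.0, 15.0, sync)
        print(f"{mode}: {latency} ms per write, "
              f"at most {max_serial_writes_per_second(latency):.0f} serial writes/s")
```

Under these assumptions the synchronous host pays the remote latency on every write, while the asynchronous host pays only the local latency and accepts a remote copy that may be out of date.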

Factors Affecting Data Replication

Distance

Latency

Bandwidth

Distance Consideration

Within a campus

Under 10 km

Under 100 km

Over 100 km

Distance Consideration

Within a campus – use Ethernet or Fibre Channel

Under 10 km – consider Fibre Channel

Under 100 km – fiber, WDM, FC/IP, ATM, SONET, iSCSI, …

Over 100 km – iSCSI, fiber, WDM, FC/IP, ATM, …

As distance increases so does the cost of the connection

iSCSI has changed this by allowing SCSI data to be routed over TCP/IP.

Latency Consideration

Shorter round trip delay = more frequent transmission

Impacted by overhead, i.e. how much of each packet is really data

With longer distances, latency impacts performance more than bandwidth
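
To see why latency dominates over long distances, consider propagation delay alone. The sketch below assumes light travels through optical fiber at roughly 200,000 km/s (about 5 microseconds per kilometre, one way) and ignores switch, router, and protocol delays, so real round trips will be longer. The function names are illustrative.

```python
FIBER_KM_PER_MS = 200.0  # assumed: ~200,000 km/s in glass = 200 km per millisecond


def round_trip_ms(distance_km: float) -> float:
    """Round-trip propagation delay over fiber, ignoring equipment delays."""
    return 2.0 * distance_km / FIBER_KM_PER_MS


def max_round_trips_per_second(distance_km: float) -> float:
    """How many strictly sequential request/acknowledge exchanges fit in a second."""
    return 1000.0 / round_trip_ms(distance_km)


if __name__ == "__main__":
    for km in (10, 100, 1000):
        print(f"{km:>5} km: {round_trip_ms(km):.1f} ms round trip, "
              f"{max_round_trips_per_second(km):.0f} sequential exchanges/s")
```

However much bandwidth the link offers, a transfer that waits for each acknowledgement before sending more cannot exceed this exchange rate, which is why added distance hurts even on a fast link.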

Bandwidth Consideration

What is the amount of daily change in the data?

How much time do you have to replicate the data?

What is the round trip delay between this site and the distant replica?

What is the overhead for the selected transport?
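
The questions about daily change, replication time, and transport overhead combine into a rough sizing estimate. The sketch below is a back-of-the-envelope calculation under stated assumptions: changes are spread evenly over the replication window, and overhead is expressed as the fraction of the raw link that does not carry payload. The function name and sample figures are hypothetical. Round trip delay is left out; it limits how much of the link a protocol that waits for acknowledgements can actually use.

```python
def required_mbps(daily_change_gb: float, window_hours: float,
                  overhead_fraction: float) -> float:
    """Link speed in megabits/s needed to ship the daily change within the window."""
    if window_hours <= 0:
        raise ValueError("window_hours must be positive")
    if not 0.0 <= overhead_fraction < 1.0:
        raise ValueError("overhead_fraction must be in [0, 1)")
    payload_megabits = daily_change_gb * 8_000.0   # 1 GB = 8,000 megabits (decimal units)
    window_seconds = window_hours * 3_600.0
    payload_rate = payload_megabits / window_seconds
    # Only (1 - overhead) of the raw link carries data, so scale up accordingly.
    return payload_rate / (1.0 - overhead_fraction)


if __name__ == "__main__":
    # Hypothetical workload: 90 GB of change, 8-hour window, 20% overhead.
    print(f"{required_mbps(90, 8, 0.20):.2f} Mbit/s")
```

Comparing the result with the rated speed of a candidate link gives a first indication of whether the replication window is realistic.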

Cluster of Clusters

Powered by NetWare and Novell Cluster Services

Manually fails over network services from one cluster to another, geographically separate cluster

Provides high availability to your mission-critical resources in the event of a disaster

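SBD (split brain detector) and heartbeat issues go away, since each site is its own cluster and neither has to span the link between sites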

Cluster of Clusters Implementation Overview

Real World Implementation of a Cluster of Clusters

Six servers at the primary site

Two servers at the disaster recovery (DR) site

EMC CLARiiON SAN at both sites

Both SANs mirrored over an OC3 link

Cluster of Clusters Implementation – Phase I

Unmask the LUNs on the SAN so all servers at the site can see all shared devices

If using existing NSS pools, mark them as shareable for clustering

Install NCS

Activate the pools and volumes

Create cluster volume objects and virtual servers

Configure resources, load/unload scripts, failover priorities, etc.

Bring resources online

Test failover between servers

Cluster of Clusters Implementation – Phase II

Down the cluster at the primary site

Stop SAN mirroring

Mask (hide) the LUNs from the servers at the primary site

Remove the LUN masking (make the LUNs visible) from the servers at the DR site

Repeat the steps in Phase I to set up the DR cluster

Cluster of Clusters Implementation – Phase III

Down the cluster at the DR site

Mask (hide) the LUNs from the servers at the DR site

Start SAN mirroring

Remove the LUN masking (make the LUNs visible) from the servers at the primary site

Start the primary cluster

Cluster of Clusters Implementation – Phase IV

Prepare for disaster recovery

Login Scripts – create DR scripts ahead of time and comment them out or use tools like the NDS Login Script editor available from Cool Solutions

Printing – set up printers at the DR site as printers are location-based entities

eDirectory replicas

Other services, e.g. DHCP, DNS, SLP, etc.

Cluster of Clusters Implementation – Phase V

Test Cluster Failover

Down the cluster at the primary site

Stop the SAN mirroring

Mask (hide) the LUNs from the servers at the primary site

Remove the LUN masking (make the LUNs visible) from the servers at the DR site

Enable the cluster (and mount the volumes) at the DR site

Make necessary changes to the login scripts

Restore other services, e.g. printing, DHCP, DNS, SLP, etc.
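
The order of the storage steps above appears chosen so that the LUNs are never visible at both sites at once and mirroring is stopped before the DR site takes ownership. The sketch below is a toy state model that makes those ordering constraints explicit. It is not an NCS or EMC tool: the class, its fields, and its checks are all hypothetical, and the login script and service steps are not modelled.

```python
from dataclasses import dataclass


@dataclass
class TwoSiteState:
    """Toy model of the two sites; every field is an illustrative assumption."""
    primary_cluster_up: bool = True
    dr_cluster_up: bool = False
    mirroring: bool = True
    luns_visible_primary: bool = True
    luns_visible_dr: bool = False

    def down_primary_cluster(self) -> None:
        self.primary_cluster_up = False

    def stop_mirroring(self) -> None:
        if self.primary_cluster_up:
            raise RuntimeError("down the primary cluster before stopping mirroring")
        self.mirroring = False

    def mask_primary_luns(self) -> None:
        if self.primary_cluster_up:
            raise RuntimeError("primary cluster is still running")
        self.luns_visible_primary = False

    def unmask_dr_luns(self) -> None:
        if self.mirroring or self.luns_visible_primary:
            raise RuntimeError("stop mirroring and mask the primary LUNs first")
        self.luns_visible_dr = True

    def enable_dr_cluster(self) -> None:
        if not self.luns_visible_dr:
            raise RuntimeError("DR servers cannot see the LUNs yet")
        self.dr_cluster_up = True


def fail_over(state: TwoSiteState) -> TwoSiteState:
    """Run the storage-related failover steps in the order the slide lists them."""
    state.down_primary_cluster()
    state.stop_mirroring()
    state.mask_primary_luns()
    state.unmask_dr_luns()
    state.enable_dr_cluster()
    return state


if __name__ == "__main__":
    print(fail_over(TwoSiteState()))
```

Calling a step out of order, for example unmasking the DR LUNs while mirroring is still running, raises an error in this model, which is the kind of check a written runbook would ask the operator to perform by hand.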

Cluster of Clusters Implementation – Phase VI

Test Cluster Failback

Down the cluster at the DR site

Mask (hide) the LUNs from the servers at the DR site

Remove the LUN masking (make the LUNs visible) from the servers at the primary site

Enable the cluster (and mount the volumes) at the primary site

Make necessary changes to the login scripts

Restore other services, e.g. printing, DHCP, DNS, SLP, etc.

Question and Answer

General Disclaimer

This document is not to be construed as a promise by any participating company to develop, deliver, or market a product. Novell, Inc., makes no representations or warranties with respect to the contents of this document, and specifically disclaims any express or implied warranties of merchantability or fitness for any particular purpose. Further, Novell, Inc., reserves the right to revise this document and to make changes to its content, at any time, without obligation to notify any person or entity of such revisions or changes. All Novell marks referenced in this presentation are trademarks or registered trademarks of Novell, Inc. in the United States and other countries. All third-party trademarks are the property of their respective owners.

No part of this work may be practiced, performed, copied, distributed, revised, modified, translated, abridged, condensed, expanded, collected, or adapted without the prior written consent of Novell, Inc. Any use or exploitation of this work without authorization could subject the perpetrator to criminal and civil liability.