Best Practices for Surviving Outages

Best Practices for Surviving OutagesDesigning and implementing a High Availability and Disaster Recovery strategy

Sal Cardello, Director of Pro Services

Matt Dolian, System Engineer

Avroham Katz, System Engineer

Disaster Recovery

Photo credit: naturaldisasterss.com/wp-content/uploads/2011/12/Natural-Disaster-Images.jpg

0 - No off-site data

1 - Data backup with no hot site

2 - Data backup with hot site

3 - Electronic vaulting

4 - Point-in-time copies

5 - Transaction integrity

6 - Zero or near-Zero data loss

7 - Highly automated, business integrated solution

Tiers of Disaster Recovery

Citation: http://en.wikipedia.org/wiki/Seven_tiers_of_disaster_recovery

Definition: High Availability

“Design approach & associated service implementation that ensures a pre-arranged level of operational performance will be met during a contractual measurement period”

Citation: ttp://en.wikipedia.org/wiki/High_availability

High Availability Architecture

Why implement HA?

Best Practices for High Availability

7Photo Credit: http://bit.ly/z9OEwG

Environment Analysis

Geographic Mirroring

Database Replication

Store Assets Replication

Validate Synchronization

Escalation Plan

Launch

• Environment Specific Configurations

• Asset Hosting

• Page Caching

• Other Data Stores

• Background Processing

• Cron Jobs

Application Considerations

Photo credit: http://www.flickr.com/photos/dseneste/5912382808/

1. Client contacted per terms of SLA

2. Engine Yard syncs database and performs manual failover

3. Redundant database promoted to master

4. DNS is updated

5. Replication to former master is re-established

Failover Process at Engine Yard

Manual, customer owned decision

Questions?

Get in touch

Contact us: Sal Cardello, Director of Pro Servicesproservices@engineyard.com

Learn more:http://www.engineyard.com/services

Best Practices for Surviving Outages

Technology

Transcript of Best Practices for Surviving Outages

The Secure Communicator: Best Practices for Surviving Heartbleed and Other Threats

The Impact of Router Outages on the AS-Level Internetconferences.sigcomm.org/sigcomm/2017/files/program/ts-11-3-outages… · The Impact of Router Outages on ... Challenges in topology

Detecting Peering Infrastructure Outages in the Wild...Detecting peering infrastructure outages in the wild 54 159 outages in 5 years of BGP data 76% of the outages not reported in

Planned Outages 96-98

A SMALL BUSINESS GUIDE - Pronto Marketing · A SMALL BUSINESS GUIDE – SURVIVING POWER & INTERNET OUTAGES 3 Installing a standby generator is the best way to ensure ongoing power

TGP Outages August & September 2013

Sustaining Planned/Unplanned Database Outages: Best Practices … - Copy.p… · Objectives for planned/unplanned downtimes Planned Maintenance – Detect “DOWN” event triggered

“Surviving Securely & Surviving Security -- Thoughts After 9/11”

Sustaining Planned/Unplanned Database - Oracle...Sustaining Planned/Unplanned Database Outages: Best Practices for DBAs & Developers Nirmala Sundarappa Principal Product Manager, Oracle

Best Practices for Acquiring Transportation Services Surviving Capacity “Crunches” & The Impact of CSA 2010 GSA’s 2011 Transportation Forum Washington,

EIA Refinery Outages

Are Lengthy Power Outages Acceptable?

Outages During the Run

Protect your app from Outages

A SMALL BUSINESS GUIDE - Exigent Technologies · A SMALL BUSINESS GUIDE TO SURVIVING POWER & INTERNET OUTAGES 3 Installing a standby generator is the best way to ensure ongoing power

CIO Study: Certificate-Related Outages Continue to Plague ......CIO Study: Certificate-Related Outages Continue to Plague Organizations 2 Executive Overview Outages caused by expired

Best Practices For Avoiding or Surviving an Audit GREG REYBOLD, J.D.

BEST PRACTICES FOR SURVIVING ORIGIN VERIFICATIONS BY US CUSTOMS & BORDER PROTECTION

A SMALL BUSINESS GUIDE · A SMALL BUSINESS GUIDE – SURVIVING POWER & INTERNET OUTAGES 3 Installing a standby generator is the best way to ensure ongoing power for IT and other operations.

Cost of Data Center Outages 2016 - … of Data Center Outages ... third study is to continue to analyze the cost behavior of unplanned data center outages. ... cost estimation is a