Post on 02-Nov-2014
description
© 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.© 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
Backup and Archiving in the AWS Cloud
Agenda
• Why AWS for Backup and Archive?• AWS Global Infrastructure• Traditional vs. Cloud Approach• Cloud Backup and Archive Architecture• Cloud Integrated Backup and Archive Gateways • TCO
Why AWS for Backup and Archive?
Metered usage:
Pay as you go
No capital investment
No commitment
No risky capacity planning
Avoid OPEX and risks
of physical media
handling
Control your
geographic locality for
performance and
compliance
Gartner Magic Quadrant for Public Cloud Storage Services 2014
Gartner, Magic Quadrant for Cloud Storage Services, Gene Ruth, Arun Chandrasekaran et al., July 9, 2014. This graphic was published by Gartner, Inc. as part of a larger research document and should be evaluated in the context of the entire document. The Gartner document is available at http://www.gartner.com/technology/reprints.do?id=1-1WWKTQ3&ct=140709&st=sb. Gartner does not endorse any vendor, product or service depicted in its research publications, and does not advise technology users to select only those vendors with the highest ratings. Gartner research publications consist of the opinions of Gartner's research organization and should not be construed as statements of fact. Gartner disclaims all warranties, expressed or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose.
AWS Global Infrastructure
10 Regions
25 Availability Zones
50+ Edge Locations
AWS Regions and Availability Zones
You decide where your data resides
Archive: Data retained for the
long term, for
compliance or research
Backup: Data retained to
support near-term
business continuity
Backup and Archive defined
Traditional Backup and Archive
Traditional Backup and Archive
• Time: Long/slow recovery time• Cost: Capital intensive with ongoing upgrades• Effort: Complex to manage• Quality: Low durability, error prone
Days or Weeks
Traditional Backup and Archive
• Backup Software• Catalogs backup sets• Application agents• Media servers
• Connectivity:• LAN/WAN• SAN: FibreChannel
• Targets:• Tape Libraries• Virtual Tape Libraries
• Tape out / Vaulting
AWS Storage and Archive Architecture
Cloud Backup and Archive Topologies
1. Branch office backup to cloud2. Core data center backup to cloud3. Cloud backup to cloud4. Hybrid cloud backup
Branch office backup to cloud
Considerations:- Backup Software- Storage / Caching Gateway - WAN or Internet- Deduplication- Compression- Encryption- WAN Acceleration
Core data center backup to cloud
Considerations:- Backup Software- Storage / Caching gateway- Direct Connect or Internet- Deduplication- Compression- Encryption- WAN Acceleration
Hybrid Cloud Backup
VPC – Datacenter #4
Single GUI for Management
Cloud backup to Cloud Applications running on EC2 backing up to S3 / Glacier
Considerations:- Backup software- Encryption- Deduplication- Compression- Native S3 and Glacier
integration- EBS Snaps / Scripting
AWS Storage and Archive Options
Amazon Simple Storage Service (S3)Highly scalable object storage
1 byte to 5 TB in size
99.999999999% durability
Amazon Elastic Block Store (EBS)High-performance block storage device
1 GB to 1 TB in size
Mount as drives to instances with
snapshot/cloning functionalities
Amazon GlacierLong-term object archive
Extremely low cost per gigabyte
99.999999999% durability
AWS Storage and Archive Options
Amazon Elastic Block Store (EBS)
• High I/O block storage for Amazon EC2
• Point-in-time snapshots to Amazon S3• 99.999999999% Durability• Snapshot software is FREE
• Point-in-time snapshots across regions
AWS Storage and Archive OptionsAmazon Simple Storage Service (S3)
• Durable and low cost
• Unlimited number of objects and volume
• Back up to Amazon S3 buckets via
HTTP/HTTPS
– Create scripts using PowerShell, Perl,
Python…
– Numerous solutions for data backup
• Authentication mechanisms ensure data is
kept secure
• Reduced redundancy storage (RRS) option
AWS Storage and Archive OptionsAmazon Glacier
• $0.01 per GB/mo, $120 per TB/yr• 3-5 hour data retrieval latency• Archives: single file or zipped files• Vaults: collection of archives• Infinite archival storage• 99.999999999% durability• Immutable, encrypted by default
AWS Storage and Archive OptionsObject Lifecycle Management: Amazon S3 → Amazon Glacier
→
• Seamlessly move data from Amazon S3 → Amazon Glacier• 3-5 hour asynchronous retrieval• Data lifecycle policies• $0.01 per GB for Amazon Glacier costs
Data Ingestion Options
AWS Direct ConnectDedicated bandwidth between
your site and AWS
InternetTransfer data in a secure SSL tunnel over
the public Internet
AWS Import/ExportPhysical transfer of media into and
out of AWS
AWS Ingest OptionsAWS Direct Connect
• Private connectivity to AWS– Physical connection – 1 Gbps or 10 Gbps port
• Consistent network performance• Consider burst models on ingest• Reduces costs for bandwidth-heavy
outbound workloads
Locations• CoreSite 32 Avenue of the Americas, NY • CoreSite One Wilshire & 900 North Alameda, LA • Equinix DC1 – DC6 & DC10 - DC11, Ashburn, VA • Equinix SV1 & SV5, San Jose, CA • Equinix SE2 & SE3, Seattle, WA • Equinix SG2, Singapore • Equinix SY3, Sydney • Equinix TY2, Tokyo • Eircom, Clonshaugh • TelecityGroup Docklands, London • Terremark NAP do Brasil, Sao Paulo
AWS Ingest OptionsAWS Import/Export
• Rapidly move data into and out of AWS
• Portable storage device shipment to AWS
• Supports– Amazon EBS– Amazon S3– Amazon Glacier
• Use cases– Initial data migration– Content distribution via portable
devices– Disaster recovery
Cloud Integrated Backup and Archive Gateways
AWS Storage Gateway-VTL(Virtual Tape Library)
• On-premises, virtual tape library storage appliance
• $125 / Month
• 10 virtual tape drives / 1500 virtual tape slots
• 150 TB local cache– VTL – virtual tape library
• Restore in seconds from VTL– VTS – virtual tape shelf
• Next Generation Offsite Vault• 24 hour retrieval from VTS
• Encryption in transit and at rest
• Gateway VTL-AMI
AWS partner backup and archive solutions
Avere → S3-GlacierAWS SGW → S3AWS VTL → S3-Glacier BridgeSTOR → S3-GlacierCA Arcserve → S3CA Mainframe → S3-GlaicerCommvault → S3-GlacierCtera → S3Druva → S3 Oracle RMAN + OSB Module → S3Panzura → S3 Riverbed Whitewater → S3-GlacierSonian → S3Veeam → S3-GlacierZmanda → S3
Riverbed SteelStore
• Local caching appliance• Presents NAS protocols
– CIFS / NFS
• Up to 30x deduplication• Compression• Encryption• Key Management• WAN Acceleration• S3 and Glacier support• AMI Available
Ctera
Commvault
• Unified platform integrates Backup, Archive, Replication, Analysis and Search, Alerting, Reporting, and Tracking of all data via a single common code base
• Integrated with Amazon S3 and Glacier with deduplication & encryption support
• Single console management Amazon S3 Amazon Glacier
TCO: On-Premises Cost Considerations
1. Primary storage hardware (primary / remote site)2. DR / Remote site storage hardware3. Raw to utilized storage (both primary and DR)4. Storage growth (cost of upgrades)5. Storage management software and 3rd party tools6. Professional services7. Hardware maintenance8. Software maintenance9. Backup software10.Backup hardware (primary / remote site)11. Offsite tape storage / vault12.Archive software
13.Archive hardware14.Power15.Cooling16.Space17.Labor18.Cost of capital19.Training20.Asset depreciation21.Migration22.Decommission / remove23.Recycle
AWS – Your Global Data Center for Backup and Archive
• Choose the region that fits your business and compliance needs• 10 regions world wide – set up with a few clicks• Broad range of backup/archive tools that are AWS integrated• Low cost, reliable AWS Transport and Storage options• Enhance Security Posture• Increase Scalability• Significantly Higher Data Durability• All at a lower TCO
© 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.© 2014 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
THANK YOU
AWS Storage Gateway
• On-premises, virtual iSCSI storage appliance
• $125 / Month
• Local cache enables low latency access to data
• Server Side Encryption (SSE)
• 5 TB of throughput per day
• Recover to Amazon EBS
TrillionsOf Unique Customer Objects
AWS Ingest OptionsInternet / One Common Theme: Parallel Uploads
1. Multipart upload
2. Request rate optimization
3. TCP window scaling
4. TCP selective acknowledgement
AWS has customers that ingest roughly 1 PB per day
Customer StoriesAWS Storage Gateway is used in a variety of ways
Jollibee (JFC) is using the AWS Storage Gateway to backup and mirror their Oracle SQL server database from their on-premises
data center to AWS. JFC is the largest fast food chain in the Philippines with revenues well over 2 Billion USD.. The Storage
Gateway also provides us access to the same database snapshots for use in Amazon EC2, providing a cost-effective in-
cloud DR solution.
AWS Storage Gateway provided us the most cost effective way to backup our SAP workloads to AWS, it is helped us perform SAP System ‘refresh’ much faster and in a more convenient way, backing up to S3 has also helped us
to prepare for DR & also run SAP Dev/QA restores easily on EC2
“Amazon Web Services and AWS Storage Gateway are great assets that help us scale fast, store data in an ultra-secure
environment, spend more time on product development (rather than disaster recovery & backup)
…By using AWS Storage Gateway, we went to just hours instead of days to restore from backup.”
The large Japanese Retail chain uses AWS Storage Gateway to share & store files in S3 and drastically cut down it’s spend on premise NAS
footprint.