EMC Data Domain Technical Overview
© Copyright 2014 EMC Corporation. All rights reserved.
EMC Data Domain Overview
EMC Data Domain: Protection Storage for Backup and Archive Data
• Scale and performance
  – Reduce storage required by 10–30x
  – Protect up to 100 PB of logical capacity in a single system
  – Complete backups faster, at up to 31 TB per hour
• Seamless integration
  – Integrates with backup, archiving, and enterprise applications
• Reliable access and recovery
  – End-to-end data verification, fault detection, and self-healing
• Efficient resource utilization
  – Send only deduplicated data across the network to reduce bandwidth required by up to 99%
EMC Data Domain: Leadership and Innovation
A History of Industry Firsts
First deduplication NAS
First deduplication volume replication
First deduplication directory replication
First deduplication virtual tape library
Fastest backup controller
Cascaded replication
First distributed deduplication processing
First deduplication for long-term retention of backup data
First inline deduplication to support retention for compliance
2013
First deduplication optimized for backup and archive data
2014
First deduplication with secure multi-tenancy
2012 2011 2010 2009 2008 2007 2006 2005 2004 2003
Market Leader: Data Protection Software & Storage
[Bar chart: revenue in $M (axis 0 to 3,000), split into Appliance and Software]
Source: IDC, Worldwide Purpose-Built Backup Appliance Tracker, Q4 2013, and IDC, Worldwide Storage Software QView, Q4 2013
Why Protection Storage?
Scalability
• Enable scale without cost and complexity
• Inline deduplication minimizes storage footprint by 10–30x
Data Integrity
• Data Domain Data Invulnerability Architecture ensures data is recoverable and accessible
Consolidation
• Consolidate backup, archive, and disaster recovery on a single system
• Protect a wide variety of data sources
Data Domain Basics: Seamless Integration with Existing Environment
[Diagram: Control Tier (backup, archiving, and enterprise applications), Target Tier (Data Domain system), and Disaster Recovery Tier (Data Domain system), linked by replication]
Connectivity: CIFS, NFS, NDMP, and DD Boost over Ethernet; Virtual Tape Library (VTL) or DD Boost over Fibre Channel
Application categories: Backup Applications, Archiving Applications, Enterprise Applications
Vendors and products shown: EMC, Symantec, CommVault, VMware, HP, IBM, Dell, OpenText, Oracle, Pivotal, SQL, SAP HANA, DB2
Data Deduplication: Technology Overview
Store More Backups in a Smaller Footprint
Segments in each backup:
Friday Full Backup:         A B C D A E F G
Mon Incremental:            A B H
Tues Incremental:           C B I
Weds Incremental:           E G J
Thurs Incremental:          A C K
Second Friday Full Backup:  B C D E F L G H
Unique segments stored:     A B C D E F G H I J K L

Backup                   Logical   Estimated Reduction   Physical
Friday Full              1 TB      2–4x                  250 GB
Monday Incremental       50 GB     7–10x                 5 GB
Tuesday Incremental      50 GB     7–10x                 5 GB
Wednesday Incremental    50 GB     7–10x                 5 GB
Thursday Incremental     50 GB     7–10x                 5 GB
Second Friday Full       1 TB      50–60x                18 GB
TOTAL                    2.2 TB    7.6x                  288 GB
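The segment diagram and the table on this slide can be reproduced with a short sketch. This is illustrative only: the segment letters and the GB figures are copied from the slide, while the function names, the use of a Python set as the segment index, and the conversion 1 TB = 1000 GB are assumptions made for the example, not a description of how a Data Domain system is implemented.

```python
# Illustrative sketch only. Segment letters come from the slide; the set-based
# index is a hypothetical stand-in, not Data Domain's actual data structure.
BACKUPS = [
    ("Friday full", "ABCDAEFG"),
    ("Mon incremental", "ABH"),
    ("Tues incremental", "CBI"),
    ("Weds incremental", "EGJ"),
    ("Thurs incremental", "ACK"),
    ("Second Friday full", "BCDEFLGH"),
]


def deduplicate(backups):
    """Store each segment once; return (stored segments, new segments per backup)."""
    seen = set()
    stored = []
    new_per_backup = {}
    for name, segments in backups:
        new_segments = []
        for segment in segments:
            if segment not in seen:
                seen.add(segment)
                stored.append(segment)
                new_segments.append(segment)
        new_per_backup[name] = new_segments
    return stored, new_per_backup


def reduction_ratio(logical_gb, physical_gb):
    """Overall reduction factor: logical data protected / physical data stored."""
    return sum(logical_gb) / sum(physical_gb)


stored, new_per_backup = deduplicate(BACKUPS)
print("".join(stored))                       # ABCDEFGHIJKL
print(new_per_backup["Second Friday full"])  # ['L']

# Figures from the slide's table, taking 1 TB = 1000 GB (an assumption).
LOGICAL_GB = [1000, 50, 50, 50, 50, 1000]
PHYSICAL_GB = [250, 5, 5, 5, 5, 18]
print(round(reduction_ratio(LOGICAL_GB, PHYSICAL_GB), 1))  # 7.6
```

The second full backup adds only one new segment (L), which is why its reduction factor on the slide is so much higher than that of the first full backup. The TOTAL row follows from 2,200 GB logical divided by 288 GB physical.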
Data Domain Data Invulnerability Architecture
Industry’s Best Defense Against Data Integrity Issues
✓ Stored Correctly: Inline Data Verification
✓ Stays Correct: Continuous Fault Detection and Self-Healing
✓ Recovers Correctly: Recovery/Access Verification
Inline vs. Post-Process Deduplication
POST-PROCESS: Deduplication After Storing
3x disk accesses to shared store
The more processes, the more resource contention
  − Copy to tape: Too slow to stream tape
  − Recovery: Service level agreement predictability
  − Replication: Poor time-to-disaster-recovery
  − Deduplication: If interleaved with backup or restore
More administration to fight these issues
INLINE: Deduplication Before Storing
Other activities unimpeded
  − Predictable
  − Simpler
CPU-Centric vs. Spindle-Bound Performance
[Chart: throughput in MB/s (50 to 6,000) against number of disk spindles (50 to 200), comparing Data Domain with most deduplication vendors; disk types labeled Fibre Channel and SATA]
Improvement since 2003: Throughput ~200x; Capacity ~1650x
Delivering Data Protection as a Service: Secure Multi-Tenancy
Enables enterprises to deliver Data Domain in a private cloud
Enables service providers to deliver Data Domain in a private/public cloud
Features:
  – Logical data isolation and administration
  – Roles for users and admin
  – Tenant management and reporting
[Diagram: Tenant A and Tenant B, with Tenant Unit A and Tenant Unit B on one Data Domain system]
Data Domain Software Options
Data Domain Boost
• Advanced integration with apps
• Speed backups by up to 50%
Data Domain Encryption
• Inline encryption of data at rest
• Protects against theft or loss of a physical system
Data Domain Extended Retention
• Long-term retention of backup
• Up to 100 PB logical capacity
Data Domain Replicator
• Network-efficient and encrypted
• Consolidate up to 270 remote sites into a single system
Data Domain Retention Lock
• Secure retention for archive data
• Satisfies governance and compliance
Data Domain Virtual Tape Library
• Supports open systems and IBM i operating environments
Data Domain Boost: Advanced Integration for Faster Backup
• Advanced integration with leading backup and enterprise applications
• Speeds backups by up to 50%
• Enables more efficient resource utilization
• Provides application control of Data Domain replication process
Data Domain Boost Ecosystem
Backup Server: Avamar, NetWorker, VDP Advanced, NetBackup, Backup Exec, vRanger, NetVault, Data Protector
App Server: RMAN, SAP, SAP HANA, DB2, SQL, Greenplum
Legend: DD Boost Supported over SAN; DD Boost Supported over LAN; DD Boost Supported over WAN
Data Domain Replicator: Network-Efficient Replication for Backup and Archive Data
• Reduces bandwidth requirements up to 99%
• Protects sensitive data when replicating over untrusted networks
• Accelerates time-to-disaster recovery (DR) readiness
• Consolidates backup and archive data from hundreds of remote sites
• Leverages multiple replication topologies
[Diagram label: Disaster Recovery Site]
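To see why sending only deduplicated data cuts replication bandwidth, consider the sketch below. It is a toy model under loud assumptions: fixed 4 KiB segments, SHA-256 fingerprints, a Python dict standing in for the remote system, synthetic data, and no accounting for fingerprint or protocol overhead. None of these details come from the slides, and the 99% figure it prints is a property of the synthetic input, not a measurement.

```python
import hashlib

SEGMENT_SIZE = 4096  # hypothetical fixed segment size, chosen only for this example


def fingerprint(segment: bytes) -> str:
    """Content fingerprint used to recognize segments the destination already holds."""
    return hashlib.sha256(segment).hexdigest()


def split_segments(data: bytes, size: int = SEGMENT_SIZE):
    """Cut data into fixed-size segments (an assumption made for simplicity)."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def replicate(data: bytes, destination: dict) -> int:
    """Send only segments missing at the destination; return payload bytes sent."""
    bytes_sent = 0
    for segment in split_segments(data):
        key = fingerprint(segment)
        if key not in destination:
            destination[key] = segment
            bytes_sent += len(segment)
    return bytes_sent


def bandwidth_saved(logical_bytes: int, bytes_sent: int) -> float:
    """Fraction of the logical data that did not have to cross the network."""
    return 1 - bytes_sent / logical_bytes


# Synthetic data: 100 distinct segments, then a second backup with one segment changed.
first_backup = b"".join(bytes([i]) * SEGMENT_SIZE for i in range(100))
second_backup = first_backup[:-SEGMENT_SIZE] + bytes([200]) * SEGMENT_SIZE

remote = {}  # stand-in for the disaster recovery system's segment store
sent_first = replicate(first_backup, remote)
sent_second = replicate(second_backup, remote)
print(sent_first, sent_second)
print(bandwidth_saved(len(second_backup), sent_second))
```

The first replication sends every segment because the destination is empty. The second sends only the single changed segment, so nearly all of the logical data stays off the network.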
Data Domain Encryption: Enhance the Security of Backup and Archive Data
• Encrypts all data stored on a Data Domain system
• Encrypts data inline before it’s written to disk
• Leverage the internally generated static default key or rotate keys for compliance
[Diagram labels: Backup, Archive]
Data Domain Extended Retention: Long-Term Retention of Backup Data
[Diagram: Data Domain Controller with an Active Tier and a Retention Tier]
• Separate tiers of storage for long-term retention of data to eliminate reliance on tape
• Cost-effective scalability
• Fault isolation for access and recoverability of long-term data
• Granular replication for simplified disaster recovery
Governance and Compliance for Archive Data: Data Domain Retention Lock
Efficiently store and manage governance and compliance archive data on a single Data Domain system
Meets the strictest regulatory requirements such as SEC 17a-4(f)
Litigation hold protects archive data during legal actions
Secure file locking of archive data at an individual file level
Integrates seamlessly with industry-leading archiving applications
[Diagram labels: Archive Software, Backup Data, Archive Data, Governance Archive Data, Compliance Archive Data]
Data Domain Virtual Tape Library: High-Speed, Inline Deduplication for SAN Environments
• Eliminates physical tape challenges
• Integrates seamlessly into existing Fibre Channel SAN environments
• Replicates virtual tape cartridges efficiently offsite, over a wide area network (WAN)
Data Domain Management Center: Virtual Appliance for Aggregate Multi-system Management
• Dashboards show the aggregate status of all Data Domain systems
• Manages and monitors up to 75 Data Domain systems through a single interface
• Role-based access control restricts access to authorized users
Data Domain Systems: Protection Storage for Backup and Archive
Backup Use Cases: Database, Mainframe, IBM i, Big Data, File/Email, VMware, NAS, ROBO
Archive Use Cases: File/Email, Big Data, Virtual Machine, Content Mgmt., Storage Tiering, Database
Data Sources: Enterprise Applications, Content Management, File Shares/Servers, Virtual Machines, Databases, Email Servers
Network Replication Over WAN: Disaster Recovery, Long-Term Retention (On Premise or Cloud)
Data Domain Systems

Model    Speed (DD Boost)   Speed (other)   Logical capacity               Usable capacity
DD160    1.1 TB/hr          667 GB/hr       40–195 TB                      Up to 3.98 TB
DD2200   4.7 TB/hr          3.5 TB/hr       172–860 TB                     Up to 17.2 TB
DD2500   13.4 TB/hr         5.3 TB/hr       1.3–6.6 PB                     Up to 133 TB
DD4200   22.0 TB/hr         10.2 TB/hr      1.8–9.4 PB; 5.6–28.4 PB¹       Up to 189 TB; up to 569 TB¹
DD4500   22.0 TB/hr         10.2 TB/hr      2.8–14.2 PB; 11.4–57.0 PB¹     Up to 285 TB; up to 1.1 PB¹
DD7200   26.0 TB/hr         11.9 TB/hr      4.2–21.4 PB; 17.1–85.6 PB¹     Up to 428 TB; up to 1.7 PB¹
DD990    31.0 TB/hr         15.0 TB/hr      5.7–28.5 PB; up to 100 PB¹     Up to 570 TB; up to 2.0 PB¹

¹ With DD Extended Retention software option

Market segments: Small Enterprise/ROBO, Midsize Enterprise, Large Enterprise

Data Domain Software
• DD Boost
• DD Encryption
• DD Extended Retention
• DD Management Center
• DD Replicator
• DD Retention Lock
• DD Virtual Tape Library
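The throughput rows above translate directly into backup-window estimates. The sketch below is a back-of-the-envelope calculation, not a sizing tool: the throughput figures are copied from the slide, while the function names, the example workload (100 TB in an 8-hour window), and the conversion of 667 GB/hr to 0.667 TB/hr are assumptions. The slide's figures are maximum rates, so the computed times are best-case lower bounds.

```python
# Maximum throughput in TB/hr, copied from the slide's table.
DD_BOOST_TB_PER_HR = {
    "DD160": 1.1, "DD2200": 4.7, "DD2500": 13.4, "DD4200": 22.0,
    "DD4500": 22.0, "DD7200": 26.0, "DD990": 31.0,
}
# 667 GB/hr is written as 0.667 TB/hr, assuming 1 TB = 1000 GB.
OTHER_TB_PER_HR = {
    "DD160": 0.667, "DD2200": 3.5, "DD2500": 5.3, "DD4200": 10.2,
    "DD4500": 10.2, "DD7200": 11.9, "DD990": 15.0,
}


def backup_hours(dataset_tb: float, throughput_tb_per_hr: float) -> float:
    """Best-case hours to ingest dataset_tb at the given maximum throughput."""
    if throughput_tb_per_hr <= 0:
        raise ValueError("throughput must be positive")
    return dataset_tb / throughput_tb_per_hr


def models_meeting_window(dataset_tb: float, window_hours: float, speeds: dict) -> list:
    """Models whose best-case ingest time fits inside the backup window."""
    return [
        model for model, speed in speeds.items()
        if backup_hours(dataset_tb, speed) <= window_hours
    ]


# Hypothetical workload: a 100 TB full backup that must finish within 8 hours.
print(round(backup_hours(100, DD_BOOST_TB_PER_HR["DD990"]), 2))
print(models_meeting_window(100, 8, DD_BOOST_TB_PER_HR))
print(models_meeting_window(100, 8, OTHER_TB_PER_HR))
```

For this hypothetical workload, five models fit the window with DD Boost, but only the DD990 fits at the non-Boost rate.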
Why Data Domain? Protection Storage for Backup and Archive Data
Industry-leading speed and scale
Seamless integration
Reliable access and recovery
Efficient network utilization
[Diagram labels: Backup Data, Archive Data]