Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Prahlad Rao, Solutions Architect. Sept 29th 2016. Managed Database Services on Amazon Web Services

Transcript of Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Page 1: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Prahlad Rao, Solutions Architect

Sept 29th 2016

Managed Database Services on Amazon Web Services

Page 2: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Today’s agenda

• Why managed database services?
• A non-relational managed database
• A relational managed database
• A managed in-memory cache
• A managed data warehouse
• What to do next

Page 3: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Why managed database services?

Page 4: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Options for running your database

• Self-Managed: You are responsible for the hardware, OS, security, updates, backups, replication, etc., but have full control over them. This is typically on premises.

• EC2 Instances: You only need to focus on database-level updates, patches, replication, backups, etc., and don’t have to worry about the hardware and underlying infrastructure.

• Fully Managed: You get features such as backup and replication as a packaged service and don’t have to handle patching and updates.

Page 5: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

What are the AWS managed DB options?

Page 6: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

A managed service for each major DB type

• Amazon DynamoDB: Document and Key-Value Store
• Amazon RDS: SQL Database Engines
• Amazon ElastiCache: In-Memory Key-Value Store
• Amazon Redshift: Data Warehouse

Page 7: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Pick the best tool for the job

Page 8: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

What is Amazon RDS?

Page 9: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Relational databases

Fully managed

Fast, predictable performance

Simple and fast to scale

Low cost, pay for what you use

Amazon RDS

Amazon Aurora

Page 10: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Use cases

Applicable wherever you need relational databases

• eCommerce
• Gaming
• Websites
• IT Solutions
• Apps
• Reporting

Page 11: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

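RDS feature matrix

Feature          | Aurora | MySQL           | PostgreSQL      | MariaDB         | Oracle          | SQL Server
-----------------+--------+-----------------+-----------------+-----------------+-----------------+----------------
Max storage      | 64 TB  | 6 TB            | 6 TB            | 6 TB            | 6 TB            | 4 TB
Provisioned IOPS | NA     | 30,000          | 30,000          | 30,000          | 30,000          | 20,000
Largest instance | R3.8XL | R3.8XL, M4.10XL | R3.8XL, M4.10XL | R3.8XL, M4.10XL | R3.8XL, M4.10XL | R3.8XL, M4.10XL

[Rows whose per-engine marks are not preserved in this transcript: VPC; High availability; Instance scaling; Encryption; Read replicas (with an "Oracle GoldenGate" entry); Cross region; Scale storage (with an "Auto Scaling" entry)]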

Page 12: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon RDS is simple and fast to scale

Database instance types offer a range of CPU and memory selections

Scale up or down among instance types on demand

Database storage is scalable on demand

Page 13: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon RDS offers fast, predictable storage

General Purpose (SSD) for most workloads

Provisioned IOPS (SSD) for OLTP workloads up to 30,000 IOPS

Magnetic for small workloads with infrequent access

Page 14: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

High availability Multi-AZ deployments

Enterprise-grade fault tolerance solution for production databases

Automatic failover

Synchronous replication

Inexpensive and enabled with a few clicks

Page 15: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Choose Read Replicas for greater scalability

Bring data close to your customer’s applications in different regions

Relieve pressure on your master node for supporting reads and writes.

Promote a read replica to a master for faster recovery in the event of disaster

Page 16: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Choose cross-region replication for enhanced data locality, even more ease of migration

Even faster recovery in the event of disaster

Bring data close to your customers

Promote to a master for easy migration

Page 17: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Choose cross-region snapshot copy for even greater durability, ease of migration

Copy a database snapshot to a different AWS region

Warm standby for disaster recovery

Base for migration to a different region

Page 18: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

How Amazon RDS backups work

Automated backups
• Restore your database to a point in time
• Enabled by default
• Choose a retention period, up to 35 days

Manual snapshots
• Build a new database instance from a snapshot when needed
• Initiated by you
• Persist until you delete them
• Stored in Amazon S3

Page 19: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

You pay for the resources that you use

Monthly bill = N × duration for which DB instances were used + GB of storage consumed

• Instance charge: price depends on type of DB instance
• Storage charge: price depends on type of storage

Free tier (for first 12 months)
• 750 micro DB instance hours
• 20 GB of DB storage
• 20 GB for backups
• 10 million I/O operations

Further details at http://aws.amazon.com/rds/pricing/

Page 20: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Selected Amazon RDS customers

Page 21: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

What is Amazon DynamoDB?

Page 22: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon DynamoDB

NoSQL database

Fully managed

Single-digit millisecond latency

Massive and seamless scalability

Low cost

Page 23: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon DynamoDB: a managed document and key-value store

Simple and fast to deploy

Simple and fast to scale
• To millions of IOPS

Data is automatically replicated

Fast, predictable performance
• Backed by SSD storage

Secondary indexes offer fast lookups

No cost to get started; pay only for what you consume

Page 24: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Popular use cases

• Ad Tech: Ad serving, retargeting, ID lookup, user profile management, session-tracking, RTB
• IoT: Tracking state, metadata and readings from millions of devices, real-time notifications
• Gaming: Recording game details, leaderboards, session information, usage history, and logs
• Mobile & Web: Storing user profiles, session details, personalization settings, entity-specific metadata

Page 25: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Automatic replication for rock-solid durability and availability

Writes
• Replicated continuously to 3 AZs
• Persisted to disk (custom SSD)

Reads
• Strongly or eventually consistent
• No latency trade-off

Page 26: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon DynamoDB is a schemaless database

Table Items

Attributes (name-value pairs)

Page 27: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Each item must include a key

Hash key (DynamoDB maintains an unordered index)

Page 28: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Each item must include a key

Hash key

Range key (DynamoDB maintains a sorted index)
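The difference between the two key types can be pictured with a small in-memory model. This is only an illustrative sketch, not how DynamoDB is implemented: the `ToyTable` class, its methods, and the sample player data are all hypothetical. The hash key selects a group of items through an unordered dictionary lookup, and within that group the range key keeps items sorted so that a range of keys can be read in one pass.

```python
import bisect
from collections import defaultdict


class ToyTable:
    """Toy model: the hash key picks a group, the range key keeps it sorted."""

    def __init__(self):
        # hash key -> list of (range_key, item), kept sorted by range key
        self._partitions = defaultdict(list)

    def put(self, hash_key, range_key, item):
        rows = self._partitions[hash_key]
        keys = [k for k, _ in rows]
        pos = bisect.bisect_left(keys, range_key)
        if pos < len(rows) and rows[pos][0] == range_key:
            rows[pos] = (range_key, item)  # same full key: overwrite the item
        else:
            rows.insert(pos, (range_key, item))

    def query(self, hash_key, low, high):
        """Return items for one hash key whose range key lies in [low, high]."""
        rows = self._partitions.get(hash_key, [])
        keys = [k for k, _ in rows]
        start = bisect.bisect_left(keys, low)
        end = bisect.bisect_right(keys, high)
        return [item for _, item in rows[start:end]]


table = ToyTable()
table.put("player-1", 20160901, {"score": 10})
table.put("player-1", 20160929, {"score": 42})
table.put("player-1", 20160915, {"score": 17})
table.put("player-2", 20160910, {"score": 5})

# Items for player-1 with a range key between 20160910 and 20160930, in key order
late_september = table.query("player-1", 20160910, 20160930)
print(late_september)
```

A query names exactly one hash key and, optionally, a span of range keys; that is the access pattern the sorted index is there to serve.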

Page 29: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Local secondary indexes = alternate range keys

Hash key

Range key

LSI key

Page 30: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Global secondary indexes = “pivot charts” for your table

Choose which attributes to project (if any)

Page 31: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Define the desired performance using provisioned throughput

Read capacity units

Write capacity units

1 request per second (RPS) > 2.5 million requests in a month (1 × 86,400 seconds × 30 days = 2,592,000)

Page 32: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

DynamoDB: What are capacity units?

One write capacity unit
• One write per second, up to 1 KB

One read capacity unit
• One strongly consistent read per second, up to 4 KB
or
• Two eventually consistent reads per second
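As a rough sizing aid, the definitions above can be turned into a small calculator. This is a sketch under one assumption that the slide does not state: an item larger than the unit size is rounded up to the next multiple of 4 KB for reads or 1 KB for writes. The function names are made up for this example.

```python
import math


def read_capacity_units(item_size_kb, reads_per_second, strongly_consistent=True):
    """Read units needed for a steady read rate.

    One unit covers one strongly consistent read per second of up to 4 KB,
    or two eventually consistent reads per second.
    """
    units_per_read = math.ceil(item_size_kb / 4)  # assumed round-up to 4 KB steps
    total = units_per_read * reads_per_second
    if not strongly_consistent:
        total = total / 2  # eventually consistent reads cost half as much
    return math.ceil(total)


def write_capacity_units(item_size_kb, writes_per_second):
    """Write units needed: one unit covers one write per second of up to 1 KB."""
    units_per_write = math.ceil(item_size_kb)  # assumed round-up to 1 KB steps
    return units_per_write * writes_per_second


# 6 KB items read 10 times per second
print(read_capacity_units(6, 10))                              # strongly consistent
print(read_capacity_units(6, 10, strongly_consistent=False))   # eventually consistent

# 1.5 KB items written 10 times per second
print(write_capacity_units(1.5, 10))

# One request per second, sustained for a 30-day month
requests_per_month = 1 * 60 * 60 * 24 * 30
print(requests_per_month)
```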

Page 33: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Simple app architecture with Amazon DynamoDB

[Diagram labels: Clients; Elastic Load Balancing; Amazon EC2 app instances; Business logic; DynamoDB]

Page 34: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

How DynamoDB billing works

Monthly bill = GB of storage consumed (plus 100 bytes per item) + charge for write capacity units per hour + charge for read capacity units per hour

≈ 5 GB * $0.25 + 21 * 720 hrs * $0.0065/10 + 35 * 720 hrs * $0.0065/50 ≈ $14.35

Assumes DB instance accessed only from AWS region

Further details at http://aws.amazon.com/dynamodb/pricing/

Page 35: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

How DynamoDB billing works (with free tier)

Monthly bill = GB of storage consumed (plus 100 bytes per item) + charge for write capacity units per hour + charge for read capacity units per hour

≈ (5 − 25) GB * $0.25 + (21 − 25) * 720 hrs * $0.0065/10 + (35 − 25) * 720 hrs * $0.0065/50

Each free-tier allowance is subtracted from usage; a term that would go negative counts as zero.

Free tier (for first 12 months)
• 25 GB Storage
• 25 Units Write Capacity
• 25 Units Read Capacity

Assumes DB instance accessed only from AWS region

Further details at http://aws.amazon.com/dynamodb/pricing/

Page 36: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

How DynamoDB billing works (with free tier)

Monthly bill = GB of storage consumed (plus 100 bytes per item) + charge for write capacity units per hour + charge for read capacity units per hour

≈ 0 + 0 + 10 * 720 hrs * $0.0065/50 ≈ $0.94

Assumes DB instance accessed only from AWS region

Further details at http://aws.amazon.com/dynamodb/pricing/
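The billing arithmetic on these slides can be checked with a short script. This is a sketch that uses only the example rates shown in the slides ($0.25 per GB, $0.0065 per hour per 10 write units, $0.0065 per hour per 50 read units, 720 hours per month); current prices may differ, and the function name and parameters are invented for this example. It ignores the extra 100 bytes per item.

```python
HOURS_PER_MONTH = 720
STORAGE_PRICE_PER_GB = 0.25          # example rate from the slides
WRITE_PRICE_PER_UNIT_HOUR = 0.0065 / 10   # $0.0065 per hour for every 10 write units
READ_PRICE_PER_UNIT_HOUR = 0.0065 / 50    # $0.0065 per hour for every 50 read units


def dynamodb_monthly_bill(storage_gb, write_units, read_units,
                          free_gb=0, free_write_units=0, free_read_units=0):
    """Monthly bill = storage + write capacity charge + read capacity charge.

    Free-tier allowances are subtracted first; a term never goes below zero.
    """
    billable_gb = max(storage_gb - free_gb, 0)
    billable_writes = max(write_units - free_write_units, 0)
    billable_reads = max(read_units - free_read_units, 0)
    return (billable_gb * STORAGE_PRICE_PER_GB
            + billable_writes * HOURS_PER_MONTH * WRITE_PRICE_PER_UNIT_HOUR
            + billable_reads * HOURS_PER_MONTH * READ_PRICE_PER_UNIT_HOUR)


# 5 GB stored, 21 write units, 35 read units
without_free_tier = dynamodb_monthly_bill(5, 21, 35)
with_free_tier = dynamodb_monthly_bill(5, 21, 35, free_gb=25,
                                       free_write_units=25, free_read_units=25)

print(f"without free tier: ${without_free_tier:.2f}")
print(f"with free tier:    ${with_free_tier:.2f}")
```

With the numbers as printed, the first case works out to 1.25 + 9.828 + 3.276 = 14.354 and the second to 0.936.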

Page 38: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

NoSQL vs. SQL for a new app: how to choose?

NoSQL
• Schema-less, easy reads and writes, simple data model
• Scaling is easy
• Focus on performance and availability at any scale

SQL
• Strong schema, complex relationships, transactions and joins
• Scaling is difficult
• Focus on consistency over scale and availability

Page 39: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

What is Amazon Redshift?

Page 40: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon Redshift

a lot faster

a lot cheaper

a whole lot simpler

Relational data warehouse

Massively parallel; petabyte scale

Fully managed

HDD and SSD platforms

$1,000/TB/year; starts at $0.25/hour

Page 41: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Who uses Amazon Redshift?

Traditional enterprise DW
• Reduce costs by extending DW rather than adding HW
• Migrate completely from existing DW systems
• Respond faster to business; provision in minutes

Companies with big data
• Improve performance by an order of magnitude
• Make more data available for analysis
• Access business data via standard reporting tools

SaaS companies
• Add analytic functionality to applications
• Scale DW capacity as demand grows
• Reduce HW and SW costs by an order of magnitude

Page 42: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon Redshift architecture

Leader node
• Simple SQL endpoint
• Stores metadata
• Optimizes query plan
• Coordinates query execution

Compute nodes
• Local columnar storage
• Parallel/distributed execution of all queries, loads, backups, restores, resizes

Start at just $0.25/hour, grow to 2 PB (compressed)
• DC1: SSD; scale 160 GB–326 TB
• DS2: HDD; scale 2 TB–2 PB

[Diagram labels: SQL Clients/BI Tools; JDBC/ODBC; Leader Node; Compute Node (×3); each node 128 GB RAM, 16 TB disk, 16 cores; 10 GigE (HPC); Ingestion, Backup, Restore; Amazon S3/DynamoDB/Amazon EMR]

Page 43: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon Redshift dramatically reduces I/O

• Column storage

• Data compression

• Zone maps

• Direct-attached storage

• With row storage, you do unnecessary I/O

• To get total amount, you have to read everything

ID  | Age | State | Amount
----+-----+-------+-------
123 | 20  | CA    | 500
345 | 25  | WA    | 250
678 | 40  | FL    | 125
957 | 37  | WA    | 375

Page 44: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon Redshift dramatically reduces I/O

• Column storage
• Data compression
• Zone maps
• Direct-attached storage

• With column storage, you only read the data you need

ID  | Age | State | Amount
----+-----+-------+-------
123 | 20  | CA    | 500
345 | 25  | WA    | 250
678 | 40  | FL    | 125
957 | 37  | WA    | 375
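The row-versus-column point can be made concrete with the four sample records from the slides. The sketch below is a toy model, not Redshift's storage engine: it simply counts how many stored values have to be touched to total the Amount column under each layout. The function names are invented for this example.

```python
# The sample table from the slides, stored row by row
rows = [
    {"ID": 123, "Age": 20, "State": "CA", "Amount": 500},
    {"ID": 345, "Age": 25, "State": "WA", "Amount": 250},
    {"ID": 678, "Age": 40, "State": "FL", "Amount": 125},
    {"ID": 957, "Age": 37, "State": "WA", "Amount": 375},
]

# The same data stored column by column
columns = {name: [row[name] for row in rows] for name in rows[0]}


def total_amount_row_store(rows):
    """Row layout: every value of every row is read to reach Amount."""
    values_read = 0
    total = 0
    for row in rows:
        values_read += len(row)  # the whole row is read
        total += row["Amount"]
    return total, values_read


def total_amount_column_store(columns):
    """Column layout: only the Amount column is read."""
    amounts = columns["Amount"]
    return sum(amounts), len(amounts)


print(total_amount_row_store(rows))        # (total, values read)
print(total_amount_column_store(columns))  # (total, values read)
```

Both layouts give the same total; the column layout reaches it after reading a quarter of the values, and the gap widens as tables get wider.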

Page 45: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon Redshift dramatically reduces I/O

• Column storage
• Data compression
• Zone maps
• Direct-attached storage

• COPY compresses automatically
• You can analyze and override
• More performance, less cost

analyze compression listing;

 Table   | Column         | Encoding
---------+----------------+----------
 listing | listid         | delta
 listing | sellerid       | delta32k
 listing | eventid        | delta32k
 listing | dateid         | bytedict
 listing | numtickets     | bytedict
 listing | priceperticket | delta32k
 listing | totalprice     | mostly32
 listing | listtime       | raw

Page 46: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon Redshift dramatically reduces I/O

• Column storage

• Data compression

• Zone maps

• Direct-attached storage

10 | 13 | 14 | 26 |…

… | 100 | 245 | 324

375 | 393 | 417…

… 512 | 549 | 623

637 | 712 | 809 …

… | 834 | 921 | 959

10

324

375

623

637

959

• Track the minimum and maximum value for each block

• Skip over blocks that don’t contain relevant data
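A zone map can be sketched in a few lines. The example below is a toy model under stated assumptions: it uses only the values visible on the slide (the elided "…" values are left out rather than guessed), and the `find` function is invented for this example. It records the minimum and maximum of each block, then reads a block only when the value being searched for could fall inside that range.

```python
# The three blocks from the slide (elided values omitted)
blocks = [
    [10, 13, 14, 26, 100, 245, 324],
    [375, 393, 417, 512, 549, 623],
    [637, 712, 809, 834, 921, 959],
]

# Zone map: (minimum, maximum) for each block
zone_map = [(min(block), max(block)) for block in blocks]


def find(value):
    """Return (found, blocks_read), skipping blocks whose range excludes value."""
    blocks_read = 0
    for (low, high), block in zip(zone_map, blocks):
        if low <= value <= high:
            blocks_read += 1  # only now is the block itself read
            if value in block:
                return True, blocks_read
    return False, blocks_read


print(zone_map)
print(find(549))  # present in the second block
print(find(350))  # falls between blocks, so no block is read
```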

Page 47: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon Redshift dramatically reduces I/O

• Column storage

• Data compression

• Zone maps

• Direct-attached storage

• > 2 GB/sec scan rate
• Optimized for data processing
• High disk density

DW.HS1.XL: 16 GB RAM, 2 cores, 2 TB disk

DW.HS1.8XL: 128 GB RAM, 16 cores, 16 TB disk

Page 48: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Fully managed, continuous/incremental backups

• Multiple copies within cluster
• Continuous and incremental backups to Amazon S3
• Continuous and incremental backups across regions
• Streaming restore

[Diagram labels: Compute Node (×3); Amazon S3 in Region 1; Amazon S3 in Region 2]

Page 49: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon Redshift offers rock-solid fault tolerance

• Disk failures
• Node failures
• Network failure
• AZ/region level disasters

[Diagram labels: Compute Node (×3); Amazon S3 in Region 1; Amazon S3 in Region 2]

Page 50: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

You pay for what you use

Monthly bill = N × duration for which the nodes were used
• N = number of nodes
• Price depends on type of node

Free Tier (2 month free trial)
• 750 DC1.Large hours per month

Further details at https://aws.amazon.com/redshift/pricing/
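The node-hour formula is simple enough to sketch directly. The example below makes two assumptions that the slides do not confirm: it uses the $0.25/hour starting price quoted earlier in the deck as the per-node rate, and it treats the 750 free-trial hours as node-hours deducted before billing. The function name is invented for this example; check the pricing page for actual rates.

```python
def node_hours_bill(num_nodes, hours_used, price_per_node_hour, free_node_hours=0):
    """Monthly bill = number of nodes x hours used x hourly price per node.

    free_node_hours is subtracted from the total node-hours first
    (an assumption about how the free trial is applied).
    """
    node_hours = num_nodes * hours_used
    billable_hours = max(node_hours - free_node_hours, 0)
    return billable_hours * price_per_node_hour


# Two nodes running for a 720-hour month at the assumed $0.25/hour
print(node_hours_bill(2, 720, 0.25))
# Same cluster with 750 free node-hours deducted
print(node_hours_bill(2, 720, 0.25, free_node_hours=750))
# A single node fits entirely inside the free hours
print(node_hours_bill(1, 720, 0.25, free_node_hours=750))
```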

Page 51: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Redshift has a large ecosystem

• Data Integration
• Systems Integrators
• Business Intelligence

Page 52: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Selected Amazon Redshift customers

Page 53: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

What is Amazon ElastiCache?

Page 54: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

In-memory key-value store

High-performance

Resizable in-memory caching

Memcached and Redis

Fully managed; zero admin

Compatible with your existing applications

Amazon ElastiCache

Page 55: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Popular use cases

Caching layer for performance or cost optimization of an underlying database

Storage of ephemeral key-value data

High-performance application patterns such as leaderboards (for gaming users), session management, event counters, in-memory lists
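The first use case, a caching layer in front of a database, usually follows the cache-aside pattern: look in the cache, fall back to the database on a miss, then store the result for later reads. The sketch below illustrates the pattern only. `ToyCache` is a hypothetical in-process stand-in for a Memcached or Redis client, and `load_profile_from_database` is a placeholder for a real query.

```python
import time


class ToyCache:
    """Hypothetical stand-in for a cache client: a dict with expiry times."""

    def __init__(self):
        self._data = {}

    def get(self, key):
        entry = self._data.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if time.monotonic() >= expires_at:
            del self._data[key]  # entry has expired
            return None
        return value

    def set(self, key, value, ttl_seconds):
        self._data[key] = (value, time.monotonic() + ttl_seconds)


stats = {"database_calls": 0}


def load_profile_from_database(user_id):
    """Placeholder for a slow database query."""
    stats["database_calls"] += 1
    return {"user_id": user_id, "name": f"user-{user_id}"}


def get_profile(cache, user_id, ttl_seconds=60):
    """Cache-aside read: try the cache, fall back to the database, then fill."""
    key = f"profile:{user_id}"
    cached = cache.get(key)
    if cached is not None:
        return cached
    profile = load_profile_from_database(user_id)
    cache.set(key, profile, ttl_seconds)
    return profile


cache = ToyCache()
first = get_profile(cache, 7)   # miss: goes to the database
second = get_profile(cache, 7)  # hit: served from the cache
print(stats["database_calls"])
```

The time-to-live bounds how stale a cached entry can get; choosing it is a trade-off between database load and freshness.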

Page 56: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Key ElastiCache features

Memcached
• Fully managed
• Cache node auto-discovery
• Multi-AZ node placement

Redis
• Fully managed
• Multi-AZ with auto-failover
• Persistence
• Read replicas

Page 57: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon ElastiCache: simple app architecture

[Diagram labels: Clients; Elastic Load Balancing; Amazon EC2 app instances; Amazon ElastiCache; Amazon RDS]

Page 58: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Amazon ElastiCache: resilient app architecture

[Diagram labels: Clients; Elastic Load Balancing; AZ a; AZ b]

Page 59: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

How ElastiCache billing works

Monthly bill = N ×

Further details at http://aws.amazon.com/elasticache/pricing/

Duration for which the nodes were used

Number of nodes

(Price depends on type of node)

Free tier (for first 12 months)• 750 micro cache node hours

Page 60: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Selected ElastiCache customers

Page 61: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Managed DB services: better together

[Diagram labels: Clients; Elastic Load Balancing; AZ a; AZ b]

Page 62: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Next Steps

Free Tier

DynamoDB

RDS

ElastiCache

Redshift

Page 63: Getting Started with Managed Database Services on AWS - September 2016 Webinar Series

Thank you!