[Jun AWS 201] Technical Workshop
-
Upload
amazon-web-services-korea -
Category
Technology
-
view
999 -
download
1
Transcript of [Jun AWS 201] Technical Workshop
Agenda • House Keeping & Setup
• Introduction to Amazon Web Services
• Lab: Building a Web Property on AWS – Storage: S3, EBS
– Compute
– Content Delivery
– Relational Database
– Dynamo DB
• Lab: Scalability and Availability – Snapshots
– Load Balancing
– Auto Scaling
– Security
• Log Processing Scenario – Logging to AWS
– Elastic Map Reduce
Agenda • House Keeping & Setup
• Introduction to Amazon Web Services
• Building a Web Property on AWS – Storage: S3, EBS
– Compute
– Content Delivery
– Relational Database
• Scalability and Availability – Snapshots
– Load Balancing
– Auto Scaling
– Security
• Log Processing Scenario – Logging to AWS
– Elastic Map Reduce
– Dynamo DB
Edge Locations
Global AWS Infrastructure ] [
Dallas (2)
St.Louis
Miami
Jacksonville Los Angeles (2)
Palo Alto
Seattle
Ashburn (2)
Newark New York (3)
Dublin London (2) Amsterdam (2)
Stockholm
Frankfurt (2) Paris (2)
Singapore (2)
Hong Kong (2)
Tokyo (2)
Sao Paulo
South Bend
San Jose
Osaka Milan
Sydney
Hayward
Madrid
[ ]
Management +
Interface
Cross
Service
Features
CloudFormation Simple WorkFlow
CloudWatch
Elastic BeanStalk
IAM
The Amazon Web Services universe
(some services are omitted here)
Simple Notification Service
[ ]
Management +
Interface
Platform
Building
blocks
RDS
EMR
DynamoDB
CloudFront SQS
CloudSearch SES
Cross
Service
Features
The Amazon Web Services universe
(some services are omitted here)
Redshift
Infrastructure
Building
Blocks
[ ]
Management +
Interface
Platform
Building
blocks
Cross
Service
Features
Glacier EBS S3 Virtual Private Cloud
The Amazon Web Services universe
(some services are omitted here)
EC2
[ ]
Management +
Interface
RDS CloudSearch SES CloudFront SQS EMR DynamoDB
Elastic Beanstalk Simple WorkFlow
CloudFormation CloudWatch
IAM
VPC EC2 EBS S3
(some services are omitted here)
Infrastructure
Building
Blocks
Platform
Building
blocks
Cross
Service
Features Command Line Web Console
APIs SDK
The Amazon Web Services universe
Agenda • House Keeping & Setup
• Introduction to Amazon Web Services
• Building a Web Property on AWS – Storage: S3, EBS
– Compute
– Content Delivery
– Relational Database
– Dynamo DB
• Scalability and Availability – Snapshots
– Load Balancing
– Auto Scaling
– Security
• Log Processing Scenario – Logging to AWS
– Elastic Map Reduce
Labs
During this workshop, we will build from scratch a highly available, redundant,
scalable web property on AWS.
Storage
Amazon Simple Storage Service (S3)
Amazon S3 Bucket Volume Snapshot Amazon Elastic
Block Storage
(EBS)
Amazon Elastic Block Storage (Amazon EBS)
Our Building Blocks
Compute
Our Building Blocks
Amazon Elastic Compute Cloud (EC2)
Amazon EC2 Instance Instances AMI DB on Instance
Instance with CloudWatch
Elastic IP
Content Delivery & Database
Our Building Blocks
Amazon Database Services RDS/DDB
Amazon RDS MySQL DB Instance DynamoDB
Amazon Cloudfront
Amazon Cloudfront
Download Distribution
Edge Location Streaming Distribution
• Object-based storage
(no Filesystem)
• Easily store/retrieve data
• Durability of 99.999999999%
or 99.99%
• Integrated with other AWS
Services
• Scalable
• Redundancy is managed
transparently
• File (Object): up to 5 TB each
• HTTP, HTTPS, BitTorrent
protocols
28
Amazon S3
Simple Storage Service
Data
Any Amazon S3 Region
Your Data
Data
Data
Data Data
Amazon S3 Redundancy
Data is replicated
multiple times
In case of failure,
data is replicated again,
transparently
• Backup
• Archive (Glacier)
• Disaster Recovery (DR)
• Content Storage
• Website
30
Amazon S3 Use cases
• Delivery of content
worldwide
• Static: HTTP / HTTPS
• Streaming: RTMP
• Dynamic Content
• Origin server: S3,
EC2, non-AWS
• Public/Private
Distributions
33
Amazon CloudFront
Content Delivery Network
Regions, Availability Zones, Edge Locations
Dallas
St.Louis Miami
Jacksonville
Los Angeles (2)
Palo Alto
Seattle
Ashburn
Newark
New York (2)
Dublin
London Amsterdam
Stockholm
Frankfurt Paris
Singapore
Hong Kong
Tokyo
Sao Paulo
South Bend
San Jose
38 Edge Locations in total (as of Dec 2012)
Sydney
Let’s simplify a bit: we consider only a few of them
Stockholm
Hong Kong
Sao Paulo
San Jose
Your web servers
in Singapore
Stockholm
Hong Kong
Sao Paulo
San Jose
Content Delivery Network: How it works
Dynamic pages (PHP, Java) (from web servers)
Static content or streaming (with CloudFront)
Your web servers
in Singapore
• Accelerated web content delivery
• Off-load traffic from web servers
• Big spikes in traffic
• Event streaming
• Marketing campaigns
37
Amazon CloudFront Use cases
• Linux / Windows virtual
machines
• Elastic (scale in minutes)
• Many Instance types
(micro, small, large...)
• AMI: Amazon Machine
Image
• VPC: Virtual Private Cloud
• Security (Groups,
Keys...)
• VM Import
• Auto Scaling, Elastic
Load Balancing
40
Amazon EC2
Elastic Compute Cloud
• Media
• Hosting
• High Performance
Computing
• Dev & Test
• Internal Applications
• Gaming
• ... Everything that
needs computing!
42
EC2 Use cases
Lab Exercise • Create a new Security Group • Launch an Amazon EC2 instance (Linux) • Log in with SSH as ec2-user@ • Install a web server • Create a simple web page • Test it on a browser: it works! • Create and attach an Elastic IP • Create an AMI from an EC2 Instance
• Block Level Storage for use
with EC2
• Volume: 1 GB to 1 TB
• Raw unformatted block
device
• Local to an Availability Zone
• Redundant
• Persistent
• Point-in-time snapshots to
Amazon S3
• Integration with CloudWatch
46
EBS
Elastic Block Storage
• Relational Database “as a
Service”
• Simple to Deploy
• Managed by the AWS team
• MySQL, SQLServer or
Oracle (as of 2012)
• Scalable
• Optional: automatic
Standby Replica
• Optional: multiple Read-
Only copies
• Easy DB Snapshots and
automated backup
49
RDS
Relational Database Service
Demo RDS • Create a DB Instance on RDS (MySQL)
• Enable Multi-AZ Deployment
• Enable one Read Replica
• Optional: connect to the DB Instance
Agenda • House Keeping & Setup
• Introduction to Amazon Web Services
• Lab: Building a Web Property on AWS – Storage: S3, EBS
– Compute
– Content Delivery
– Relational Database
– Dynamo DB
• Lab: Scalability and Availability – Snapshots
– Load Balancing
– Auto Scaling
– Security
• Log Processing Scenario – Logging to AWS
– Elastic Map Reduce
EBS
Elastic Load Balancer
• Automatically distribute incoming traffic to multiple Amazon EC2 instances (in the same Region).
• Automatic Health check
• IPv6 support
• Can be integrated with AutoScaling
Snapshots & AMIs
• Copies of EBS Volumes
• Essential to Reusability
• Copy between Regions
• Durability in S3
Lab Exercise • Duplicate your entire architecture by
making an AMI
• Increase your availability by spreading your application across availability zones
• Bring up an ELB in front of your website
• Optional – Create a CNAME to the ELB
CloudWatch
• Integrated across AWS platform
• Nothing to install
• Custom Metrics
• Set Alarms
• View graphs and statistics
• Feeds into Autoscaling
AutoScaling
• Auto Up and Auto Down
• Runs on CloudWatch metrics
• Notifications via SNS
• Spot or On-demand
• No additional Fees
AutoScaling
• Launch config: AMI to be used
• Autoscaling group: where/how to launch
• Autoscaling policy: what should AS do
• Autoscaling trigger: what will activate AS
Security
• Security Groups
• Granular tiered secure architecture
• Roles for services
• Best Practices - Bastions
Before we get started on the controls…
• AWS Reports, Certifications & Accreditations • SOC 1, Type 2 report • SOC 2 report • ISO 27001 • PCI DSS Level 1 service provider • FISMA Moderate • MPAA
• Look at http://aws.amazon.com/security
Agenda • House Keeping & Setup
• Introduction to Amazon Web Services
• Lab: Building a Web Property on AWS – Storage: S3, EBS
– Compute
– Content Delivery
– Relational Database
– Dynamo DB
• Lab: Scalability and Availability – Snapshots
– Load Balancing
– Auto Scaling
– Security
• Log Processing Scenario – Logging to AWS
– Elastic Map Reduce
Getting your Data into S3
S3 Console Upload
FTP
S3 API
AWS Import / Export
Direct Connect
Tsunami UDP
Storage Gateway
3rd Party Commercial Applications
CloudFront Flume
AWS Data Pipeline
Introducing Apache Hadoop
• Apache Hadoop • Software for distributed data analysis
• Map/Reduce framework
• Focus on data
EMR is Hadoop in the Cloud
Hadoop is an open-source framework for parallel processing huge amounts of data on a cluster of machines
What is Amazon Elastic MapReduce (EMR)?
How does it work?
EMR
EMR Cluster S3
Put the data into S3
Choose: Hadoop distribution, # of nodes, types of nodes, custom
configs, Hive/Pig/etc.
Get the output from S3
Launch the cluster using the EMR console, CLI, SDK,
or APIs
You can also store everything in HDFS
011001101
Watch out for unexpected Costs
When the Technical Workshop comes to an end, to avoid unwanted costs: • Delete your S3 objects • Destroy your CloudFront distributions • Stop or Shut Down your EC2 and RDS instances
The customer is responsible for the resources he’s using. AWS declines any responsibility if the customer forgets to shut down resources.