To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly...

21
To Relay or Not to Relay for Inter-Cloud Transfers? Fan Lai, Mosharaf Chowdhury, Harsha Madhyastha

Transcript of To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly...

Page 1: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

To Relay or Not to Relay for Inter-Cloud Transfers?

Fan Lai, Mosharaf Chowdhury, Harsha Madhyastha

Page 2: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

• Over 40 Data Centers (DCs) on EC2, Azure, Google Cloud• A geographically denser set of DCs across clouds

• Cloud apps host on multiple DCs• Web search, Interactive Multimedia• Low latency access, privacy regulations

• Massive data across geo-distributed DCs

Background

Page 3: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

WAN is Crucial for Geo-distributed Service

• Bandwidth-intensive transfers• Geo-distributed replication: Web search, cloud storage• Inter-DC Routing: SWAN[SIGCOMM’13], Pretium[SIGCOMM’16], etc• Big data analytics: Iridium [SIGCOMM’15], Clarinet [OSDI’16] …• …

• Latency-sensitive traffic• Interactive service: Skype, Hangout• Transaction processing: SPANStore[SOSP’13], Carousel[SIGMOD’18], etc• …

Page 4: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

•WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant regions[1]

Prior Efforts: WAN b/w varies spatially

VM WANSao Paulo Singapore

Relay:

VM

WAN

VMVirginia

WAN

Direct:

≈3x

Bandwidth Measurement across 11 EC2 regions[1]

[1] “Gaia: Geo-Distributed Machine Learning Approaching LAN Speeds.” NSDI’17

Page 5: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

• Reproduce prior measurements• 11 EC2 regions, 110 inter-DC pairs• Tools: iperf (TCP)

• Heterogeneous link capacity• Varies between the same type of VMs• Lower b/w between distant regions

• Relay should work pretty well

WAN Bandwidth Varies Spatially

Page 6: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

About 40% percent data transfers between EC2 regions can have more than 1.5x bandwidth increase via relay

Bandwidth improvement via best relay on EC2

40%

Page 7: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

How to identify and tackle this complicated WAN?

- Heterogeneous across regions- Dynamic runtime environment- Great complexity in sys design

Page 8: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

How to identify and tackle this complicated WAN?

- Heterogeneous across regions- Dynamic runtime environment- Great complexity in sys design

Assumptions in prior measure-ments:

- Default TCP setting works well- Single TCP is representative

enough for the available b/w

Page 9: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

What if we Break Down these assumptions ?

- Default TCP setting works well- Single TCP is representative

enough for the available b/w

#1: Whether the b/w still varies spatially ?

#2: Whether the b/w still varies temporally? #3: How much room for WAN improvement via relay?

Page 10: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Default TCP Setting may be Sub-optimal

• B/w varies across regions• Lower b/w between distant regions• RTT varies across regions

• Max TCP window is bounded• TCP throughput is RTT-based

•Google: Bandwidth to Iowa

Page 11: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Default TCP Setting is Sub-optimal

• B/w varies across regions• Lower b/w between distant regions• RTT varies across regions

• Max TCP window is bounded• TCP throughput is RTT-based

• Per-TCP rate limit on the WAN

Google: Bandwidth to Iowa

Page 12: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Single TCP is not Representative

• Single TCP underutilize the b/w• Use multiple TCPs

• Per-VM cap for outbound rate• Per-TCP rate limit < Per-VM cap

• Aggregate b/w is homogeneous• VM-cap works on all connections

Google: Bandwidth to Iowa

Page 13: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

What if we Break Down these assumptions ?

- Default TCP setting works well- Single TCP is representative

enough for the available b/w

#1: Whether the b/w still varies spatially ? Often Homogeneous

#2: Whether the b/w still varies temporally? #3: How much room for WAN improvement via relay?

Page 14: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Available B/w is often Stable

• Measurement setup• Create/terminate connections

• Inter-DC connections share the VM-cap

•Create new connections

Google: Throughput from Iowa

Page 15: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Available B/w is often Stable

• Measurement setup• Create/terminate connections

• Inter-DC connections share the VM-cap

Google: Throughput from Iowa

Terminate connections

Page 16: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Available B/w is often Stable

• Measurement setup• Create/terminate connections

• Inter-DC connections share the VM-cap

• Max b/w (VM cap) is stable

Google: Throughput from Iowa

Aggregate b/w is stable

Page 17: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Maximum available bandwidth - Homogeneous across regions - Stable over time - Varies with VM instances - Performance can be predict-

able w/o great sys complexity

What will happen if the b/w is homogeneous ?

Homogeneous bandwidth

Page 18: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Little Scope for Optimization via Inter-DC Relay

What will happen if the b/w is homogeneous ?

Homogeneous bandwidth

Latency Measurement across 40 DCs

Page 19: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

• Intra-DC relay from poor performance VMs to high performance VMs• Gain more inter-DC bandwidth without extra costs for transfers• Routing through a third DC takes your money away

VM

VMDC 2

Takeaway

VM

VM

Intra-DC relay

DC 1

$

0 + $ + 0 = $

VMVM

VM

$

$ + $ = 2$

Inter-DC routing

DC 1 DC 2

DC 3

Page 20: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Takeaway

• Turn to the optimization of bandwidth contentions inside VMs• VM-cap VS link-level optimizations used in existing GDA work• VM-aware VS WAN-aware

• Bandwidth measurements are far from complete• More than 40 VM instance types

VM

VM VMVM

b1b2

bn ∑bi ≤ VM-cap

Page 21: To Relay or Not to Relay for Inter-Cloud Transfers? · • WAN bandwidth(b/w) varies significantly between different regions • Close regions have more than12× of the b/w than distant

Thank you! Questions?

[email protected]

#1: Whether the b/w still varies spatially ? Often Homogeneous

#2: Whether the b/w still varies temporally? Often Stable #3: How much room for WAN improvement via relay?

Case by case