A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso...

25
Performance Evaluation of RDMA over IP: A Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, S. Narravula, G. Brown, K. Vaidyanathan, P. Balaji, and D.K. Panda Network-Based Computing Laboratory Department of Computer Science and Engineering The Ohio State University { jinhy, narravul, browngre, vaidyana, balaji, panda}@cse.ohio-state.edu

Transcript of A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso...

Page 1: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Performance Evaluation of RDMA over IP:A Case Study with the Ammasso Gigabit Ethernet NIC

H.-W. Jin, S. Narravula, G. Brown,

K. Vaidyanathan, P. Balaji, and D.K. Panda

Network-Based Computing Laboratory

Department of Computer Science and EngineeringThe Ohio State University

{ jinhy, narravul, browngre, vaidyana, balaji, panda}@cse.ohio-state.edu

Page 2: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Contents

• Introduction

• WAN Emulator for Cluster-of-Clusters

• Performance Evaluation of RDMA over IP

• Conclusions and Future Work

Page 3: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Introduction

• Sockets over TCP/IP• RDMA over LAN

– InfiniBand, Myrinet, Quadrics– HPC middleware (MPI) and file systems (PVFS)

• RDMA over WAN– iWARP, RDDP– Grid and Internet applications

• RDMA-enabled Gigabit Ethernet NIC– Ammasso

Page 4: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Ammasso Gigabit Ethernet NICApplications

Sockets Interface CCIL(Cluster Core Interface Lang.)

Sockets

TCP

IP

Device Driver

Gigabit Ethernet

RDMA

TOE(TCP/IP Offload Engine)

Am

masso

Gigabit E

thernet NIC

Ope

ratin

g S

yste

m

Page 5: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Problem Statement

• There have been no comprehensive quantitative evaluations of RDMA over WAN environment

• How to Emulate the WAN Environment?

• What Kind of Performance Metrics?

• Sockets vs. CCIL

Page 6: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Contents

• Introduction

• WAN Emulator for Cluster-of-Clusters

• Performance Evaluation of RDMA over IP

• Conclusions and Future Work

Page 7: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Experimental WAN Setup

GigESwitch

GigESwitch

IP

eth0 eth1

Device Driver

Linux Workstation-basedRouter

IP Network A IP Network BWANEmulation

Page 8: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

WAN Emulator for Cluster-of-Clusters

• Characteristics of WAN Environments– High network delay– Packet loss– Etc.

• User-Level or Kernel-Level Emulator?

• Blocking or Queueing based Delay Adding?

Page 9: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Degen: Delay generator

eth0 eth1

Device Driver Device Driver

Routing Decision Degen Netfilter

Timestamp delay queue

reinjection

IP

Degen Kernel Module

Dgen DaemonWAN Emulator for Cluster-of-Clusters

Page 10: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Kernel Patch for CCIL WAN Communication

• Ammasso Setup– Ammasso 1100– Ammasso software version amso1100-1.2-ga2

• Packet Drops for CCIL WAN Communication– Timeout– Retransmission

• Kernel Patch on Router

Page 11: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Contents

• Introduction• WAN Emulator for Cluster-of-Clusters• Performance Evaluation of RDMA over IP

– Basic communication latency– Computation and communication overlap– Communication progress– CPU resource requirements– Unification of communication interface– Bandwidth (throughput)

• Conclusions and Future Work

Page 12: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Basic Communication Latency

0

50

100

150

200

250

300

350

400

450

4 8 16 32 64 128

256

512

1024

2048

4096

8192

1638

4

Message Size (Byte)

Late

ncy

(us)

SocketsCCIL

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

0 1 2 4 8Network Delay (ms)

Late

ncy

(us)

SocketsCCIL

• No impact of zero-copy on the basic communication latency• Basic communication is not an important metric

1KB Message Size

Page 13: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Computation and Communication Overlap

Router SwitchSwitchn0 n1

Computation(t1)

TotalTime(t2)

Overlap Ratio = t1/ t2

Send

Receive

Page 14: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Computation and Communication Overlap

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 61 122 182 242 302 362 422

Computation (ms)

Ove

rlap

Rat

io

SocketsCCIL

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 1 2 4 8Network Delay (ms)

Ove

rlap

Rat

io

SocketsCCIL

• RDMA can achieve a better computation and communication overlap• Its benefit reduces as the network delay increases

1KB Message Size 242ms Computation1098%

114%

Page 15: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Communication Progress

Router SwitchSwitchn0 n1

Response DelayBy Load

DataFetchingLatency

Request

Response

Page 16: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Communication Progress

1

10

100

1000

10000

100000

0 1 4 16 64

Response Delay by Load (ms)

Late

ncy

(us)

SocketsCCIL

1

10

100

1000

10000

100000

0 1 2 4 8

Network Delay (ms)

Late

ncy

(us)

• RDMA can achieve a better communication progress• Its benefit reduces as the network delay increases

16ms Response Delay1KB Message Size

98% 65%

Page 17: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

CPU Resource Requirements

Router SwitchSwitchn0 n1

… 40 Streams

Application

Application Execution Time?

Page 18: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

CPU Resource Requirements

0

5

10

15

20

25

30

35

40

45

50

1K 2K 4K 8K 16K

Message Size (Byte)

Exe

cutio

n T

ime

(Sec

)

SocketsCCIL

0

5

10

15

20

25

30

35

40

45

50

0 1 2 4 8

Network Delay (ms)

Exe

cutio

n T

ime

(Sec

)

• RDMA-based communication does not affect to the application execution time• RDMA has a strong potential of saving the CPU resource

16KB Message Size

Page 19: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Unification of Communication Interface

switch

switch

Inter-Cluster

Intra-Cluster

0

50

100

150

200

250

4 8 16 32 64 128

256

512

1024

2048

4096

8192

1638

4

Message Size (Byte)

Late

ncy

(us)

SocketsCCIL

• RDMA over IP can provide a unified communication interface• RDMA can achieve lower latency for intra-cluster communication

38%

Page 20: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Bandwidth

• Where is the bottleneck?• Ethernet devices on the router• TCP window size

16KB Message Size

0

100

200

300

400

500

600

4 8 16 32 64 128

256

512

1024

2048

4096

8192

1638

4

Message Size (Byte)

Ban

dwid

th (

Mbp

s)

SocketsCCIL

0

50

100

150

200

250

300

350

400

450

500

0 1 2 4 8

Network Delay (ms)

Ban

dwid

th (

Mbp

s)

SocketsCCIL

Page 21: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Contents

• Introduction

• WAN Emulator for Cluster-of-Clusters

• Performance Evaluation of RDMA over IP

• Conclusions and Future Work

Page 22: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Conclusions

• The first quantitative study of RDMA over IP on a WAN setup

• WAN Emulator for Custer-of-Clusters– Degen

• RDMA over IP Can– Save CPU resource on the server side even on a high

delay WAN environment– Achieve better

• computation and communication overlap• communication progress• peak bandwidth

– Provide unified interface

Page 23: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Future Work

• Performance Evaluations– Other performance factors

• impact of address exchange• bandwidth

– Application-level performance

• WAN Emulator for Cluster-of-Clusters– Delay model– Other components

• RDMA-aware Middleware for Widely Distributed Systems over WAN

Page 24: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Acknowledgements

Our research is supported by the following organizations:

• Current Funding support by

• Current Equipment donations by

Page 25: A Case Study with the Ammasso Gigabit Ethernet · PDF fileA Case Study with the Ammasso Gigabit Ethernet NIC H.-W. Jin, ... WAN Emulator for ... – Ammasso software version amso1100-1.2-ga2

Thank You

{ jinhy, narravul, browngre, vaidyana, balaji, panda}@cse.ohio-state.edu

Network-Based Computing Laboratoryhttp://nowlab.cse.ohio-state.edu/