Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

K. Vaidyanathan, S. Narravula, P. Balaji and D. K. Panda, Network Based Computing Laboratory (NBCL), Computer Science and Engineering, Ohio State University


Page 1: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

K. Vaidyanathan, S. Narravula, P. Balaji and D. K. Panda

Network Based Computing Laboratory (NBCL)

Computer Science and Engineering

Ohio State University

Page 2: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Introduction and Motivation

• Interactive data-driven applications
– Scientific as well as enterprise/commercial applications
    • Static datasets: medical imaging modalities
    • Dynamic datasets: stock value datasets, e-commerce, sensors
– Need for interacting with, synthesizing and visualizing large datasets
– Data-centers enable such capabilities
• Clients initiate queries (over the web) to process specific datasets
– Data-centers process data and reply to queries

Page 3: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Typical Multi-Tier Data-center Environment

• Requests are received from clients over the WAN
• Proxy nodes perform caching, load balancing, resource monitoring, etc.
• If not cached, the request is forwarded to the next tier (Application Server)
• Application server performs the business logic (CGI, Java servlets, etc.)
– Retrieves appropriate data from the database to process the requests

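[Figure: multi-tier data-center. Clients reach the data-center over the WAN; the tiers shown are Proxy Server, Web-server (Apache), Application Server (PHP), Database Server (MySQL) and Storage. Annotation: More Computation and Communication Requirements.]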

Page 4: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Overview of Research

• Propose a novel framework for next-generation data-centers
– Delivering performance and scalability
– Providing advanced features such as active caching, fine-grained resource monitoring, dynamic resource adaptation, etc.
• Novel approaches using the advanced features of InfiniBand and other RDMA-enabled networks
– Resilient to the load on the back-end servers
– Order-of-magnitude performance gain for several scenarios
– Exploit features like RDMA and remote atomic operations for new primitives and services
• Three-layer architecture
– Advanced Communication Protocol Support
– Data-Center Primitives
– Data-Center Services
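As an illustration of how a remote atomic operation can serve as a building block for a primitive, here is a minimal sketch of a try-lock built on compare-and-swap. It is a single-process simulation, not the authors' lock manager design: `RemoteWord`, `try_acquire` and `release` are hypothetical names, and a `threading.Lock` stands in for the atomicity a network adapter would provide.

```python
import threading

UNLOCKED = 0  # assumed encoding: 0 means free, any other value is the owner's id


class RemoteWord:
    """Simulated 64-bit word in a remote node's memory (hypothetical stand-in)."""

    def __init__(self, value=UNLOCKED):
        self._value = value
        # Stands in for the atomicity guarantee of a remote atomic operation.
        self._guard = threading.Lock()

    def compare_and_swap(self, expected, new):
        """Set the word to `new` only if it equals `expected`; return the old value."""
        with self._guard:
            old = self._value
            if old == expected:
                self._value = new
            return old


def try_acquire(word, node_id):
    """One compare-and-swap attempt: succeeds only if the lock was free."""
    return word.compare_and_swap(UNLOCKED, node_id) == UNLOCKED


def release(word, node_id):
    """Only the current owner can release the lock."""
    return word.compare_and_swap(node_id, UNLOCKED) == node_id


lock_word = RemoteWord()
got_a = try_acquire(lock_word, node_id=1)        # node 1 takes the lock
got_b = try_acquire(lock_word, node_id=2)        # node 2 is refused
released = release(lock_word, node_id=1)         # node 1 gives it back
got_b_retry = try_acquire(lock_word, node_id=2)  # node 2 now succeeds
```

The point of the sketch is that the node holding the lock word runs no code on the lock path; every attempt is a single atomic operation issued by the requester.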

Page 5: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Proposed Architecture

[Figure: layered architecture diagram. Labels recovered from the slide, top to bottom:
– Existing Data-Center Components
– Advanced System Services: Dynamic Content Caching (Active Caching, Cooperative Caching); Active Resource Adaptation (Dynamic Reconfiguration, Resource Monitoring)
– Data-Center Service Primitives: Soft Shared State, Distributed Lock Manager, Global Memory Aggregator, Distributed Data Sharing Substrate, Point To Point
– Advanced Communication Protocols and Subsystems: Sockets Direct Protocol, Protocol Offload, Packetized Flow-control, Async. Zero-copy Communication
– Network: RDMA, Atomic, Multicast]

Page 6: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Publications (So Far)

• Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data-Centers over InfiniBand, CCGrid 2005
• On the Provision of Prioritization and Soft QoS in Dynamically Reconfigurable Shared Data-Centers over InfiniBand, ISPASS 2005
• Asynchronous Zero-copy Communication for Synchronous Sockets in the Sockets Direct Protocol (SDP) over InfiniBand, CAC 2006
• Designing Efficient Cooperative Caching Schemes for Multi-Tier Data-Centers over RDMA-enabled Networks, CCGrid 2006
• Exploiting RDMA operations for Providing Efficient Fine-Grained Resource Monitoring in Cluster-Based Servers, RAIT 2006
• DDSS: A Low-Overhead Distributed Data Sharing Substrate for Cluster-Based Data-Centers over Modern Interconnects, HiPC 2006
• High Performance Distributed Lock Management Services using Network-based Remote Atomic Operations, CCGrid 2007

http://nowlab.cse.ohio-state.edu/projects/data-centers/index.html

Page 7: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Sockets Direct Protocol: Throughput and Overlap

[Figure, two panels. "Throughput": throughput (Mbps) vs. message size (bytes) for BSDP, ZSDP and AZ-SDP. "Comp./Comm. Overlap": throughput (Mbps) vs. delay (usec) for BSDP, ZSDP and AZ-SDP.]

Asynchronous Zero-copy Communication for Synchronous Sockets in the Sockets Direct Protocol (SDP) over InfiniBand, P. Balaji, S. Bhagvat, H.-W. Jin and D. K. Panda. Workshop on Communication Architecture for Clusters (CAC); with IPDPS '06.

Page 8: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Presentation Layout

Introduction and Motivation

Cooperative Caching Services

Resource Monitoring Services

Conclusions and Ongoing Work

Page 9: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Cooperative Caching Services

• Aggregate cache benefits are well known
• Performance considerations
– Two-sided operations vs. one-sided RDMA operations
– Placement of data (local vs. remote)
– Controlling data redundancy
– Utilizing available remote memory
– Load-sensitive protocols
• Objective
– Can we design efficient cooperative caching schemes that use the idle resources in the data-center and the RDMA capabilities of the network, and that eliminate redundancy to make the most of the available system cache size?
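The considerations above (local vs. remote placement, redundancy control, use of idle remote memory) can be made concrete with a small simulation. This is a sketch under stated assumptions, not the schemes evaluated in the talk: `ProxyNode`, `CooperativeCache` and the shared `directory` are hypothetical, there is no eviction, and a plain dictionary lookup on a peer stands in for a one-sided RDMA read.

```python
class ProxyNode:
    """A proxy with a bounded in-memory cache (capacity counted in documents)."""

    def __init__(self, name, capacity):
        self.name = name
        self.capacity = capacity
        self.cache = {}  # document id -> content

    def has_room(self):
        return len(self.cache) < self.capacity


class CooperativeCache:
    """Keeps at most one cached copy of each document across all proxies."""

    def __init__(self, nodes, fetch_from_backend):
        self.nodes = {node.name: node for node in nodes}
        self.directory = {}  # document id -> name of the node holding the copy
        self.fetch_from_backend = fetch_from_backend

    def get(self, node_name, doc_id):
        """Return (content, source); source is 'local', 'remote' or 'backend'."""
        owner = self.directory.get(doc_id)
        if owner == node_name:
            return self.nodes[owner].cache[doc_id], "local"
        if owner is not None:
            # Stand-in for a one-sided read from the peer's memory.
            return self.nodes[owner].cache[doc_id], "remote"
        content = self.fetch_from_backend(doc_id)
        # Prefer local placement; otherwise use any peer with idle memory.
        target = self.nodes[node_name]
        if not target.has_room():
            target = next((n for n in self.nodes.values() if n.has_room()), None)
        if target is not None:
            target.cache[doc_id] = content
            self.directory[doc_id] = target.name
        return content, "backend"


backend_calls = []


def backend(doc_id):
    backend_calls.append(doc_id)
    return f"<page {doc_id}>"


coop = CooperativeCache([ProxyNode("p0", 1), ProxyNode("p1", 1)], backend)
r1 = coop.get("p0", "a")  # miss everywhere: fetched from backend, stored on p0
r2 = coop.get("p1", "a")  # served from p0's memory, no second copy made
r3 = coop.get("p0", "b")  # p0 is full, so the copy is placed on p1
r4 = coop.get("p1", "b")  # p1 finds it in its own cache
```

Because each document is stored once, two proxies with one slot each hold two distinct documents, which is the sense in which removing redundancy increases the effective cache size.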

Page 10: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Data-Center Throughput with Cooperative Caching

[Figure: bar chart of TPS (axis 0 to 35,000) with 8 proxy nodes. Schemes: Apache Caching, Basic Cooperative Caching, Cooperative Caching Without Redundancy, Multi-Tier Aggregate Cooperative Caching, Hybrid Cooperative Caching. Series: 8k, 16k, 32k, 64k.]

• Our schemes achieve significant performance gain over basic Apache Caching (AC)

Designing Efficient Cooperative Caching Schemes for Multi-Tier Data-Centers over RDMA-enabled Networks, S. Narravula, H.-W. Jin, K. Vaidyanathan and D. K. Panda. In International Symposium on Cluster Computing and the Grid (CCGrid), 2006

Page 11: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Presentation Layout

Introduction and Motivation

Data-center Service Primitives

Cooperative Caching Services

Resource Monitoring Services

Conclusions and Ongoing Work

Page 12: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Resource Monitoring Services

• Traditional approaches
– Coarse-grained in nature
– Assume resource usage is consistent throughout the monitoring granularity (on the order of seconds)
• This assumption is no longer valid
– Resource usage is becoming increasingly divergent
• Fine-grained monitoring is desired but has additional overheads
– High overheads, less accurate, slow in response
• Can we design a fine-grained resource monitoring scheme that reports resource usage accurately with low overhead?

Page 13: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Synchronous Resource Monitoring using RDMA (RDMA-Sync)

[Figure: a front-end node and a back-end node, each with user space, kernel space, memory, CPU and application threads. Other labels: front-end monitoring process, /proc, kernel data structures, and an RDMA link between the two nodes.]
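The slide title and diagram labels suggest a front-end monitoring process that reads load information from back-end memory with RDMA. The sketch below simulates that access pattern in a single process, as an illustration only: the record layout `LOAD_FORMAT`, the `BackEndNode` class and the `rdma_read` helper are all assumptions, and a byte copy stands in for a real one-sided read.

```python
import struct

# Hypothetical record layout: run-queue length, thread count, free memory in bytes.
LOAD_FORMAT = "<IIQ"
REGION_SIZE = struct.calcsize(LOAD_FORMAT)


class BackEndNode:
    """Holds a memory region that is assumed to be registered for remote reads."""

    def __init__(self):
        self.registered_region = bytearray(REGION_SIZE)

    def kernel_update(self, run_queue, threads, free_bytes):
        """Models the kernel refreshing its own data structures in place."""
        struct.pack_into(LOAD_FORMAT, self.registered_region, 0,
                         run_queue, threads, free_bytes)


def rdma_read(region, offset, length):
    """Stand-in for a one-sided read: copies bytes, runs no back-end code."""
    return bytes(region[offset:offset + length])


def poll_load(back_end):
    """Front-end side: fetch the raw record and decode it locally."""
    raw = rdma_read(back_end.registered_region, 0, REGION_SIZE)
    run_queue, threads, free_bytes = struct.unpack(LOAD_FORMAT, raw)
    return {"run_queue": run_queue, "threads": threads, "free_bytes": free_bytes}


node = BackEndNode()
node.kernel_update(run_queue=3, threads=120, free_bytes=2 * 1024**3)
first = poll_load(node)
node.kernel_update(run_queue=9, threads=140, free_bytes=1 * 1024**3)
second = poll_load(node)  # the next poll sees the new values immediately
```

In this model the back-end never sends anything and no monitoring process runs there, so the cost of polling more often falls on the front-end and the network rather than on the loaded server.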

Page 14: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Impact of Fine-grained Monitoring with Applications

[Figure: "Impact on RUBiS and Zipf Traces". % improvement (axis 0% to 40%) vs. Zipf alpha values (α = 0.9, α = 0.75, α = 0.5, α = 0.25) for Socket-Sync, RDMA-Async, RDMA-Sync and e-RDMA-Sync.]

• Our schemes (RDMA-Sync and e-RDMA-Sync) achieve significant performance gain over existing schemes

Exploiting RDMA operations for Providing Efficient Fine-Grained Resource Monitoring in Cluster-Based Servers, K. Vaidyanathan, H.-W. Jin and D. K. Panda. Workshop on Remote Direct Memory Access (RDMA): Applications, Implementations and Technologies, 2006
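The slides do not say how the Zipf traces were produced. A common construction, assumed here purely for illustration, draws document ranks with probability proportional to 1/rank^α, so a larger α concentrates requests on fewer popular documents. The function names and parameters below are hypothetical.

```python
import random


def zipf_weights(num_docs, alpha):
    """Normalized popularity of ranks 1..num_docs under a Zipf-like law."""
    raw = [1.0 / (rank ** alpha) for rank in range(1, num_docs + 1)]
    total = sum(raw)
    return [weight / total for weight in raw]


def zipf_trace(num_docs, alpha, num_requests, seed=0):
    """Return a list of document indices (0 is the most popular document)."""
    rng = random.Random(seed)  # fixed seed so a trace can be regenerated
    weights = zipf_weights(num_docs, alpha)
    return rng.choices(range(num_docs), weights=weights, k=num_requests)


# Share of requests expected to hit the single most popular document.
top_share_skewed = zipf_weights(1000, 0.9)[0]
top_share_flat = zipf_weights(1000, 0.25)[0]
trace = zipf_trace(num_docs=1000, alpha=0.9, num_requests=5000)
```

Under this construction, α = 0.9 gives the most popular document a larger share of requests than α = 0.25 does, which is one way to read the alpha values on the plot's horizontal axis.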

Page 15: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Work-in-Progress

• Data-Center Primitives
– Efficient Global Memory Aggregator mechanisms
• Advanced Communication Protocol Mechanisms
– Efficient Packetized Flow-Control
• Detailed data-center evaluation with the proposed framework
• Software release of several data-center components
– Have received multiple requests from organizations for such a release, including a large financial company

Page 16: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Conclusions

• Proposed new protocols, primitives and services for next-generation data-centers
– Use advanced features of InfiniBand and other RDMA-enabled interconnects
– Significant performance gains and scalability for several scenarios
• Potential for designing next-generation scalable and high-performance data-center architectures

Page 17: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Future Challenges

• Challenges
– Benefits of all these components and services in an integrated manner for handling
    • Terabytes of data and multi-thousand users
– Redesigning middleware and applications on next-generation data-centers
• Significance to the SMA and PDOS components of the program
• Discussion bullet
– How to re-architect next-generation data-center architectures, software services, middleware and applications with advances in modern networking technologies and capabilities?

Page 18: Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers

Web Pointers

Website: http://www.cse.ohio-state.edu/~panda

Group Homepage: http://nowlab.cse.ohio-state.edu

Email: [email protected]

NBCL