Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel...

22
2/16/2011 1 Jianwu Wang 1 , Daniel Crawl 1 , Ilkay Altintas 1 , Chad Berkley 2 , Matthew B. Jones 2 1 San Diego Supercomputer Center, UCSD 2 National Center for Ecological Analysis and Synthesis, UCSB Ninth Biennial Ptolemy Miniconference

Transcript of Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel...

Page 1: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

2/16/2011 1

Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2

1 San Diego Supercomputer Center, UCSD 2 National Center for Ecological Analysis and Synthesis, UCSB

NinthBiennialPtolemyMiniconference

Page 2: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Outline

•  Distributed Execution Architectures in Kepler

•  Master-Slave Distributed Execution Architecture in Kepler

•  MapReduce Distributed Execution Architecture in Kepler

•  Comparison between the Above Two Architectures

2/16/2011 2 NinthBiennialPtolemyMiniconference

Page 3: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Part I

•  Distributed Execution Architectures in Kepler

•  Master-Slave Distributed Execution Architecture in Kepler

•  MapReduce Distributed Execution Architecture in Kepler

•  Comparison between Master-Slave and MapReduce Distributed Execution Architectures

2/16/2011 3 NinthBiennialPtolemyMiniconference

Page 4: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Distributed Execution Requirements in Kepler

•  Various requirements on distributed execution in different environments, examples: – Ad-hoc network resources – Web service resources – Cluster resources – Grid resources – Cloud resources – …

2/16/2011 4 NinthBiennialPtolemyMiniconference

Page 5: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Distributed Execution Supports in Kepler

•  Kepler integrated frameworks and libraries to support the requirements – Remote method invocation (RMI) for ad-hoc

network resources – Axis Web service libraries for Web Service

invocation – Ssh session libraries (JSch) for remote

execution and job submission on clusters – Globus libraries for Grid computing

2/16/2011 5 NinthBiennialPtolemyMiniconference

Page 6: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Three Distributed Execution Levels in Kepler

•  Workflow level: the whole workflow can be executed in distributed environments – Example: Web service for Kepler workflow

execution •  Actor level: distributed computing and data

resources can be utilized in an actor – Example: Web service actor in Kepler

•  Sub-workflow level: sub-workflows can be executed in distributed environments – Example: Master-Slave and MapReduce

Distributed Execution 2/16/2011 6 NinthBiennialPtolemyMiniconference

Page 7: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Advantages of Using Workflow System for Distributed Execution

•  Reuse existing workflows – Easily transform workflow from centralized

execution to distributed execution •  Transparent implementation – Hide diverse distributed techniques from users,

such as different job schedulers –  Just drag-and-drop, no coding is needed

•  Optimal execution –  (Semi-)automatically get the best execution plan

•  Provenance support •  Fault tolerance

2/16/2011 7 NinthBiennialPtolemyMiniconference

Page 8: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Part II

•  Distributed Execution Architectures in Kepler

•  Master-Slave Distributed Execution Architecture in Kepler

•  MapReduce Distributed Execution Architecture in Kepler

•  Comparison between Master-Slave and MapReduce Distributed Execution Architectures

2/16/2011 8 NinthBiennialPtolemyMiniconference

Page 9: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Distributed Composite Actor

•  As the role of Master, each token received by Distributed Composite Actor is distributed to a Slave node, executed, and the results returned.

2/16/2011 9 NinthBiennialPtolemyMiniconference

Page 10: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Distributed Composite Actor Behaviors with Different Computation Models

2/16/2011 10 NinthBiennialPtolemyMiniconference

Page 11: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Demo Workflow

2/16/2011 11 NinthBiennialPtolemyMiniconference

Page 12: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Usability

•  Users use the DistributedCompositeActor just like the common composite actor

•  Interaction for execution environment transition

2/16/2011 12 NinthBiennialPtolemyMiniconference

Page 13: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Performance Experiment

2/16/2011 13 NinthBiennialPtolemyMiniconference

Page 14: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Part III

•  Distributed Execution Architectures in Kepler

•  Master-Slave Distributed Execution Architecture in Kepler

•  MapReduce Distributed Execution Architecture in Kepler

•  Comparison between Master-Slave and MapReduce Distributed Execution Architectures

2/16/2011 14 NinthBiennialPtolemyMiniconference

Page 15: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

MapReduce Actor in Kepler

2/16/2011 15

(a) MapReduce actor. (b) Map sub-workflow in MapReduce actor. (c) Reduce sub-workflow in MapReduce actor.

NinthBiennialPtolemyMiniconference

Page 16: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

MapReduce Actor Execution in Hadoop

2/16/2011 16 NinthBiennialPtolemyMiniconference

Page 17: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Using MapReduce Actor for Word Count

2/16/2011 17

Word count workflow in Kepler

Map sub-workflow Reduce sub-workflow

Sub-workflow in IterateOverArray actor

NinthBiennialPtolemyMiniconference

Page 18: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Performance Experiment for Word Count

2/16/2011 18 NinthBiennialPtolemyMiniconference

Page 19: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Part IV

•  Distributed Execution Architectures in Kepler

•  Master-Slave Distributed Execution Architecture in Kepler

•  MapReduce Distributed Execution Architecture in Kepler

•  Comparison between Master-Slave and MapReduce Distributed Execution Architectures

2/16/2011 19 NinthBiennialPtolemyMiniconference

Page 20: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Commonalities

•  Both have distributed data + distributed programs

•  Both have master and slaves •  Both have execution engines on slaves

2/16/2011 20 NinthBiennialPtolemyMiniconference

Page 21: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Main Differences

•  MapReduce – Usually, all input data needs to be staged in

beforehand and outputs is only accessible when the whole execution is finished

– More suitable for large data sets, and has good scalability on clusters with numerous nodes

•  Master-Slave –  Inputs can be provided dynamically and get its

result gradually once it is generated – More suitable for dynamic data distribution cases

2/16/2011 21 NinthBiennialPtolemyMiniconference

Page 22: Jianwu Wang , Daniel Crawl Ilkay Altintas , Chad ... - Chess · 2/16/2011 1 Jianwu Wang1, Daniel Crawl1, Ilkay Altintas1, Chad Berkley2, Matthew B. Jones2 1 San Diego Supercomputer

Thanks!

•  Papers for the Above Work –  J. Wang, D. Crawl, I. Altintas. Kepler + Hadoop – A General

Architecture Facilitating Data-Intensive Applications in Scientific Workflow Systems. In Proc. of the 4th Workshop on Workflows in Support of Large-Scale Science (WORKS09) at Supercomputing 2009 (SC2009) Conf..

–  J. Wang, I. Altintas, P. R. Hosseini, D. Barseghian, D. Crawl, C. Berkley, M. B. Jones. Accelerating Parameter Sweep Workflows by Utilizing Ad-hoc Network Computing Resources: an Ecological Example. In Proceedings of IEEE 2009 Third International Workshop on Scientific Workflows (SWF 2009), 2009 Congress on Services (Services 2009).

•  More Information: –  Distributed Execution Interest Group of Kepler: https://

dev.kepler-project.org/developers/interest-groups/distributed –  Contact: [email protected]

2/16/2011 22 NinthBiennialPtolemyMiniconference