DryadOpt: Branch-and-Bound on Distributed Data-Parallel Execution Engines



Mihai Budiu, Daniel Delling, Renato Werneck
Microsoft Research - Silicon Valley

IEEE International Parallel & Distributed Processing Symposium

IPDPS 2011

2

DDPEEs

[Diagram: the DDPEE software stack by layer]

Application:  Your problem, DryadOpt
Language:     FlumeJava; DryadLINQ, Scope; Pig, Hive
Execution:    Map-Reduce; Dryad; Hadoop
Storage:      GFS, BigTable; Cosmos, Azure, HPC; HDFS

3

Branch-And-Bound (BB)

• Solve optimization problems
• Explore the tree of potential solutions
• Bound the solution cost
• Prune the search
(a minimal sequential sketch follows below)
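For readers new to the technique, here is a minimal sequential branch-and-bound sketch (illustrative C#, not DryadOpt code; Bound, Branch, IsComplete, and Cost are hypothetical problem-specific callbacks):

using System;
using System.Collections.Generic;

// Minimal sequential branch-and-bound sketch (illustrative only).
class BranchAndBound<Node>
{
    public Func<Node, double> Bound;             // lower bound on any solution below this node
    public Func<Node, IEnumerable<Node>> Branch; // split a node into sub-problems
    public Func<Node, bool> IsComplete;          // node is a full (leaf) solution
    public Func<Node, double> Cost;              // cost of a complete solution

    public double Solve(Node root)
    {
        double best = double.PositiveInfinity;
        var open = new Stack<Node>();            // DFS over the search tree
        open.Push(root);
        while (open.Count > 0)
        {
            Node n = open.Pop();
            if (Bound(n) >= best) continue;      // prune: cannot improve the incumbent
            if (IsComplete(n))
            {
                best = Math.Min(best, Cost(n));  // update the incumbent solution
                continue;
            }
            foreach (var child in Branch(n))     // expand the node
                open.Push(child);
        }
        return best;
    }
}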

4

Optimization Problems

• Minimize or maximize a cost function
• Many are NP-hard
• Arise frequently in practice
• Parallelism = linear speedup of an exponential algorithm
  – may still make a solution practical, e.g., one CPU-year of work per day
  – real-world instances are not always hard
  – instances are often relatively small

5

Why Is This Work Interesting?

• Generic distributed BB implementation
  – Separates the sequential and parallel components
  – Parallelism is hidden from the user (DryadOpt can use any sequential solver)
• DDPEEs offer a restricted computation model
  – Communication is expensive
  – DDPEEs require idempotent computations
• DryadOpt exploits parallelism well (per CPU/core)

6

Generic Solution Search

[Diagram: the user supplies the sequential solver; we supply DryadOpt; the two connect through the Solver API]

Concern Separation

[Diagram: optimization problems such as Steiner tree and travelling salesman are handled by specialized sequential solvers; these plug into the solver interface, behind which sit the DryadOpt solver engines: a sequential engine, a multi-core engine, and a distributed engine]

8

Outline

• Introduction
• Mapping BB to DDPEEs
• Running the algorithm
• Parallelization details
• Performance results
• Conclusions

9

DDPEE Computation Structure

[Diagram: input → computations → communication → output]

Computation graph is statically constructed

10

Unbalanced Search Trees

No static tree partition will work well

11

Algorithm structure

• Dynamic load-balancing
• Iterative computation: expand the tree, load-balance, iterate
  (a schematic driver loop is sketched below)
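As a rough illustration (not DryadOpt's actual code), the iterative structure can be written as the generic driver below; TProblem, TState, expand, and loadBalance are hypothetical placeholders for a sub-problem type, the global state, and the data-parallel expansion and redistribution stages:

using System;
using System.Collections.Generic;

// Schematic outer loop of the iterative computation (illustrative only).
static class IterativeDriver
{
    public static TState Run<TProblem, TState>(
        List<TProblem> frontier,                     // open sub-problems
        TState state,                                // global state (e.g., best bound)
        Func<List<TProblem>, TState, (List<TProblem>, TState)> expand,
        Func<List<TProblem>, List<TProblem>> loadBalance)
    {
        while (frontier.Count > 0)
        {
            // Expand every open sub-problem for a bounded amount of work,
            // producing a new frontier and an updated global state.
            (frontier, state) = expand(frontier, state);

            // Redistribute the (typically unbalanced) new frontier.
            frontier = loadBalance(frontier);
        }
        return state;                                // frontier empty: search finished
    }
}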

12

Distributing Search Trees

13

Outline

• Introduction
• Mapping BB to DDPEEs
• Running the algorithm
• Parallelization details
• Performance results
• Conclusions

14

1. Start tree on a single machine

15

2. Split the open problems randomly

3. Distribute open problems

16

4. Proceed independently

17

5. Split independently, randomly

18

6. Redistribute

19

7. Merge

20

8. Iterate
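Steps 2-3 and 5-6 amount to randomly scattering the open problems across partitions. A minimal sketch in plain C# (not the engine operators DryadOpt actually uses; Scatter and partitionCount are hypothetical names):

using System;
using System.Collections.Generic;
using System.Linq;

static class FrontierScatter
{
    // Randomly scatter the open sub-problems over a fixed number of partitions
    // (one partition per machine). Illustrative only.
    public static List<T>[] Scatter<T>(IEnumerable<T> openProblems, int partitionCount, int seed = 0)
    {
        var rng = new Random(seed);
        var partitions = Enumerable.Range(0, partitionCount)
                                   .Select(_ => new List<T>())
                                   .ToArray();
        foreach (var p in openProblems)
            partitions[rng.Next(partitionCount)].Add(p);  // uniform random assignment
        return partitions;
    }
}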

21

Final Tree

22

Outline

• Introduction
• Mapping BB to DDPEEs
• Running the algorithm
• Parallelization details
• Performance results
• Conclusions

23

Bird’s Eye View

[Diagram: each iteration maps the sequential solver over the instances in the current frontier, load-balances the resulting new frontier, aggregates the per-partition global state and runs the termination test, broadcasts the global state, and repeats if not done]
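The "aggregate state / termination test" stage can be read as a reduction over per-partition states followed by a broadcast. A minimal sketch, assuming a hypothetical BestCostState that tracks only the best solution cost (its Merge mirrors the global-state interface on the Solver API backup slide):

using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical global state: the best solution cost found so far.
class BestCostState
{
    public double BestCost = double.PositiveInfinity;
    public void Merge(BestCostState other) =>
        BestCost = Math.Min(BestCost, other.BestCost);
}

static class Aggregation
{
    // Reduce the per-partition states into one global state; that state is then
    // broadcast back to every partition for the next iteration.
    public static BestCostState Aggregate(IEnumerable<BestCostState> perPartition)
    {
        var global = new BestCostState();
        foreach (var s in perPartition) global.Merge(s);
        return global;
    }

    // Termination test: stop when no partition has open sub-problems left.
    public static bool Done(IEnumerable<int> openCounts) => openCounts.All(c => c == 0);
}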

24

Nested Parallelism

[Diagram: the frontier is partitioned across machines (inter-machine parallelism); within each partition the work is further split across cores (inter-core parallelism), and the results are merged]
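A minimal sketch of the inter-core level, assuming each machine holds one partition of open sub-problems and a hypothetical solveOne callback for a single sub-problem:

using System.Collections.Concurrent;
using System.Collections.Generic;
using System.Threading.Tasks;

static class CoreParallelism
{
    // Inter-core parallelism within one machine's partition: each open
    // sub-problem is handed to a core; results are collected thread-safely.
    public static List<TResult> ExpandPartition<TProblem, TResult>(
        IReadOnlyList<TProblem> partition,
        System.Func<TProblem, IEnumerable<TResult>> solveOne)
    {
        var results = new ConcurrentBag<TResult>();
        Parallel.ForEach(partition, p =>
        {
            foreach (var r in solveOne(p))   // solveOne: hypothetical per-sub-problem solver call
                results.Add(r);
        });
        return new List<TResult>(results);
    }
}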

25

Other Details in Paper

• Cluster resources are unpredictable
  – Outliers can lead to low cluster utilization
  – Solution: use real-time scheduling
• The sequential solver is not idempotent
  – Re-executions triggered by fault tolerance can lead to incorrect results
  – Solution: checkpoint the frontier at suitable execution points
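A minimal illustration of the real-time scheduling idea (not DryadOpt code): each machine expands only until a wall-clock deadline and carries the unfinished work into the next iteration, so all machines finish an iteration at roughly the same time; expandOnce is a hypothetical single-step expansion:

using System;
using System.Collections.Generic;
using System.Diagnostics;

static class DeadlineExpansion
{
    // Expand the local open set only until the wall-clock budget runs out,
    // then return whatever is still open.
    public static Queue<T> ExpandUntilDeadline<T>(
        Queue<T> open, TimeSpan budget, Func<T, IEnumerable<T>> expandOnce)
    {
        var clock = Stopwatch.StartNew();
        while (open.Count > 0 && clock.Elapsed < budget)
        {
            var p = open.Dequeue();
            foreach (var child in expandOnce(p))
                open.Enqueue(child);             // children join the open set
        }
        return open;                             // unfinished work carries to the next iteration
    }
}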

26

Other Details in Paper

• Trade-off between memory and load balancing
  – The frontier can grow very large
  – Solution: dynamically adjust the tree traversal strategy (BFS/DFS)
• Sub-problems may differ little from the parent problem
  – Many sub-problems can cause memory pressure
  – Solution: use an incremental sub-problem representation (sketched below)
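A minimal sketch of the incremental representation idea, with a hypothetical delta type; the full sub-problem is rebuilt by replaying the chain of deltas from the root instance:

using System.Collections.Generic;

// Incremental sub-problem representation (illustrative): instead of copying the
// whole instance, each node stores only a small delta relative to its parent.
// TDelta is a hypothetical, problem-specific change (e.g., "fix variable x to 1").
class IncrementalNode<TDelta>
{
    public IncrementalNode<TDelta> Parent;   // null at the root
    public TDelta Delta;                     // change relative to the parent

    // Collect the deltas on the root-to-node path; applying them in order
    // to the root instance reconstructs this sub-problem.
    public List<TDelta> StepsFromRoot()
    {
        var steps = new List<TDelta>();
        for (var n = this; n != null && n.Parent != null; n = n.Parent)
            steps.Add(n.Delta);
        steps.Reverse();                     // root-to-leaf order
        return steps;
    }
}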

27

Outline

• Introduction
• Mapping BB to DDPEEs
• Running the algorithm
• Parallelization details
• Performance results
• Conclusions

28

Benchmark: Steiner Tree Solver

29

Cluster

• Machines
  – 2 dual-core AMD Opteron CPUs at 2.6 GHz
  – 16 GB RAM
  – Windows Server 2003
• DryadLINQ
• 128 machines (512 cores)

30

Scalability

31

Conclusions

• Generic parallelization (problem-independent)
• Nested machine/core parallelization
• Careful scheduling is needed for good performance
• Solvers are not idempotent: they interfere with fault-tolerance mechanisms
• Search tree exploration is efficiently parallelizable in the DDPEE model

32

Backup Slides

Real-Time Scheduling

[Chart: time vs. cluster machines under relative-time and real-time scheduling, showing real-time deadlines, preempted and completed work (61 min)]

35

Load-Balancing

36

Tree Traversal Strategies

• BFS: large frontier, efficient load-balancing, but memory pressure
• DFS: reduces the number of open subproblems
• Solution: dynamically switch between BFS and DFS (a minimal sketch follows)
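A minimal sketch of the switch, assuming a hypothetical frontier-size threshold (not DryadOpt's actual policy):

using System.Collections.Generic;

static class TraversalSwitch
{
    // Dynamic BFS/DFS switch (illustrative): expand breadth-first while the
    // frontier is small enough to load-balance well; fall back to depth-first
    // (take the most recently generated node) when memory pressure builds up.
    // maxFrontier is a hypothetical tuning threshold; frontier must be non-empty.
    public static T PickNext<T>(LinkedList<T> frontier, int maxFrontier)
    {
        bool useBfs = frontier.Count < maxFrontier;
        var node = useBfs ? frontier.First.Value : frontier.Last.Value;
        if (useBfs) frontier.RemoveFirst(); else frontier.RemoveLast();
        return node;
    }
}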

37

The Solver API

[Serializable] interface IBBInstance { }

[Serializable] interface IBBGlobalState {
    void Merge(IBBGlobalState s);
    void Copy(IBBGlobalState s);
}

List<IBBInstance> Solve(List<IBBInstance> incrementalSteps,
                        IBBGlobalState state,
                        BBConfig c)
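For illustration, a toy knapsack-style instance and global state might implement these interfaces roughly as below; the class names and fields are hypothetical and not part of DryadOpt:

using System;

// Toy implementations of the solver API above (illustrative only).
interface IBBInstance { }                       // as on the API slide
interface IBBGlobalState
{
    void Merge(IBBGlobalState s);
    void Copy(IBBGlobalState s);
}

[Serializable]
class KnapsackStep : IBBInstance                // one incremental step: decide one item
{
    public int Item;
    public bool Taken;
}

[Serializable]
class BestValueState : IBBGlobalState           // global state: best value found so far
{
    public double BestValue;
    public void Merge(IBBGlobalState s) =>
        BestValue = Math.Max(BestValue, ((BestValueState)s).BestValue);
    public void Copy(IBBGlobalState s) =>
        BestValue = ((BestValueState)s).BestValue;
}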

38

Re-execution & Idempotence

[Diagram: re-executions of a non-idempotent vertex can produce different outputs (X vs. Y), so after fault-tolerance re-execution the merged downstream result is ambiguous (?)]