Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai...

33
Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no technical data subject to the EAR or the ITAR.

Transcript of Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai...

Page 1: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Scalable Algorithms for Complex Networks

Tuhin Sahai

United Technologies Research Center

University of Connecticut

This page contains no technical data subject to the EAR or the ITAR.

Page 2: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Outline / Acknowledgements

2

� Decentralized Clustering

� Alberto Speranzon (UTRC)

� Andrzej Banaszuk (UTRC)

� Distributed Computation

� Stefan Klus (U. Paderborn/UTRC)

� Michael Dellnitz (U. Paderborn)

� Uncertainty Quantification

� José Miguel Pasini (UTRC)

� Amit Surana (UTRC)

� Andrzej Banaszuk (UTRC)

� Vladimir Fonoberov (Aimdyn Inc.)

� Sophie Loire (UC Santa Barbara)

This page contains no technical data subject to the EAR or the ITAR.

Page 3: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Complex Networks: Building Systems

3

� 40% of produced energy in the US is consumed by buildings

Energy Efficient Integrated Buildings

Monitoring, Security,

Occupancy Estimation etc.

Distributed Estimation

Distributed Optimization

Large Sensor

Networks

Tosur Tisur Tamb Tzone

C

1/hiA 1/hoA R

Qsurfi Qsurfo C

Qstructure

Rwin

Thermal models

CFD, Reduced Order

Models

Distributed Computation

Uncertainty Quantification

Need

Scalable &

Decentralized

Solutions

Large ODE

ModelsAnalysis

& Control

This page contains no technical data subject to the EAR or the ITAR.

Page 4: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

� Renewable energy sources: DOE’s 2030 goal – 20% of all

energy to have renewable sources

Complex Networks: Smart Grids & Aircraft Systems

4

Decentralized ControlStability Analysis

Heterogeneous

Switching

systems

No

Central

Control

• Uncertainty

Quantification

FAA

Certification

This page contains no technical data subject to the EAR or the ITAR.

Aircraft Electrical Systems

Page 5: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Complex Networks: UAV Swarms/Sensor Nets

5

� Networks of mobile sensors needed for various tasks

UAV Swarm

[Mathew and Mezic, ’09]

Path planning

for search & rescueCentralized:

Changing

Environment?

Sample Prior Distribution

Sensor Coverage

[Ghrist and Jadbabaie]

Slow:

Random Walk/

Consensus

� Novel algorithms needed for deployment in applications

� Mobile sensors need to self-organize using local information

� Role of uncertainty quantification for random parametersThis page contains no technical data subject to the EAR or the ITAR.

Page 6: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Complex Networks: Social Graphs

� Analysis of large social networks: time-varying and multi- attributed

interactions

� Games on social networks

� Other important complex networks: MEMS oscillator networks, biological

systems etc. 6

GUARD-DOG Program

Parallel/Cloud Computation Community detectionSocial Network

Scalable algorithms

This page contains no technical data subject to the EAR or the ITAR.

Page 7: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Scalable Solutions for Complex Networks

� “Divide and Conquer” - help build scalable solutions for

various problems

7

Partition Graph/Data

� Decentralized, scalable and fast

solutions for clustering, computation,

estimation, uncertainty quantification

etc.

� Clustering enables scalability

Implement distributed

algorithm

Desired:

This page contains no technical data subject to the EAR or the ITAR.

Page 8: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

� “Can one hear the shape of a drum?” – an article by Mark Kac:

do frequencies of vibrations determine the shape uniquely?

� Sparked decades of mathematical activity!

� Answer: Yes (if convexity assumed)

� Similarly we ask: Can one hear

the properties of a graph

(such as cluster locations)?

� Answer: Yes (if no symmetries)

“Shape” from Harmonic Analysis

8

M. Kac, Can One Hear the Shape of a Drum, American Mathematical Monthly, 1966

Non-convex drums with same frequencies

This page contains no technical data subject to the EAR or the ITAR.

Page 9: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Graph Analysis

� Matrix approach for representing graphs:

� Laplacian has nice properties

� Eigenvalues:

� Eigenvectors:

� Eigenvalues: Connected components, diameter etc

� Spectral Gap: Number of clusters

� Eigenvectors: Clustering, Localization, Sensor Coverage,

PageRank etc9

F. Chung, Spectral Graph Theory, 1997This page contains no technical data subject to the EAR or the ITAR.

Page 10: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Spectral Clustering (Centralized)

� Partition graphs based on:

where,

10

U. v. Luxburg, A Tutorial on Spectral Clustering, Statistics and Computing, 17(4), 2007

NP-Complete

Clusters: set of nodes strongly connected to each other

Relax problem

Rayleigh-Ritz

Strength of connection

Size of clusters

This page contains no technical data subject to the EAR or the ITAR.

Page 11: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Spectral Clustering (Centralized)

11

.….Eigenvector of Laplacian holds the clustering information

M. Fiedler, Czechoslovak Mathematical Journal, 1975.

+ +- - + +

It is extended to more than 2 clusters by using higher

eigenvectors (combinations of signs)

Very popular approach

Drawbacks:

• Solution is expensive for large graphs

• Central computer needed

This page contains no technical data subject to the EAR or the ITAR.

Page 12: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Distributed Spectral Clustering

� Orthogonal Iterations on a graph

� Random walks can be slow – equivalent to evolving the

heat equation on the graph. Convergence:

12

: Measure of time for random walk to get to steady stateD. Kempe and F. McSherry, A Decentralized Algorithm for Spectral Analysis, 2008.

Random Walk on the graph

This page contains no technical data subject to the EAR or the ITAR.

Page 13: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Insight

Traditional distributed clustering uses random walks

� Heat Equation (random walks):

� On the graph:

� Wave equation:

� On the graph:

The solution of the heat equation dies out, but the wave

equation does not

13

• T. Sahai, A. Speranzon and A. Banaszuk, Hearing the Clusters in a Graph:

A Distributed Algorithm, Automatica, 2012.

• M. Hein, J.-Y. Audibert and U. V. Luxburg, From Graphs to Manifolds - Weak

and Strong Pointwise Consistency of Graph Laplacians, 2005.

This page contains no technical data subject to the EAR or the ITAR.

Page 14: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

14

Example of the Algorithm

C4

C1

C2

C2

C3

C3

C1

-

+

+

-

-

+

+

+

+

-

-

+

-

+

Local FFT

This page contains no technical data subject to the EAR or the ITAR.

Page 15: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Main Result

15

Evolving the wave equation combined with local frequency estimation is

equivalent to computing the eigenvectors of the graph Laplacian

This page contains no technical data subject to the EAR or the ITAR.

Page 16: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Proof Outline

16

Need to start with a random initial condition

This page contains no technical data subject to the EAR or the ITAR.

Page 17: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Performance vs. Random Walk

� It can be shown that:

� Time needed to resolve the

lowest frequency in the FFT

� Compare to [Kempe & McSherry, 2008]

� Orders of magnitude faster than random walks

� is a function of

17This page contains no technical data subject to the EAR or the ITAR.

Page 18: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Comments about the Wave Equation

� Computes all eigenvectors and eigenvalues of the Laplacian

� Useful for distributed graph analysis and self-organizing

networks

� Algorithm is decentralized (no central node required)

� Fast Convergence (faster than random-walk based methods)

� Only scalar quantities exchanged (low communication cost)

18This page contains no technical data subject to the EAR or the ITAR.

Page 19: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Results: Line Graph

19This page contains no technical data subject to the EAR or the ITAR.

Page 20: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Applications & Results

20

Fortunato Benchmark Example

T. Sahai, A. Speranzon and A. Banaszuk, Hearing the Clusters in a Graph: A Distributed Algorithm

� Community detection in MapReduce (DARPA)

� Algorithm to detect disconnected components and isolated

nodes

� Scalable analysis of large quantities of sensor data

Contaminant Flow in Buildings

� Sensors self-organize into groups

� Accelerates distributed estimation

by orders of magnitude

� Extends work by Mathew and Mezic 2009

� Mobile agents self-organize into groups

� Decentralized computation of trajectories

Decentralized Path Planning

Social Network & Data Analysis

Distributed Estimation

This page contains no technical data subject to the EAR or the ITAR.

Page 21: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Distributed Computation

� Waveform Relaxation: Algorithm for distributed simulation of differential equations on parallel computers

21

J. K. White and A. Sangiovanni-Vincentelli, Relaxation Techniques for the Simulation of VLSI Circuits,1986.

Split

Compute

Iterate

Variables in CPU #1

Variables in CPU #2

Goal: Simulate for the interval

This page contains no technical data subject to the EAR or the ITAR.

Page 22: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Distributed Computation

22

Waveform Relaxation always converges as long as the original

function is Lipschitz continuous

S. Klus, T. Sahai, C. Liu and M. Dellnitz (2011), An Efficient Algorithm for the Parallel Solution of High-Dimensional Differential Equations, Journal of Computational and Applied Mathematics.. This page contains no technical data subject to the EAR or the ITAR.

Page 23: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Adaptive Waveform Relaxation

23

Waveform relaxation can be represented as:

� Split into parts

�Reduce by reducing when

estimate for is large

Take large steps when function changes slowly

and small steps when function changes fast

� Start with an initial time interval of say:

� Compute the solution using waveform relaxation

� Let be the number of desired iterations and the desired tolerance

� Estimate the next time interval:

� Where is estimated using a standard extrapolation formula

Adaptive Waveform Relaxation

This page contains no technical data subject to the EAR or the ITAR.

Page 24: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Optimal Splitting for AWR

24

� Optimal splitting changes with the numerical scheme and

step size: NP complete

Optimal for

• Implicit Euler:

• Implicit Euler:

Optimal for

• Implicit Euler:

• Trapezoidal Rule:

Spectral Clustering is a good heuristic for splitting (symmetrized Jacobian)

This page contains no technical data subject to the EAR or the ITAR.

Page 25: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

25

Thermal Model of a Building

Energy consumption in Buildings

Equations forRooms & Walls

HVAC Appliances/People

SolarConductionNewton’s Law of Cooling

This page contains no technical data subject to the EAR or the ITAR.

Page 26: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Thermal Model of a Building

26

Energy consumption in Buildings

Tosur Tisur Tamb Tzone

C

1/hiA 1/hoA R

Qsurfi Qsurfo C

Qstructure

Rwin

Models

ODEs

1 floor gives 140 states

Clustering

0 10 20 30 40 50 60 70-0.2

0

0.2

0.4

0.6

0.8

1

1.2

1.4

23 Clusters

AWR 10 times faster than WR

AWR & WR

This page contains no technical data subject to the EAR or the ITAR.

Page 27: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Propagating Uncertainty through Complex Networks

27

Given input (parametric) uncertainty, quantify the output uncertainty:

� Monte Carlo

� Quasi-Monte Carlo

� Response Surface Methods

� Polynomial Chaos/Probabilistic Collocation Methods

� Combination of the aboveT. Sahai, V. Fonoberov and S. Loire (2010), Uncertainty as a stabilizer of the head-tail ordered phase in

carbon-monoxide monolayers on graphite, Physical Review B.This page contains no technical data subject to the EAR or the ITAR.

Page 28: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Determining determines

Polynomial Chaos

28

Expand the state variables (or outputs) as:

Starting with the system:

Exponential convergence rate for processes with finite variance.

Performing a Galerkin Projection gives:

Curse of dimensionality:

Orthogonal Polynomials

X. Wan and G. E. Karniadakis (2008), Recent Advances in Polynomial Chaos Methods and Extensions

: distribution on

This page contains no technical data subject to the EAR or the ITAR.

Page 29: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

UQ for Hybrid Systems

� H-PC + Wavelet expansions + Boundary Layers help deal with hybrid

dynamical systems – curse of dimensionality

29

T. Sahai and J. M. Pasini, Uncertainty Quantification in Hybrid Dynamical Systems, 2013

Hybrid dynamics

: Uncertain

Bouncing Ball Monte Carlo Boundary Layer

Wavelets

H-PC:

This page contains no technical data subject to the EAR or the ITAR.

Page 30: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Scalable Uncertainty Quantification

� Use decentralized clustering to partition the dynamical

system into “weakly” interacting components

30

A. Surana, T. Sahai and A. Banaszuk (2012), Iterative Methods for Scalable Uncertainty Quantification in Complex Networks, International Journal of Uncertainty Quantification.

Weak interactions –

lower order expansions

Random Parameters

This page contains no technical data subject to the EAR or the ITAR.

Page 31: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Results: Scalable UQ

31

Tosur Tisur Tamb Tzone

C

1/hiA 1/hoA R

Qsurfi Qsurfo C

Qstructure

Rwin

� Consider a two room model with uncertain parameters

2 rooms give a 10 state model

Random parameters – Gaussian distributions

60 state model

Clustering + AWR

This page contains no technical data subject to the EAR or the ITAR.

Page 32: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

Conclusions/Future Work

� Motivated from continuous approaches one can construct efficient

approaches for NP-hard problems such as graph partitioning

� Scalability is enabled by graph decomposition

� Simulating high-dimensional dynamical systems

� Polynomial chaos based uncertainty quantification

in complex networks

� Future Directions include:

� Distributed Computation: Index 2 DAEs, Equation-Free Methods

� Clustering of time varying multi-attributed graphs for distributed

computation, sensor coverage and localization

� Machine Learning for “Big-Data” problems

32This page contains no technical data subject to the EAR or the ITAR.

Page 33: Scalable Algorithms for Complex Networks · Scalable Algorithms for Complex Networks Tuhin Sahai United Technologies Research Center University of Connecticut This page contains no

33

Thank You!

This page contains no technical data subject to the EAR or the ITAR.