Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected...

31
Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242, 2013 Presenter: Seyyed Shaho Alaviani Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242, 2013 Presenter: Seyyed Shaho Alaviani

Transcript of Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected...

Page 1: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Designing Games for Distributed Optimization

Na Li and Jason R. Marden

IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242, 2013

Presenter: Seyyed Shaho Alaviani

Designing Games for Distributed OptimizationNa Li and Jason R. MardenIEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242, 2013

Presenter: Seyyed Shaho Alaviani

Page 2: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Introduction -advantages of game theory

Problem Formulation and Preliminaries - potential games -state based potential games -stationary state Nash equilibrium

Main Results - state based game design -analytical properties of designed game -learning algorithm

Numerical Examples

Conclusions

Page 3: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Network-Consensus-Rendezvous-Formation-Schooling-Flocking

All: special cases of distributed optimization

Page 4: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Game Theory: a powerful tool for the design and control of multi agent systems

Using game theory requires two steps:

1- modelling the agent as self-interested decision maker in a game theoretical environment: defining a set of choices and a local objective function for each decision maker

2- specifying a distributed learning algorithm that enables the agents to reach a Nash equilibrium of the designed game

Introduction

Page 5: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Core advantage of game theory:

It provides a hierarchical decomposition between

the distribution and optimization problem (game design)

and

the specific local decision rules (distributed learning algorithm)

Example: Lagrangian

The goal of this paper:

To establish a methodology for the design of local agent objective functions that leads to desirable system-wide behavior

Page 6: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Connected and disconnected graphs Directed and undirected graphs

connected disconnected directed

undirected

Graph

Page 7: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Consider a multi-agent of agents,

set of decisions, nonempty convex subset of real numbers

Optimization problem:

s.t.

where is a convex function, andthe graph is undirected and connected

Problem Formulation and Preliminaries

Page 8: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Physics:

Page 9: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Main properties of potential games:

1- a PSNE is guaranteed to exist

2- there are several distributed learning algorithms with proven asymptotic guarantees

3- learning PSNE in potential games is robust: heterogeneous clock rates and informational delays are not problematic

Page 10: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Stochastic games( L. S. Shapley, 1953):

In a stochastic game the play proceeds by steps from position to position, according to transition probabilities controlled jointly by two players.

State Based Potential Games(J. Marden, 2012):

A simplification of stochastic games that represents and extension to strategic form games where an underlying state space is introduced to the game theoretic environment

Page 11: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,
Page 12: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

State Based Game Design:The goal is to establish a state based game formulation for our distributed optimization problem that satisfies the following properties:

Main Results

Page 13: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

A State Based Game Design for Distributed Optimization:

- State Space

- Action sets

- State dynamics

- Invariance associated with state dynamics

- Agent cost functions

Page 14: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

State Space:

Page 15: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Action sets:

An action for agent I is defined as a tuple

indicates a change in the agent value

indicates a change in the agent’s estimation term

Page 16: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

State Dynamics:

For a state and an action , the ensuing state is given by

Page 17: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Invariance associated with state dynamics:

Let be the initial values of the agents

Define the initial estimation terms to satisfy

Then for all

Page 18: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Agent cost functions:

Page 19: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Analytical Properties of Designed Game

Theorem 2 shows that the designed game is a state based potential game.

Theorem 2: The state based game is a state based potential game with potential function

and represents the ensuing state.

Page 20: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Theorem 3: Let G be the state based game. Suppose that is a differentiable convex function, the communication graph is connected and undirected, and at least one of the following conditions is satisfied:

Theorem 3 shows that all equilibria of the designed game are solutions to the optimization problem.

Page 21: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,
Page 22: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Question:

Could the results in Theorem 2 and 3 have been attained using framework of strategic form games?

impossible

Page 23: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Learning Algorithm

We prove that the learning algorithm gradient play converges to a stationary state NE.

Assumptions:

Page 24: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Theorem 4: Let G be a state based potential game with a potential function that satisfies the assumption. If the step size for all , then the state action pair of the gradient play

asymptotically converges to a stationary state NE.

Page 25: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Example 1:

Consider the following function to be minimized

Numerical Examples

Page 26: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,
Page 27: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Example 2: Distributed Routing Problem

source destination

m routes

Application: the Internet

Amount traffic

Percentage of traffic that agent i designates to route r

Page 28: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

For each route r, there is an associated congestion function that reflects the cost of using the route as a function of the amount of traffic on that route.

Then total congestion in the network will be

Page 29: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

R=5

N=10

Communication graph

𝛼=900

Page 30: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

Conclusions:

- This work presents an approach to distributed optimization using the framework of state based potential games.

- We provide a systematic methodology for localizing the agents’ objective functions while ensuing that the resulting equilibria are optimal with regards to the system level objective function.

- It is proved that the learning algorithm gradient play guarantees convergence to a stationary state NE in any state based potential game

- Robustness of the approach

Page 31: Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp. 230-242,

MANY THANKS

FOR

YOUR ATTENTION