Discrete Optimization Last Time Local Search Meta-Heuristic Search Specific Search Strategies ...

89
Discrete Optimization Last Time Local Search Meta-Heuristic Search Specific Search Strategies Simulated Annealing Stochastic Machines Convergence Analysis 111/06/20 Shi-Chung Chang, NTUEE, GIIE, GICE, Spring, 2008

Transcript of Discrete Optimization Last Time Local Search Meta-Heuristic Search Specific Search Strategies ...

Page 1: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Discrete Optimization

Last Time Local Search Meta-Heuristic Search Specific Search Strategies Simulated Annealing Stochastic Machines Convergence Analysis

112/04/19Shi-Chung Chang, NTUEE, GIIE, GICE,

Spring, 2008

Page 2: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Discrete Optimization

Today SA: Convergence Analysis Evolutionary Computation Genetic Algorithms: Simple Example GA: General Coding and mapping Selection Genetic Variability Operators Fitness Design ssues

112/04/19Shi-Chung Chang, NTUEE, GIIE, GICE,

Spring, 2008

Page 3: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Reading Assignments 1. A Genetic Algorithm Tutorial

Darrell Whitley Statistics and Computing (4):65-85, 1994.

112/04/19Shi-Chung Chang, NTUEE, GIIE, GICE

Page 4: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Convergence Analysis of Simulated AnnealingRef: E. Aarts, J. Korst, P. Van Laarhoven, “Simulated Annealing,” in Local Search in Combinatorial Optimization, edited by E. Aarts and J. Lenstra, 1997, pp. 98-104.

Page 5: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Boltzmann distribution

• At thermal equilibrium at temperature T, the Boltzmann distribution gives the relative probability that the system will occupy state A vs. state B as:

• where E(A) and E(B) are the energies associated with states A and B.

)/)(exp(

)/)(exp()()(exp

)(

)(

TAE

TBE

T

BEAE

BP

AP

Page 6: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Simulated annealing in practice

• Geman & Geman (1984): if T is lowered sufficiently slowly (with respect to the number of iterations used to optimize at a given T), simulated annealing is guaranteed to find the global minimum.

• Caveat: this algorithm has no end (Geman & Geman’s T decrease schedule is in the 1/log of the number of iterations, so, T will never reach zero), so it may take an infinite amount of time for it to find the global minimum.

Page 7: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Simulated annealing algorithm

• Idea: Escape local extrema by allowing “bad moves,” but gradually decrease their size and frequency.

Algorithm when goalis to minimize E.< -

-

Page 8: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Note on simulated annealing: limit cases

• Boltzmann distribution: accept “bad move” with E<0 (goal is to maximize E) with probability P(E) = exp(E/T)

• If T is large:E < 0E/T < 0 and very smallexp(E/T) close to 1accept bad move with high probability

• If T is near 0: E < 0E/T < 0 and very largeexp(E/T) close to 0accept bad move with low probability

Random walk

Deterministicdown-hill

Page 9: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Markov Model

• Solution State• Cost of a solution Energy of a state

• Generation Probability of state j from state i: Gij

|)(|

)()(

iN

jcG kij

k.iteration at re temperatu the

otherwise. 0,

N(i);i if 1,

i of odneighborho

:

:)(

kc

χ(i)

iN

Page 10: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Acceptance and Transition Probabilities

• Acceptance probability of state j as next state at state i

otherwise.

if

,0

;0,

))]()([

exp()(

aaa

c

ifjfcA

kkij

• Transition probability from state i to state j

j. if

j if

icAcG

icAcGkP

iliNlkilkil

kijkij

ij ,)()(1

;),()()(

),(

Page 11: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Irreducibility and Aperiodicity of M.C.

• Irriducibility

eirreducibl is M.C. the

such that 0

0)(, ijnPnji

• Aperiodicity

aperiodic. is M.C. the

1, ) gcd( , stateeach for

{

then

DiIf

PnD

i

iin

i

}0)(|

Page 12: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Theorem 1: Existence of Unique Stationary State Distribution

• Finite homogeneous M.C.• Irriducibility + Aperiodicity

iPq

iq

jij

j

i

state for

stateat is M.C. y that theprobabilit

existence of unique stationary distribution

Page 13: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Theorem 2: Asymptotic Convergence of Simulated Annealing

• P(k): the transition matrix of the homogeneous M.C. associated with the S.A. algorithm

• Ck = C for all k

1.-n0,1,...,k with

and 1

,0

,...,,,

1

10

kk ll

n

G

jllilnji

existence of a unique stationary distribution

solutions. optimal ofset solution, optimalan

||

1 (c)

0c

lim (c)

state for exp[

exp[ (c)

::

),(

]/)(

]/)(

*

**

i

iqqand

icjf

cifq

ii

j

i

Page 14: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Asymptotic Convergence of Simulated Annealing

. {x(k)

k

lim

0c

lim 1}

Pc

From Theorem 2

Page 15: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 15

Genetic Algorithms and their Application to the Artificial Evolution of

Genetic Regulatory Networks

Tutorial ICSB 2007Johannes F. Knabe, Katja Wegner,

and Maria J. SchilstraUniversity of Hertfordshire, UK

Page 16: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 16

Part 1: Fundamentals

Biological Evolution

Page 17: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 17

Evolutionary cycle : Generation

Recombination

MutationPopulation

Offspring

ParentsSelection

Replacement

Page 18: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 18

Dictionary 1

Gene – smallest unit with genetic information Genotype – collectivity of all genes Phenotype – expression of genotype in

environment Individual – single member of a population with

genotype and phenotype Population – set of several individuals Generation – one iteration of evaluation,

selection and reproduction with variation

Page 19: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 19

Selection and Reproduction

Selection does not act on genotype at all but on the performance of the phenotype (fitness)

There is differential reproduction phenotypes better adapted to the environment are likely to produce more offspring

Slightly unfaithful reproduction creates genotypic variations affect traits of the phenotype, which in turn affect fitness

These genotypic variations are heritable

Page 20: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 20

Recombination (crossover)

Choose two individuals from current population parents

New combination of the genetic material of these individuals offspring

No new genetic information, only reshuffling of existing information

But can have strong effects on phenotype

http://student.biology.arizona.edu/honors2001/group12/introduction.html

Page 21: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 21

Duplication

Any doubling of a certain region, e.g. through unequal recombination

If this region consists of a gene, it is called gene-duplication

http

://e

n.w

ikip

edia

.org

/wik

i/Ge

ne_d

uplic

atio

n

Page 22: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 22

Mutation

Permanent changes to genetic material

Can be caused by errors during reproduction of DNA

• Mutation rate: i.e. 1 in 10.000 bases is incorrectly reproduced

Brings variability into reproduction Usually small changes at individual

level but strongly depends on “importance” of mutated base to phenotype

htt

p:/

/ww

w.b

iocr

aw

ler.

com

/en

cycl

op

ed

ia/M

uta

tion

Page 23: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 23

The Evolutionary Mechanisms

Selection and differential reproduction

• DECREASE diversity in population

Genetic operators (mutation, recombination)

• INCREASE diversity of population

Page 24: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 24

Part 1: Fundamentals (2)

Evolutionary Computation

Genetic Algorithms (GA)

Page 25: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Evolutionary Computation

Page 26: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 26

Evolutionary Computation

Exploitation of concepts of natural evolution for problem solving using computers

Simulation of evolutionary processes (recombination, mutation, selection) for solving a desired problem

Particularly well-suited to complex, multidimensional problems too big to search exhaustively (non-linear optimization problems)

Cannot solve all problems perfectly, but has fewer restrictions than most problem-solving algorithms

Page 27: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 27

Optimization - Problems

Example: hill-climbing• Start with estimate of global maximum

• Try to improve by finding other solutions that have a greater value than the current estimate (local search)

Localmaximum

Globalmaximum

• Local maxima = hazards could converge to local maximum instead of global

Page 28: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 28

Evolutionary cycle - revisited

RecombinationMutation

Population

Offspring

Parents

Replacement

(random) start population

SelectionEnd?

Evaluation

Page 29: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 29

Individual - one candidate solution Population - set of individuals Genotype - encoded representation of individual Phenotype - decoded representation of individual Mapping - decodes the phenotype Mutation - variability operator that modifies a genotype Recombination/Crossover - variability operator mixing

genotypes Fitness - performance of a phenotype with regard to

objective Iteration - Generation

Dictionary 2

Page 30: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 30

EC - General properties

Exploit collective “learning“ process of a population (each individual = one solution = one search point)

Evaluation of individuals in their environment = measure of quality = fitness comparison of individuals

Selection favors better individuals who reproduce more often than those that are worse

Offspring is generated by random recombination and mutation of selected parents

Page 31: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 31

Main trends

Genetic algorithms (GAs)Genetic programming (subform of GAs)Evolutionary strategies (ES)Evolutionary programming (EP)

Page 32: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Genetic Algorithms: Simple Example

Page 33: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 33

Simple example – f(x) = x² Finding the maximum of a function:

• f(x) = x²

• Range [0, 31] Goal: find max (31² = 961)

• Binary representation: string length 5 = 32 numbers (0-31)

= f(x)

Page 34: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 34

F(x) = x² - Start Population

100001String 5

4412110101String 4

1001001010String 3

9300011String 2

36600110String 1

fitnessvaluebinary

1

Page 35: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 35

F(x) = x² - Selection

100001String 5

4412110101String 4

1001001010String 3

9300011String 2

36600110String 1

fitnessvaluebinary

1

Worst one removed

Page 36: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 36

F(x) = x² - Selection

100001String 5

4412110101String 4

1001001010String 3

9300011String 2

36600110String 1

fitnessvaluebinary

1

Best individual: reproduces twice keep population size constant

Page 37: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 37

F(x) = x² - Selection

100001String 5

4412110101String 4

1001001010String 3

9300011String 2

36600110String 1

fitnessvaluebinary

1

All others are reproduced once

Page 38: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 38

F(x) = x² - Recombination

Parents and x-position randomly selected (equal recombination)

0 0 1 1 0

0 0 0 1 1

0 0 1 1 1

0 0 0 1 0

String 1:

String 2:

2String 4String 3

4String 2String 1

x-positionpartner

Page 39: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 39

F(x) = x² - Recombination

Parents and x-position randomly selected (equal recombination)

0 1 0 1 0

1 0 1 0 1

0 1 1 0 1

1 0 0 1 0

String 3:

String 4:

2String 4String 3

4String 2String 1

x-positionpartner

Page 40: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 40

F(x) = x² - Mutation

bit-flip:

• Offspring -String 1: 00111 (7) 10111 (23)

• String 4: 10101 (21) 10001 (17)

Page 41: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 41

F(x) = x²

• All individuals in the parent population are replaced by offspring in the new generation• (generations are discrete!)

• New population (Offspring):

289String 5

2561610000String 4

1691301101String 3

4200010String 2

5292310111String 1

fitnessvaluebinary

1710001

Page 42: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 42

F(x) = x² - End

Iterate until termination condition reached, e.g.:

• Number of generations

• Best fitness

• Process time

• No improvements after a number of generations

Result after one generation:

• Best individual: 10111 (23) – fitness 529

Page 43: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Genetic Algorithms - General

Page 44: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 44

Genetic algorithms

Meta SearchDifferences to other search and

optimization algorithms:• GAs search from a population of points

(possible solutions), not from a single point

• GAs use probabilistic, not deterministic rules

Page 45: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 45

History

In 1960s John H. Holland, University of Michigan:• Abstraction and generalisation of the

population concept with genetic coding and operators

Use in Bioinformatics, e.g.:• motif discovery,

• sequence alignment,

• protein structure prediction etc.

Page 46: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

46

Procedure of Genetic Algorithms

P(t): Parents in current generation t.

C(t): offspring in current generation t.

Page 47: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Coding and Mapping

Page 48: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 48

Genetic coding

Finite strings (= genome, represents genotype) Strings consists of units with information (unit =

gene) One string ( individual) = one possible

solution of the problem Genotype often real numbers or bit string:

0 1 0 1 0 11.853 0.492

Page 49: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 49

Genetic coding and mapping What should the phenotype look like and how to encode it

as a genotype? How does one map from genotype to phenotype,

considering the sources of variation (mutation and recombination)?

• Highly problem dependent!

• Hint: small changes to genotype should often result in small changes to phenotype, i.e. similar performance: heritability of traits!

• heritability of traits is important otherwise GA becomes only random search

Page 50: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 50

Mapping – Example

Binary coding versus Gray coding of a number

Hamming distance:

• Number of bits that have to be changed to map one string into another one

• E.g. 000 and 001 distance = 1

Remember: small changes in genotype should cause small changes in phenotype

Page 51: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 51

Mapping – Example cont‘d

• Binary coding of 0-7 (phenotype):

1106

1117

1015

1004

0113

0102

0011

0000

Page 52: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 52

Mapping – Example cont‘d

• Binary coding of 0-7 (phenotype):

1106

1117

1015

1004

0113

0102

0011

0000

• Hamming distance, e.g.:– 000 (0) and 001 (1)

• Distance = 1 (optimal)

– 011 (3) and 100 (4)• Distance = 3 (max possible)

Page 53: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 53

Mapping – Example cont‘d

• Gray coding of 0-7:

1016

1007

1115

1104

0103

0112

0011

0000

Page 54: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 54

Mapping – Example cont‘d

• Gray coding of 0-7:

1016

1007

1115

1104

0103

0112

0011

0000

• Hamming distance:– Two neighboring numbers

(phenotypes) have always a genotype distance of 1 (all differ only by one bit flip) – OPTIMAL mapping

Page 55: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 55

Mapping – Example cont‘d Comparing kinship with distance = 1

Binary: Gray:

Page 56: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Selection

Page 57: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 57

Selection

Based on fitness function:• Determines how „good“ an individual is (fitness)

• Better fitness, higher probability of selection

Selection of individuals for differential reproduction of offspring in next generation

Favors better solutions Decreases diversity in population

Page 58: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 58

Selection - Roulette-Wheel Each solution gets a

region on a roulette wheel according to its fitness

Spun wheel, select solution marked by roulette-wheel pointer

stochastic selection (better fitness = higher chance of reproduction)

http://www.edc.ncl.ac.uk/highlight/rhjanuary2007g02.php

Page 59: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 59

Selection - Elitism

Individual(s) kept unchanged for next population

Example:• Selection based on fitness values

• Keep the best individual of current population

• unrealistic but ensures best fitness of a generation never decreases decrease of diversity

Page 60: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 60

Selection - Tournament randomly select q individuals from current

population

Winner: individual(s) with best fitness among these q individuals

Example:• select the best two individuals as parents for

recombination

q=6 selection

Page 61: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Genetic variability operators

Page 62: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 62

Mutation Varies details, usually exploitive Changes one position in the string

• each position same small probability of undergoing a mutation

Goal: search around existing good solution, possibly leave local optima

0 1 0 0 0 0

1.853 1.807

Page 63: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 63

Recombination/Crossover

Usually explorative Creates new strings by combining parts of

two existing strings

00 1 000 1 1 1

100 10 1 00011 1

101

Parents:

Offspring:

1 1

Page 64: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 64

Recombination

• Unequal:– Crossover points independent for each

string chosen

00 1 000 1 1 1

100 10 1 00011 1

101

Parents:

Offspring:

1 1

Page 65: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Fitness

Page 66: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 66

Fitness function

Nature: • only survival and reproduction count

• how well do I do in my environment

Fitness space structure:• Defined by kinship of genotypes and fitness function

• Advantage: visual representation can be useful when thinking about model design

• Limitation: ideas might be too simplistic when not working on toy-problems - complex spaces and movements (think crossover!)

Page 67: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 67

Fitness space or landscape

Schema of genetic kinship How we “move” in that landscape over generations is

defined by our variability operators, usually mutation and recombination

Now add fitness…

0000 0001

100110001100

0100

0110 0010 0011

Page 68: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 68

Fitness space or landscape

Schema of genetic kinship How we “move” in that landscape over generations is

defined by our variability operators, usually mutation and recombination

Now add fitness…

0000 0001

100110001100

0100

0110 0010 0011

fitness

Page 69: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 69

Fitness landscapes contd. x/y axes: kinship, i.e. the more genetic resemblance the

closer together z axis: fitness

Every “snowflake” one

individual, search

focuses on “promising”

regions (due to

differential reproduction) Anim

ati

on a

dapte

d f

rom

Andy K

eane,

Uni. O

f South

am

pto

n

Page 70: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 70

Fitness space – Good design Easy to find the optimum by local search neighboring genotypes have similar fitness

(smooth curve high evolvability)

Fitn

ess

Genotypes

Page 71: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 71

Fitness space - Bad design Here we will have a hard time finding the optimum Low evolvability (fitness is right/wrong) Either problem not well suited for GA or bad design

Fitn

ess

Genotypes

Page 72: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 72

Fitness space – Mediocre design Many local optima, so we are likely to find one However not much of a gradient to find global

optimum, random search could do as well

Fitn

ess

Genotypes

Page 73: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 73

Dynamic fitness landscape

Fitness does not need to be static over generations

Can allow to reach

regions otherwise

uncovered

Natural fitness

certainly very dynamic Anim

ati

on b

y M

ichael H

erd

y,

TU

Berl

in

Page 74: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Design issues

Page 75: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 75

Integrating problem knowledge

Always to some degree in representation/ mapping

Create more complex fitness functionStart population chosen instead of a

uniform random one• Useful e.g. if constraints on range of solutions

• Possible problems: Loss of diversity and bias

Page 76: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 76

Design decisions

GAs: high flexibility and adaptability because of many options:• Problem representation

• Genetic operators with parameters

• Mechanism of selection

• Size of the population

• Fitness function Decisions are highly problem dependent Parameters not independent, you cannot

optimize them one by one

Page 77: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 77

Hints for the parameter search

Find balance between:• Exploration (new search regions)

• Exploitation (exhaustive search in current region) Parameters can be adaptable, e.g. from high in the

beginning (exploration) to low (exploitation), or even be subject to evolution themselves

Balance influenced by:• Mutation, recombination:

• create indiviuals that are in new regions (diversity!!)

• fine tuning in current regions

• Selection: focus on interesting regions

Page 78: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 78

Keep in mind Start population has a lot of diversity “Invest” search time in areas that have proven good

in the past Loss of diversity over evolutionary time Premature convergence: quick loss of diversity poses

high risk of getting stuck in local optima Evolvability:

• Fitness landscape should not be too rugged

• Heredity of traits

• Small genetic changes should be mapped to small phenotype changes

Page 79: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Wrapping up Part 1

Page 80: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 80

GA- Summary

Selection:• Focus on fittest individuals

Recombination:• Adds alternative solutions to population

Mutation: • Makes sure that most of the search space is

reached

Page 81: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 81

GA- Summary cont'd Advantages:

• Basic method simple and broadly applicable• No need for very detailed understandung of the problem• But can be adjusted to problem if knowledge present• Fast and can be scheduled in parallel

Disadvantages:• No guarantee to find best solution• High computational demands• Adapting to problems at hand can be hard, e.g. finding

suitable representation/mapping and evolutionary operators

• Search can get “caught” in local optima

Page 82: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

STRI, University of Hertfordshire 82

More recent inputs from Biology Populations are spatial, e.g. for “speciation”

• interaction (mating, competition) localized to maintain diversity

Populations have structure, e.g. niche protection

• competition will be stronger if many individuals do the same to maintain diversity

Diploidy with dominance / recessivity N-point crossover and other variants Morphogenesis instead of simple function mapping

(allowing for modularity, making crossover less fatal)

Page 83: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

GA Fundamental Theorems

Tian-Li [email protected]

Department of Electrical EngineeringNational Taiwan University

AcknowledgmentDavid E. Goldberg’s slides for his GA course.

Ying-Ping Chen’s slides for his EC course.

Page 84: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Agenda

• Schema theorem• Takeover & drift• Control map• Problem difficulty• BB hypothesis• Time-to-convergence• Population sizing• No-free-lunch (NFL) theorem

Page 85: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Derive the Schema Theorem

• Holland (1975).

• Ensure the growth of best schema on average.

• Conservative bound: ignores several favorable possibilities.

• Considers effect of different operators:– selection: proportionate– crossover: one-point– mutation: simple bitwise

Page 86: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Schema

• Schemata (pl.)

• Introduce the wild card ‘*’.– “1*0” denotes 100 and 110; we call 1 and 0 as fixed.

• Order of schema H: o(H)– The number of fixed positions.– o(1**0*) = 2

• Defining length of schema H: δ(H)– Distance between the 1st and the last fixed positions.– δ(1**0*) = 4-1 = 3

Page 87: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Schemata Growth: Selection

f

HfHmHm

)()()('

Selection Pressure on H

• m(H): # of individuals in the population that belong to H.

• The average fitness changes over time.• Exponential growth at the beginning.• Saturated when population is nearly converged.

Page 88: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

Schemata Disruptions

• One-Point XO:

• Mutation:

• Lower bounds.• Innovation not considered.

1

)(1)()('

l

HpHmHm c

)(1)()(' HopHmHm m

Page 89: Discrete Optimization Last Time  Local Search  Meta-Heuristic Search  Specific Search Strategies  Simulated Annealing  Stochastic Machines  Convergence.

The Schema Theorem

• If m(H,t+1) > m(H,t), the schema H grows.• Lower-bound estimation of schema growth.• Consider only destructive forces.

• Minimal, sequential, superior (ms2) schemata grow.

)(

1

)(1

)(),()1,( Hop

l

Hp

f

HftHmtHm mc