Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters...

72
Small phylogeny problem: character evolution trees Arvind Gupta J ´ an Ma ˇ nuch Ladislav Stacho and Chenchen Zhu School of Computing Science and Department of Mathematics Simon Fraser University, Canada Small phylogeny problem:character evolution trees – p. 1/1

Transcript of Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters...

Page 1: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny problem:character evolution trees

Arvind Gupta Jan Manuch Ladislav Stacho and Chenchen Zhu

School of Computing Science and Department of Mathematics

Simon Fraser University, Canada

Small phylogeny problem:character evolution trees – p. 1/14

Page 2: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Phylogenetics

science determining ancestor/descendent relationshipsbetween species

Small phylogeny problem:character evolution trees – p. 2/14

Page 3: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Phylogenetics

science determining ancestor/descendent relationshipsbetween species

usually expressed by phylogenetic trees

Small phylogeny problem:character evolution trees – p. 2/14

Page 4: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Phylogenetics

science determining ancestor/descendent relationshipsbetween species

usually expressed by phylogenetic trees

Small phylogeny problem:character evolution trees – p. 2/14

Page 5: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Phylogenetics

science determining ancestor/descendent relationshipsbetween species

usually expressed by phylogenetic trees

the leaves represent extant speciesinternal nodes hypothetical ancestors

Small phylogeny problem:character evolution trees – p. 2/14

Page 6: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Phylogenetics

science determining ancestor/descendent relationshipsbetween species

usually expressed by phylogenetic treesv1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

the leaves represent extant speciesinternal nodes hypothetical ancestors

Small phylogeny problem:character evolution trees – p. 2/14

Page 7: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Characters

Principle of parsimony: the goal is to find the treerequiring the smallest number/score of evolutionarytransitions (such as the loss of one character, or themodification or gain of another).

Small phylogeny problem:character evolution trees – p. 3/14

Page 8: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Characters

Principle of parsimony: the goal is to find the treerequiring the smallest number/score of evolutionarytransitions (such as the loss of one character, or themodification or gain of another).

Each character (a morphological feature, a site in aDNA sequence, etc.) takes on one of a few possiblestates.

Small phylogeny problem:character evolution trees – p. 3/14

Page 9: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Characters

Principle of parsimony: the goal is to find the treerequiring the smallest number/score of evolutionarytransitions (such as the loss of one character, or themodification or gain of another).

Each character (a morphological feature, a site in aDNA sequence, etc.) takes on one of a few possiblestates.

Species can be modeled as vectors of states of a groupof characters.

Small phylogeny problem:character evolution trees – p. 3/14

Page 10: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Large phylogeny problem

given:set of charactersset of states for each charactercosts of transitions from one state to anotherextant species (labeled with states for eachcharacter)

Small phylogeny problem:character evolution trees – p. 4/14

Page 11: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Large phylogeny problem

given:set of charactersset of states for each charactercosts of transitions from one state to anotherextant species (labeled with states for eachcharacter)

task:find a phylogeny tree and a labeling of internalnodes that minimizes cost over all evolutionary steps(principle of parsimony)

Small phylogeny problem:character evolution trees – p. 4/14

Page 12: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Large phylogeny problem

given:set of charactersset of states for each charactercosts of transitions from one state to anotherextant species (labeled with states for eachcharacter)

task:find a phylogeny tree and a labeling of internalnodes that minimizes cost over all evolutionary steps(principle of parsimony)

This problem is NP-hard [Foulds, Graham (1982)].

Small phylogeny problem:character evolution trees – p. 4/14

Page 13: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny problem

given:set of charactersset of states for each charactercosts of transitions from one state to anotherextant species (labeled with states for eachcharacter)structure of phylogeny tree (extant species areleaves of the tree)

Small phylogeny problem:character evolution trees – p. 5/14

Page 14: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny problem

given:set of charactersset of states for each charactercosts of transitions from one state to anotherextant species (labeled with states for eachcharacter)structure of phylogeny tree (extant species areleaves of the tree)

task:find a labeling of internal nodes that minimizes costover all evolutionary steps

Small phylogeny problem:character evolution trees – p. 5/14

Page 15: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny problem

given:set of charactersset of states for each charactercosts of transitions from one state to anotherextant species (labeled with states for eachcharacter)structure of phylogeny tree (extant species areleaves of the tree)

task:find a labeling of internal nodes that minimizes costover all evolutionary steps

There are polynomial algorithms: [Fitch (1971)](uniform costs), [Sankoff (1975)] (non-uniform costs).

Small phylogeny problem:character evolution trees – p. 5/14

Page 16: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Character evolution tree

So far we assumed that during one evolutionary stepone state of a character can change to any other state.However, for many characters character state orderand character state polarity can be observed.

Example: character evolution trees for pollen

Small phylogeny problem:character evolution trees – p. 6/14

Page 17: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Character evolution tree

So far we assumed that during one evolutionary stepone state of a character can change to any other state.However, for many characters character state orderand character state polarity can be observed.

Example: character evolution trees for pollen

one furrow

three furrows multiple pores

Small phylogeny problem:character evolution trees – p. 6/14

Page 18: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Character evolution tree

So far we assumed that during one evolutionary stepone state of a character can change to any other state.However, for many characters character state orderand character state polarity can be observed.

Example: character evolution trees for pollen

one furrow

three furrows multiple pores

Our goal is to find a method of directly comparing acharacter evolution trees with a phylogenetic trees.

Small phylogeny problem:character evolution trees – p. 6/14

Page 19: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

given:character evolution tree Hh with V (H) being states ofthe charactera phylogeny tree Gg with leaves L(G) being extantspeciesa leaf labeling p : L(G) → V (H) (a partial function)

a

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

p

Small phylogeny problem:character evolution trees – p. 7/14

Page 20: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

given:character evolution tree Hh with V (H) being states ofthe charactera phylogeny tree Gg with leaves L(G) being extantspeciesa leaf labeling p : L(G) → V (H) (a partial function)

a

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

p

Small phylogeny problem:character evolution trees – p. 7/14

Page 21: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

given:character evolution tree Hh with V (H) being states ofthe charactera phylogeny tree Gg with leaves L(G) being extantspeciesa leaf labeling p : L(G) → V (H) (a partial function)

task:find a labeling l : V (G) → V (H) which is:

p-constrained

Small phylogeny problem:character evolution trees – p. 7/14

Page 22: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

given:character evolution tree Hh with V (H) being states ofthe charactera phylogeny tree Gg with leaves L(G) being extantspeciesa leaf labeling p : L(G) → V (H) (a partial function)

task:find a labeling l : V (G) → V (H) which is:

p-constraineda

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

p

Small phylogeny problem:character evolution trees – p. 7/14

Page 23: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

given:character evolution tree Hh with V (H) being states ofthe charactera phylogeny tree Gg with leaves L(G) being extantspeciesa leaf labeling p : L(G) → V (H) (a partial function)

task:find a labeling l : V (G) → V (H) which is:

p-constraineda

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

l

Small phylogeny problem:character evolution trees – p. 7/14

Page 24: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

given:character evolution tree Hh with V (H) being states ofthe charactera phylogeny tree Gg with leaves L(G) being extantspeciesa leaf labeling p : L(G) → V (H) (a partial function)

task:find a labeling l : V (G) → V (H) which is:

p-constrainedif a species v is a child of a species u then thecharacter state l(v) is either equivalent to, or achild of the character state l(u)

Small phylogeny problem:character evolution trees – p. 7/14

Page 25: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

task:find a labeling l : V (G) → V (H) which is:

p-constrainedif a species v is a child of a species u then thecharacter state l(v) is either equivalent to, or achild of the character state l(u)

Hh Gg

a

h

u

v

g

l

Small phylogeny problem:character evolution trees – p. 7/14

Page 26: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

task:find a labeling l : V (G) → V (H) which is:

p-constrainedif a species v is a child of a species u then thecharacter state l(v) is either equivalent to, or achild of the character state l(u)

Hh Gg

l(u)

l(v)

h

u

v

g

l

Small phylogeny problem:character evolution trees – p. 7/14

Page 27: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

task:find a labeling l : V (G) → V (H) which is:

p-constrainedif a species v is a child of a species u then thecharacter state l(v) is either equivalent to, or achild of the character state l(u)does not allow “scattering” [Lipscomb (1992)]

Small phylogeny problem:character evolution trees – p. 7/14

Page 28: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

task:find a labeling l : V (G) → V (H) which is:

p-constrainedif a species v is a child of a species u then thecharacter state l(v) is either equivalent to, or achild of the character state l(u)does not allow “scattering” [Lipscomb (1992)]

a

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

l

Small phylogeny problem:character evolution trees – p. 7/14

Page 29: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

task:find a labeling l : V (G) → V (H) which is:

p-constrainedif a species v is a child of a species u then thecharacter state l(v) is either equivalent to, or achild of the character state l(u)does not allow “scattering” [Lipscomb (1992)]

a

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

l

scatteringscattering

Small phylogeny problem:character evolution trees – p. 7/14

Page 30: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Small phylogeny with character evolution

task:find a labeling l : V (G) → V (H) which is:

p-constrainedif a species v is a child of a species u then thecharacter state l(v) is either equivalent to, or achild of the character state l(u)does not allow “scattering” [Lipscomb (1992)]

existence of such labeling l is very close tograph-theoretical notion of graph minors

Small phylogeny problem:character evolution trees – p. 7/14

Page 31: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

bag-set of a state a: the set of connected components(called bags) of the subgraph of Gg induced by verticesin l−1(a)

Small phylogeny problem:character evolution trees – p. 8/14

Page 32: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

bag-set of a state a: the set of connected components(called bags) of the subgraph of Gg induced by verticesin l−1(a)

a

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

l

Small phylogeny problem:character evolution trees – p. 8/14

Page 33: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

bag-set of a state a: the set of connected components(called bags) of the subgraph of Gg induced by verticesin l−1(a)

a

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

l

Small phylogeny problem:character evolution trees – p. 8/14

Page 34: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

bag-set of a state a: the set of connected components(called bags) of the subgraph of Gg induced by verticesin l−1(a)

a

b c

mammalsv3

snakes

l

Small phylogeny problem:character evolution trees – p. 8/14

Page 35: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

bag-set of a state a: the set of connected components(called bags) of the subgraph of Gg induced by verticesin l−1(a)

evolutionary step 〈u, v〉 is a realization of anevolutionary transition 〈a, b〉 if l(u) = a and l(v) = b

Small phylogeny problem:character evolution trees – p. 8/14

Page 36: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

bag-set of a state a: the set of connected components(called bags) of the subgraph of Gg induced by verticesin l−1(a)

evolutionary step 〈u, v〉 is a realization of anevolutionary transition 〈a, b〉 if l(u) = a and l(v) = b

a

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

l

Small phylogeny problem:character evolution trees – p. 8/14

Page 37: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

bag-set of a state a: the set of connected components(called bags) of the subgraph of Gg induced by verticesin l−1(a)

evolutionary step 〈u, v〉 is a realization of anevolutionary transition 〈a, b〉 if l(u) = a and l(v) = b

a

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

lrealizationsrealizations

Small phylogeny problem:character evolution trees – p. 8/14

Page 38: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

bag-set of a state a: the set of connected components(called bags) of the subgraph of Gg induced by verticesin l−1(a)

evolutionary step 〈u, v〉 is a realization of anevolutionary transition 〈a, b〉 if l(u) = a and l(v) = b

We say that Hh is a rooted-tree minor of Gg, if thereexists a labeling l : V (G) → V (H) satisfying:

(1) for each character state a, the bag-set of a containsexactly one component; and

(2) each evolutionary transition has a realization.

Small phylogeny problem:character evolution trees – p. 8/14

Page 39: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

We say that Hh is a rooted-tree minor of Gg, if thereexists a labeling l : V (G) → V (H) satisfying:

(1) for each character state a, the bag-set of a containsexactly one component; and

(2) each evolutionary transition has a realization.

a

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

l

Small phylogeny problem:character evolution trees – p. 8/14

Page 40: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

We say that Hh is a rooted-tree minor of Gg, if thereexists a labeling l : V (G) → V (H) satisfying:

(1) for each character state a, the bag-set of a containsexactly one component; and

(2) each evolutionary transition has a realization.

however, there exists such p-constrained labeling la

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

p

Small phylogeny problem:character evolution trees – p. 8/14

Page 41: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

We say that Hh is a rooted-tree minor of Gg, if thereexists a labeling l : V (G) → V (H) satisfying:

(1) for each character state a, the bag-set of a containsexactly one component; and

(2) each evolutionary transition has a realization.

however, there exists such p-constrained labeling la

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

l

Small phylogeny problem:character evolution trees – p. 8/14

Page 42: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor

We say that Hh is a rooted-tree minor of Gg, if thereexists a labeling l : V (G) → V (H) satisfying:

(1) for each character state a, the bag-set of a containsexactly one component; and

(2) each evolutionary transition has a realization.

however, there exists such p-constrained labeling la

b c

v1

mammals

v2

turtles

v3

snakes

v4

crocodiles birds

l

Small phylogeny problem:character evolution trees – p. 8/14

Page 43: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Small phylogeny problem:character evolution trees – p. 9/14

Page 44: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Tree minor problem is NP-hard. [Matousek, Thomas(1992)]

Small phylogeny problem:character evolution trees – p. 9/14

Page 45: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Tree minor problem is NP-hard. [Matousek, Thomas(1992)]

Tree minor problem can be converted to Rooted-treeminor problem, hence Rooted-tree minor problem isalso NP-hard (when p is the empty leaf labeling).

Small phylogeny problem:character evolution trees – p. 9/14

Page 46: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Tree minor problem is NP-hard. [Matousek, Thomas(1992)]

Tree minor problem can be converted to Rooted-treeminor problem, hence Rooted-tree minor problem isalso NP-hard (when p is the empty leaf labeling).

Consider an instance of Tree minor problem:

a

b c

d

1

2

4

Small phylogeny problem:character evolution trees – p. 9/14

Page 47: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Tree minor problem is NP-hard. [Matousek, Thomas(1992)]

Tree minor problem can be converted to Rooted-treeminor problem, hence Rooted-tree minor problem isalso NP-hard (when p is the empty leaf labeling).

Hu

. . .uβ2 β3 βn

α

G1

v1G2

v2

Gnvn

. . .

v1 v2 vn

γ

Small phylogeny problem:character evolution trees – p. 9/14

Page 48: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Theorem 1. Rooted-tree minor problem is NP-hard.

Small phylogeny problem:character evolution trees – p. 9/14

Page 49: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Theorem 1. Rooted-tree minor problem is NP-hard.

Theorem 2. If the leaf labeling p is complete, thenRooted-tree minor problem can be decided in lineartime.

Small phylogeny problem:character evolution trees – p. 9/14

Page 50: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Theorem 1. Rooted-tree minor problem is NP-hard.

Theorem 2. If the leaf labeling p is complete, thenRooted-tree minor problem can be decided in lineartime.

Compute the LCA-tree.(The label of each species is the least commonancestor of labels of its children.)This can be done in linear time (requirespreprocessing on the character evolution tree [Harel,Tarjan (1984)]).

Small phylogeny problem:character evolution trees – p. 9/14

Page 51: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Theorem 1. Rooted-tree minor problem is NP-hard.

Theorem 2. If the leaf labeling p is complete, thenRooted-tree minor problem can be decided in lineartime.

Compute the LCA-tree.Fix labels of inner vertices of all single branch pathsin Gg (if possible).Crucial lemma: the ends of single branch paths arealready fixed correctly for any labeling l satisfying thedefinition of Rooted-tree minor.

Small phylogeny problem:character evolution trees – p. 9/14

Page 52: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Rooted-tree minor problem

Rooted-tree minor problem. Given two rooted trees Hh

and Gg, and a leaf labeling p : L(Gg) → V (H). Decidewhether Hh is a rooted-tree minor of Gg with respect to p.

Theorem 1. Rooted-tree minor problem is NP-hard.

Theorem 2. If the leaf labeling p is complete, thenRooted-tree minor problem can be decided in lineartime.

Compute the LCA-tree.Fix labels of inner vertices of all single branch pathsin Gg (if possible).If each evolutionary transition has exactly onerealization accept the input.

Small phylogeny problem:character evolution trees – p. 9/14

Page 53: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Incongruences

Hh Gg

b

a

u

v

Hh Gg

a

cb

u

v

Hh Gg

a b

u

v

Inversion Transitivity Addition

Hh Gg

b

au

v

Hh Gg

a

b

Separation NegligenceRemark: solid lines represent arcs, wavy lines directedpaths of length at least one.

Small phylogeny problem:character evolution trees – p. 10/14

Page 54: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Incongruences

Hh Gg

b

a

u

v

Hh Gg

a

cb

u

v

Hh Gg

a b

u

v

Inversion Transitivity Addition

Hh Gg

b

au

v

Hh Gg

a

b

Separation Negligence

find labeling of nodes in phylogenetic tree minimizingthe number of incongruences

Small phylogeny problem:character evolution trees – p. 10/14

Page 55: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Parsimony criteria

Given two rooted trees Hh and Gg with a labeling l : G → H

and a weight function d on Hh, the arc cost and the bag costof l are defined as follows:

arccost(Hh, Gg, l) : =∑

〈u,v〉∈A(Gg)

d(l(u), l(v)),

bagcost(Hh, Gg, l) : =∑

v∈V (H)

size of the bag-set of v.

Small phylogeny problem:character evolution trees – p. 11/14

Page 56: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Parsimony criteria

Given two rooted trees Hh and Gg with a labeling l : G → H

and a weight function d on Hh, the arc cost and the bag costof l are defined as follows:

arccost(Hh, Gg, l) : =∑

〈u,v〉∈A(Gg)

d(l(u), l(v)),

bagcost(Hh, Gg, l) : =∑

v∈V (H)

size of the bag-set of v.

a

b c

d

1

2

4

Small phylogeny problem:character evolution trees – p. 11/14

Page 57: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Relaxations of Rooted-tree minor

Relax-minor: Given two rooted trees Hh and Gg with aleaf labeling p, we say that Hh is a relax-minor of Gg

with respect to p if there exists a p-constrained labelingfunction l : G → H satisfying the following twoconditions:

each evolutionary transition has a realization (nonegligence); andif u is an ancestor of v, then l(v) cannot be a properancestor of l(u) (no inversion).

Small phylogeny problem:character evolution trees – p. 12/14

Page 58: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Relaxations of Rooted-tree minor

Relax-minor:each evolutionary transition has a realization (nonegligence); andif u is an ancestor of v, then l(v) cannot be a properancestor of l(u) (no inversion).

Example of other incongruences:

a

b

c

d

Ha

v1

v2 v3

v4 v5 v6 v7

separationseparation

transitivity additionGv1

Small phylogeny problem:character evolution trees – p. 12/14

Page 59: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Relaxations of Rooted-tree minor

Given two rooted trees Hh and Gg with a leaf labeling p,we say that Hh is a pseudo-minor of Gg with respect top if there exists a p-constrained labeling functionl : G → H such that

for every evolutionary step 〈u, v〉, l(u) is an ancestorof l(v) (no addition and inversion).

Hh Gg

l(u)

l(v)

u

v

l

Small phylogeny problem:character evolution trees – p. 12/14

Page 60: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Relaxations of Rooted-tree minor

Given two rooted trees Hh and Gg with a leaf labeling p,we say that Hh is a pseudo-minor of Gg with respect top if there exists a p-constrained labeling functionl : G → H such that

for every evolutionary step 〈u, v〉, l(u) is an ancestorof l(v) (no addition and inversion).

Example of other incongruences:

a

b c

e d

Ha

v1

v2

v3

v4 v5

separationseparation

transitivity

negligence

Gv1

Small phylogeny problem:character evolution trees – p. 12/14

Page 61: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Relaxations of Rooted-tree minor

summary:

rooted minor relax-minor pseudo-minorinversion N N Ntransitivity N Y Yaddition N Y Nseparation N Y Ynegligence N N Y

Small phylogeny problem:character evolution trees – p. 12/14

Page 62: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Relax-minor problems

Problem 1. Find a relax-minor labeling with the minimalbag-cost.

Small phylogeny problem:character evolution trees – p. 13/14

Page 63: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Relax-minor problems

Problem 1. Find a relax-minor labeling with the minimalbag-cost.

NP-hard if p is the empty leaf labeling.

Small phylogeny problem:character evolution trees – p. 13/14

Page 64: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Relax-minor problems

Problem 1. Find a relax-minor labeling with the minimalbag-cost.

NP-hard if p is the empty leaf labeling.NP-hard even when p is a complete leaf labeling.

Hh

Gg

Yh

Zg

v1 v2 vk. . .

g

h

g

h

v′1

v′2

v′k. . .

v′′1

v′′2

v′′k. . .

w1w2 wk. . .

Small phylogeny problem:character evolution trees – p. 13/14

Page 65: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Relax-minor problems

Problem 1. Find a relax-minor labeling with the minimalbag-cost.

NP-hard if p is the empty leaf labeling.NP-hard even when p is a complete leaf labeling.

Problem 2. Find a relax-minor labeling with the minimalarc-cost.

Small phylogeny problem:character evolution trees – p. 13/14

Page 66: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Relax-minor problems

Problem 1. Find a relax-minor labeling with the minimalbag-cost.

NP-hard if p is the empty leaf labeling.NP-hard even when p is a complete leaf labeling.

Problem 2. Find a relax-minor labeling with the minimalarc-cost.

Since the relax-minor allows addition, the arc-cost isnot always finite.

Small phylogeny problem:character evolution trees – p. 13/14

Page 67: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Relax-minor problems

Problem 1. Find a relax-minor labeling with the minimalbag-cost.

NP-hard if p is the empty leaf labeling.NP-hard even when p is a complete leaf labeling.

Problem 2. Find a relax-minor labeling with the minimalarc-cost.

Since the relax-minor allows addition, the arc-cost isnot always finite.NP-hard if p is the empty leaf labeling.

Small phylogeny problem:character evolution trees – p. 13/14

Page 68: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Relax-minor problems

Problem 1. Find a relax-minor labeling with the minimalbag-cost.

NP-hard if p is the empty leaf labeling.NP-hard even when p is a complete leaf labeling.

Problem 2. Find a relax-minor labeling with the minimalarc-cost.

Since the relax-minor allows addition, the arc-cost isnot always finite.NP-hard if p is the empty leaf labeling.Open problems.

Is it possible to solve Problem 2 in polynomial timewhen p is a complete leaf labeling?Is it possible to decide whether there is arelax-minor labeling with finite arc-cost in P time?

Small phylogeny problem:character evolution trees – p. 13/14

Page 69: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Pseudo-minor problems

Problem 3. Find a pseudo-minor labeling with theminimal bag-cost.

Small phylogeny problem:character evolution trees – p. 14/14

Page 70: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Pseudo-minor problems

Problem 3. Find a pseudo-minor labeling with theminimal bag-cost.

Can be done in linear time:compute the LCA-tree;if a species does not belong to a bag containing aleaf, change its label to the label of the root.

Small phylogeny problem:character evolution trees – p. 14/14

Page 71: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Pseudo-minor problems

Problem 3. Find a pseudo-minor labeling with theminimal bag-cost.

Can be done in linear time:compute the LCA-tree;if a species does not belong to a bag containing aleaf, change its label to the label of the root.

Problem 3. Find a pseudo-minor labeling with theminimal arc-cost.

Small phylogeny problem:character evolution trees – p. 14/14

Page 72: Small phylogeny problem: character evolution treesstelo/cpm/cpm04/17_Jan.pdf · set of characters set of states for each character costs of transitions from one state to another extant

Complexities of Pseudo-minor problems

Problem 3. Find a pseudo-minor labeling with theminimal bag-cost.

Can be done in linear time:compute the LCA-tree;if a species does not belong to a bag containing aleaf, change its label to the label of the root.

Problem 3. Find a pseudo-minor labeling with theminimal arc-cost.

Can be done in linear time:compute the LCA-tree.

Small phylogeny problem:character evolution trees – p. 14/14