Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C....

17
Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley
  • date post

    30-Jan-2016
  • Category

    Documents

  • view

    223
  • download

    0

Transcript of Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C....

Page 1: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Tropical Geometry for Biology

Lior Pachter and Bernd SturmfelsDepartment of Mathematics

U.C. Berkeley

Page 2: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Tropical arithmetic• Annotation is sequence labeling• Annotation is important for biology• Annotation is tropical arithmetic

Tropical geometry• Tree basics• Tree reconstruction is important for biology • Tree space is the tropical Grassmanian

Back to the data

Page 3: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

What is annotation?

QuickTime™ and aTIFF (LZW) decompressor

are needed to see this picture.

INPUT: ..t..r…o..p..i..c..a..a..l...g..e..e..t..r..y..OUTPUT: ..t..r…o..p..i..c..a..a..l...g..e..e..t..r..y..

Annotation is the labeling of the input sequence,in this case with 3 colors:

ome

Page 4: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

TAAT ATGTCCACGG TTGTACACGGCA G GTATTGAGGTATTGAG ATGTAAC TGAA

Input: TAATATGTCCACGGGTATTGAGCATTGTACACGGGGTATTGAGCATGTAATGAA

Biology example: gene annotation

Output:

Leucine

Page 5: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

x

y

z

Best annotation for TAAT is obtained by evaluating

Example: assign “scores”, say x,y,z to each color regardless of letter

Finding a good annotationwith tropical arithmetic

Page 6: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Tropical arithmetic• Annotation is sequence labeling• Annotation is important for biology• Annotation is tropical arithmetic

Tropical geometry• Tree basics• Tree reconstruction is important for biology • Tree space is the tropical Grassmanian

Back to the data

Page 7: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

What is a phylogenetic X-tree?

In Darwin’s exampleX = {A,B,C,D,1}

Page 8: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Tree basics1 3

2 4

1 2

3 4

1 2

4 3

In general, the number of trees is the Schröder number(2n-5)!! = (2n-5)*(2n-7)*… 3*1

12

34

0.1

0.2

0.40.2

0.3

Page 9: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Data

Page 10: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Metrics and trees

[ dij ]Distance between species i and j

Page 11: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

A primate tree from genome sequences

Page 12: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Tree space is the tropical Grassmanian

Page 13: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Example: X={1,2,3,4,5}

31

2

4 5

Page 14: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Back to the data

Page 15: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.
Page 16: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

Alignment

Phylogeny

AnnotationMulti HMM Generalized HMM

Tree Markov models

GeneralizedMulti HMM

Evol. HMM Generalized hidden MarkovPhylogeny

Graphical Models

Final message: Tropical mathematics is important for comparative genomics.

Page 17: Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.

For more on mathematics and tropical geometry (and combinatorics and algebra and statistics…):L. Pachter and B. Sturmfels, Tropical Geometry of Statistical Models, PNAS 101, 2004L. Pachter and B. Sturmfels, Parametric Inference for Biological Sequence Analysis, PNAS 101, 2004D. Speyer and B. Sturmfels, The Tropical Grassmanian, Advances in Geometry 4, 2004.L. Pachter and B. Sturmfels, Mathematics of Phylogenomics, arxiv math.ST/0409132, 2004.

and coming soon:

Book (to be published by Cambridge University Press)

Algebraic Statistics for Computational Biologyedited by Pachter and Sturmfels