Homology assessment and molecular sequence alignment.


Page 1: Homology assessment and molecular sequence alignment.

Homology assessment and molecular sequence alignment.

Chris Stewart and Ka Yi Ling

Genetics 677

Page 2: Homology assessment and molecular sequence alignment.

Classical Phylogenetics

Molecular Phylogenetics

Homology

Page 3: Homology assessment and molecular sequence alignment.

Big picture

Evolution

Divergent Convergent

Homology Analogy

Orthologs Paralogs

Systematics

Page 4: Homology assessment and molecular sequence alignment.

Homology

1. Equal in position and details in structure

2. Equal in developmental origin (i.e. cellular/tissue structure)

3. Logical and continual series of character state transformations

Figure from http://images-eu.amazon.com/images/P/0895262002.01.LZZZZZZZ.jpg

Page 5: Homology assessment and molecular sequence alignment.

Homology

[Figure: tree showing speciation and duplication events; labels C, B, A]
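The speciation/duplication distinction on this slide is what separates orthologs from paralogs: two homologous genes are orthologs if their last common ancestor is a speciation event, and paralogs if it is a duplication event. The sketch below illustrates that rule. The gene tree, the gene names and the `relationship` helper are hypothetical, made up for illustration, and are not taken from the slide's figure.

```python
# Hypothetical gene tree: internal nodes are (event, left, right) tuples,
# leaves are gene names. Genes and events are illustrative only.
GENE_TREE = (
    "duplication",
    ("speciation", "human_A", "mouse_A"),
    ("speciation", "human_B", "mouse_B"),
)


def leaves(node):
    """Return the set of gene names found at or below a node."""
    if isinstance(node, str):
        return {node}
    _, left, right = node
    return leaves(left) | leaves(right)


def relationship(node, gene1, gene2):
    """Classify two genes by the event at their last common ancestor."""
    if gene1 == gene2 or not {gene1, gene2} <= leaves(node):
        raise ValueError("need two distinct genes from the tree")
    event, left, right = node
    for child in (left, right):
        below = leaves(child)
        if gene1 in below and gene2 in below:
            # Both genes are in the same subtree, so look deeper.
            return relationship(child, gene1, gene2)
    # The genes sit on different sides of this node: it is their
    # last common ancestor, and its event decides the label.
    return "orthologs" if event == "speciation" else "paralogs"


if __name__ == "__main__":
    print(relationship(GENE_TREE, "human_A", "mouse_A"))
    print(relationship(GENE_TREE, "human_A", "human_B"))
```

Note that under this rule human_A and mouse_B are also paralogs, even though they come from different species, because the duplication predates the speciation.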

Page 6: Homology assessment and molecular sequence alignment.

Character

Trait of a group of organisms that has two or more independent states that can be evaluated

http://www.choose-life.org/Map_states_color.jpg

Page 7: Homology assessment and molecular sequence alignment.

Parsimony

Working principle that prefers the least complex explanation for an observation

Figure from http://www.cartoonchurch.com/cartoons/large/simple-living-cartoon.gif

Page 8: Homology assessment and molecular sequence alignment.

Classical phylogenetics

Method of parsimony analysis used to develop cladograms explaining evolutionary relationships.

Fig 1. Hypothetical cladogram

Page 10: Homology assessment and molecular sequence alignment.

Matrix

Taxon      Segmented  Jawed  Hair  Placenta  Multi-cell  Limbs
Cat        1          1      1     1         1           1
Kangaroo   1          1      1     0         1           1
Lizard     1          1      0     0         1           1
Salmon     1          1      0     0         1           0
Earthworm  1          0      0     0         1           0
Sponge     0          0      0     0         1           0
Amoeba     0          0      0     0         0           0
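A character matrix like this one can be held as a simple mapping from taxon to a list of states. The sketch below is one possible encoding, assuming each taxon's row follows the nested pattern of the characters (for example, only Cat has a placenta and only Amoeba is not multicellular). The helper names `taxa_with` and `shared_derived` are hypothetical.

```python
CHARACTERS = ["Segmented", "Jawed", "Hair", "Placenta", "Multi-cell", "Limbs"]

# 1 = derived state present, 0 = absent. Row assignment is an assumption
# based on the nested pattern of the characters.
MATRIX = {
    "Cat":       [1, 1, 1, 1, 1, 1],
    "Kangaroo":  [1, 1, 1, 0, 1, 1],
    "Lizard":    [1, 1, 0, 0, 1, 1],
    "Salmon":    [1, 1, 0, 0, 1, 0],
    "Earthworm": [1, 0, 0, 0, 1, 0],
    "Sponge":    [0, 0, 0, 0, 1, 0],
    "Amoeba":    [0, 0, 0, 0, 0, 0],
}


def taxa_with(character):
    """List the taxa that show the derived state (1) for a character."""
    idx = CHARACTERS.index(character)
    return [taxon for taxon, states in MATRIX.items() if states[idx] == 1]


def shared_derived(taxon1, taxon2):
    """Count the characters for which both taxa show the derived state."""
    return sum(a == 1 and b == 1 for a, b in zip(MATRIX[taxon1], MATRIX[taxon2]))


if __name__ == "__main__":
    for character in CHARACTERS:
        print(character, taxa_with(character))
    print(shared_derived("Cat", "Kangaroo"))
```

Counting shared derived states is only a first look at the data; choosing among cladograms needs a tree-based criterion such as parsimony.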

Page 11: Homology assessment and molecular sequence alignment.

Possible cladograms

...which one do you pick?
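Under parsimony, one way to pick among candidate cladograms is to count the minimum number of character state changes each tree requires and prefer the tree with the fewest. The sketch below does this with Fitch's small-parsimony algorithm on rooted binary trees written as nested tuples. The two candidate trees are hypothetical, and the matrix assumes each taxon's row follows the nested pattern of the characters (Segmented, Jawed, Hair, Placenta, Multi-cell, Limbs).

```python
# Character states per taxon, in the order:
# Segmented, Jawed, Hair, Placenta, Multi-cell, Limbs (assumed assignment).
MATRIX = {
    "Cat":       [1, 1, 1, 1, 1, 1],
    "Kangaroo":  [1, 1, 1, 0, 1, 1],
    "Lizard":    [1, 1, 0, 0, 1, 1],
    "Salmon":    [1, 1, 0, 0, 1, 0],
    "Earthworm": [1, 0, 0, 0, 1, 0],
    "Sponge":    [0, 0, 0, 0, 1, 0],
    "Amoeba":    [0, 0, 0, 0, 0, 0],
}


def fitch(tree, states):
    """Return (possible ancestral states, changes) for one character."""
    if isinstance(tree, str):          # leaf: its observed state, no change
        return {states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch(left, states)
    right_set, right_cost = fitch(right, states)
    common = left_set & right_set
    if common:                         # children agree: no extra change
        return common, left_cost + right_cost
    # Children disagree: one change is needed on this part of the tree.
    return left_set | right_set, left_cost + right_cost + 1


def tree_length(tree, matrix):
    """Sum the minimum number of changes over all characters."""
    n_chars = len(next(iter(matrix.values())))
    total = 0
    for i in range(n_chars):
        states = {taxon: row[i] for taxon, row in matrix.items()}
        total += fitch(tree, states)[1]
    return total


# Two hypothetical candidates that differ only in where Salmon and Lizard sit.
TREE_A = ("Amoeba", ("Sponge", ("Earthworm",
          ("Salmon", ("Lizard", ("Kangaroo", "Cat"))))))
TREE_B = ("Amoeba", ("Sponge", ("Earthworm",
          ("Lizard", ("Salmon", ("Kangaroo", "Cat"))))))

if __name__ == "__main__":
    print(tree_length(TREE_A, MATRIX), tree_length(TREE_B, MATRIX))
```

With this matrix, the first tree needs one change per character, while the second needs an extra change for Limbs (a homoplasy), so parsimony prefers the first.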

Page 12: Homology assessment and molecular sequence alignment.

Homoplasy & subjective characters

A faulty assignment of primary homology

Figure from: http://www.blackwellpublishing.com/ridley/images/analogies.jpg

Page 13: Homology assessment and molecular sequence alignment.

Things to consider

• Auxiliary Principle

• Congruence Test

Page 14: Homology assessment and molecular sequence alignment.

More things to consider

• Weighting
  – Needs to respond to homoplasy

• Independent characters

http://ksuoncampus.com/2008/01/29/evolution-of-mario/

Page 15: Homology assessment and molecular sequence alignment.

Molecular Phylogenetics

• Goal
  – To infer process from pattern

• Why
  – Not just the observables
  – Alternative method to derive evolutionary relationships

Page 16: Homology assessment and molecular sequence alignment.

Sequence alignment programs

Figure modified from http://bioinfo.ochoa.fib.es/docus/courses/Ali2005Filogenias/seq_analysis/images/SeqAnalFloChart.gif

Protein or Gene of interest


Page 17: Homology assessment and molecular sequence alignment.

Molecular characters

• What can be used?
  – Nucleotide sequences
  – Protein sequences
  – DNA
  – RNA
  – Protein

• NO single universally accepted recipe

Figure from http://www.ittc.ku.edu/bioinfo_seminar/images/wheel.gif

Page 18: Homology assessment and molecular sequence alignment.

List of alignment software

Page 19: Homology assessment and molecular sequence alignment.

Sequence Alignment

• Types
  – Pairwise alignment
  – Multiple sequence alignment

Figure from http://en.wikipedia.org/wiki/Sequence_alignment

Page 20: Homology assessment and molecular sequence alignment.

Human Molecular Genetics, 2006, Vol. 15, Review Issue 1, R54

Page 21: Homology assessment and molecular sequence alignment.

The nuts and bolts

1. Gene/protein of interest

2. Homolog search

3. Sequence alignment

4. Tree building

Figure from http://www.usingneuralnetworks.com/images/Face_But_Not_The_Name_Cartoon.jpg

Page 22: Homology assessment and molecular sequence alignment.

Database Search: BLAST

• Basic Local Alignment Search Tool

• Input: protein or nucleotide sequence

• Default scoring matrix (protein searches): BLOSUM62

• Other scoring matrices: PAM family, BLOSUM family

• Sites that use BLAST: NCBI, EBI, GenomeNet, PIR, DDBJ

Figure: NCBI alignment result site.

Page 23: Homology assessment and molecular sequence alignment.

How does BLAST score an alignment?

Default matrix in BLAST 2.0

BLOSUM = BLOcks SUbstitution Matrix

Based on local alignments

Page 24: Homology assessment and molecular sequence alignment.

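How does BLAST score an alignment?

BLOSUM62: sequences at least 62% identical are clustered, and each cluster's contributions are weighted to sum to one.

Scores: integer log-odds values, positive for substitutions seen more often than expected by chance and negative for those seen less often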
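To make the scoring concrete, the score of an ungapped stretch of an alignment is the sum of the matrix entries for each aligned residue pair. The sketch below uses a small excerpt of BLOSUM62 (only the pairs needed for the example) and ignores gap penalties. The two peptide strings and the helper names are hypothetical.

```python
# Excerpt of BLOSUM62. The matrix is symmetric, so each pair is stored once.
# Only the entries needed for the example are listed here.
BLOSUM62_EXCERPT = {
    ("A", "A"): 4,
    ("W", "W"): 11,
    ("K", "K"): 5,
    ("L", "L"): 4,
    ("I", "L"): 2,
    ("K", "R"): 2,
    ("D", "E"): 2,
}


def pair_score(a, b):
    """Look up the score for one aligned residue pair, in either order."""
    key = (a, b) if (a, b) in BLOSUM62_EXCERPT else (b, a)
    return BLOSUM62_EXCERPT[key]  # KeyError if the pair is not in the excerpt


def ungapped_score(seq1, seq2):
    """Sum the pair scores of two equal-length, gap-free aligned segments."""
    if len(seq1) != len(seq2):
        raise ValueError("segments must have the same length")
    return sum(pair_score(a, b) for a, b in zip(seq1, seq2))


if __name__ == "__main__":
    # Hypothetical peptides: two identities and three conservative changes.
    print(ungapped_score("AWKLD", "AWRIE"))
```

Identical rare residues such as tryptophan (W) score much higher than identical common ones such as alanine (A), which is why the matrix choice matters for the final alignment score.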

Page 25: Homology assessment and molecular sequence alignment.

BLAST

Page 26: Homology assessment and molecular sequence alignment.

The nuts and bolts

1. Gene/protein of interest

2. Homolog search

3. Sequence alignment

4. Tree building

Figure from http://www.usingneuralnetworks.com/images/Face_But_Not_The_Name_Cartoon.jpg

Page 27: Homology assessment and molecular sequence alignment.

HomoloGene

Page 28: Homology assessment and molecular sequence alignment.

Aligning Genes

Page 29: Homology assessment and molecular sequence alignment.

HomoloGene scoring

Page 30: Homology assessment and molecular sequence alignment.

The nuts and bolts

1. Gene/protein of interest

2. Homolog search

3. Sequence alignment

4. Tree building

Figure from http://www.usingneuralnetworks.com/images/Face_But_Not_The_Name_Cartoon.jpg

Page 31: Homology assessment and molecular sequence alignment.

Pairwise alignment

• Algorithm: Needleman-Wunsch dynamic programming

• Global alignment

• DNA, protein

• Find positional primary homology

• Sites that use N-W: EBI server

Figure from http://ww2.cs.fsu.edu/~hui/research/scanalyze_tutorial/pics/registered_group.jpg

Page 32: Homology assessment and molecular sequence alignment.

Needleman-Wunsch algorithm

Figure from Journal of Medical Physics 39 (2006) pg 29
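The dynamic programming idea behind Needleman-Wunsch can be sketched in a few lines: fill a score table in which each cell is the best of a match/mismatch step or a gap in either sequence, then trace back from the bottom-right corner to recover one optimal global alignment. The scoring values below (match +1, mismatch -1, gap -1) are assumptions chosen for illustration, not the settings of any particular server, and real tools typically use a substitution matrix and affine gap penalties instead.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Return (score, aligned_a, aligned_b) for one optimal global alignment."""
    n, m = len(a), len(b)

    def sub(i, j):
        # Score for aligning a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    # score[i][j] = best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # align the two residues
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right cell to the top-left cell.
    top, bottom = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            top.append(a[i - 1])
            bottom.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            top.append(a[i - 1])
            bottom.append("-")
            i -= 1
        else:
            top.append("-")
            bottom.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(top)), "".join(reversed(bottom))


if __name__ == "__main__":
    best, row1, row2 = needleman_wunsch("ACGT", "AGT")
    print(best)
    print(row1)
    print(row2)
```

Each column of the returned alignment is a hypothesis of positional homology between the two sequences. When several alignments tie for the best score, this traceback returns only one of them.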

Page 33: Homology assessment and molecular sequence alignment.

Needleman-Wunsch algorithm

Page 34: Homology assessment and molecular sequence alignment.

BLAST align

Page 35: Homology assessment and molecular sequence alignment.

Multiple sequence alignment

• Example: ClustalW

• Progressive alignment

• Nucleotide and protein sequences

• Local or Global alignment

• Sites that use MSA: EBI, DDBJ, PBIL, EMBNet, GenomeNet

Figure from www.cs.umbc.edu
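Progressive alignment builds a multiple alignment by first estimating how similar each pair of sequences is, then adding sequences in order of similarity, closest first. The sketch below covers only that ordering step, under simplifying assumptions: it uses an alignment-free k-mer distance and average-linkage clustering as stand-ins, whereas ClustalW itself derives distances from pairwise alignments and builds a neighbor-joining guide tree. The sequences and the function names are hypothetical.

```python
from itertools import combinations


def kmer_distance(s1, s2, k=2):
    """Rough distance: 1 minus the fraction of shared k-mers.

    Assumes both sequences are at least k residues long.
    """
    kmers1 = {s1[i:i + k] for i in range(len(s1) - k + 1)}
    kmers2 = {s2[i:i + k] for i in range(len(s2) - k + 1)}
    return 1 - len(kmers1 & kmers2) / min(len(kmers1), len(kmers2))


def guide_order(seqs, k=2):
    """Return the sequence of cluster merges, closest clusters first."""
    dist = {
        frozenset((a, b)): kmer_distance(seqs[a], seqs[b], k)
        for a, b in combinations(seqs, 2)
    }

    def average(c1, c2):
        # Average-linkage distance between two clusters of sequence names.
        total = sum(dist[frozenset((a, b))] for a in c1 for b in c2)
        return total / (len(c1) * len(c2))

    clusters = [(name,) for name in seqs]
    merges = []
    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2), key=lambda pair: average(*pair))
        clusters.remove(c1)
        clusters.remove(c2)
        clusters.append(c1 + c2)
        merges.append((c1, c2))
    return merges


if __name__ == "__main__":
    # Hypothetical sequences: s1 and s2 are near-identical, s3 is unrelated.
    example = {"s1": "ACGTACGT", "s2": "ACGTACGA", "s3": "TTTTGGGG"}
    for step in guide_order(example):
        print(step)
```

In a full progressive aligner, each merge would be carried out as a pairwise alignment of sequences or of already-built sub-alignments. Because early merges are never revisited, errors made on the first pairs propagate into the final alignment.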

Page 36: Homology assessment and molecular sequence alignment.

ClustalW @ EBI

Page 37: Homology assessment and molecular sequence alignment.

The nuts and bolts

1. Gene/protein of interest

2. Homolog search

3. Sequence alignment

4. Tree building

Figure from http://www.usingneuralnetworks.com/images/Face_But_Not_The_Name_Cartoon.jpg

Page 38: Homology assessment and molecular sequence alignment.

SRD5A2 @ TreeFam

Page 39: Homology assessment and molecular sequence alignment.

Discussion

• Does sequence orthology relate to functional equivalence?

• Can paralogs be functionally related?

• Do unsequenced genomic regions affect the understanding of orthology and paralogy?

Figures from: http://www.ndpgenderequality.ie/images/cartoons/cartoon_large_intro.gif and http://www.faithmouse.com/cartoon567.jpg

Page 40: Homology assessment and molecular sequence alignment.

Pros Cons

Page 41: Homology assessment and molecular sequence alignment.

Pros and cons

• Best guess

• Parsimony

• Algorithms

• Speed vs accuracy

• Evolution vs religion

• Evolutionary history

• Find animal models

• Relation between structure and function

• Biological processes

Figure from: http://www.pbrainprojects.com/images/angel_devil.jpg

Page 42: Homology assessment and molecular sequence alignment.

Assumptions, assumptions, assumptions

• If Xs is true then the tree is true…

Figure from: http://www.gdargaud.net/Humor/Pics/string_theory.png

Page 43: Homology assessment and molecular sequence alignment.