Bioinformatics 2 - The University of Edinburgh › teaching › courses › bio2 › lectures...1...

Armstrong, 2007

Bioinformatics 2

From genomics & proteomics tobiological networks

Armstrong, 2007

• Briefly review functional genomics• Biological Networks in general• Genetic Networks• Briefly review proteomics• Protein Networks

Armstrong, 2007

protein-geneinteractions

protein-proteininteractions

PROTEOME

GENOME

Citrate Cycle

METABOLISM

Bio-chemicalreactions

Bio-Map

Slide from http://www.nd.edu/~networks/

Armstrong, 2007

Biological Profiling

• Microarrays– cDNA arrays– oligonucleotide arrays– whole genome arrays

• Proteomics– yeast two hybrid– PAGE techniques

Armstrong, 2007

Why microarrays?

• What genes are expressed in a tissue andhow does that tissue respond to one of anumber of factors:– change in physical environment– experience– pharmacological manipulation– influence of specific mutations

Armstrong, 2007

What do we actually get?

• A snap-shot of the mRNA profile in abiological sample

• With the correct experimental conditions wecan compare two situations

• Not all biological processes are regulatedthrough mRNA expression levels

Armstrong, 2007

What can we learn?

• Identify functionally related genes• Find promoter regions (common regulation)• Predict genetic interactions• If we change one variable a network of gene

responses should compensate• Homeostasis is a fundamental principle of biology

- almost all biological systems exist in a controlledstate of negative feedback.

Armstrong, 2007

The Transcriptome

• Microarrays work by revealing DNA-DNAbinding.

• Transcriptional activators also bind DNA• Spot genomic DNA onto glass slides• Label protein extracts• Hybridise to the genomic probes• Reveals domains that include promoter

regions

Armstrong, 2007

ChIP to ChipChromatin Immunoprecipitation to Microarray (Chip)

Protein-DNA interactionsde-novo prediction has many false positivesWhich DNA sites do actually bind a specific TF?Requires an antibody to the protein

Armstrong, 2007

http://proteomics.swmed.edu/chiptochip.htm

Armstrong, 2007

http://proteomics.swmed.edu/chiptochip.htm

Armstrong, 2007

Biological networks

Armstrong, 2007

Yeast protein networkNodes: proteinsLinks: physical interactions (binding)

P. Uetz, et al. Nature 403, 623-7 (2000).

Prot Interaction map

Slide from http://www.nd.edu/~networks/

Armstrong, 2007

Building networks…

• Biological Networks– Random networks– Scale free networks– Small worlds

• Metabolic Networks• Proteomic Networks• The Mammalian Synapse• Other synapse models?

Armstrong, 2007

Biological Networks

• Genes - act in cascades• Proteins - form functional complexes• Metabolism - formed from enzymes and substrates• The CNS - neurons act in functional networks• Epidemiology - mechanics of disease spread• Social networks - interactions between individuals

in a population• Food Chains

Armstrong, 2007

Protein Interactions

• Individual Proteins form functionalcomplexes

• These complexes are semi-redundant• The individual proteins are sparsely

connected• The networks can be represented and

analysed as an undirected graph

Armstrong, 2007

Large scaleorganisation

– Networks in biology generally modeled usingclassic random network theory.

– Each pair of nodes is connected withprobability p

– Results in model where most nodes have thesame number of links <k>

– The probability of any number of links pernode is P(k)≈e-k

Armstrong, 2007

Non-biological networks

• Research into WWW, internet and humansocial networks observed different networkproperties– ‘Scale-free’ networks– P(k) follows a power law: P(k)≈k-γ

– Network is dominated by a small number ofhighly connected nodes - hubs

– These connect the other more sparselyconnected nodes

Armstrong, 2007

Internet-Map

the internet

Armstrong, 2007

Small worlds

• General feature of scale-free networks– any two nodes can be connected by a relatively

short path– average between any two people is around 6

• What about SARS???– 19 clicks takes you from any page to any other

on the internet.

Armstrong, 2007

Biological organisation

• Pioneering work by Oltvai and Barabasi• Systematically examined the metabolic

pathways in 43 organisms• Used the WIT database

– ‘what is there’ database– http://wit.mcs.anl.gov/WIT2/– Genomics of metabolic pathways

Jeong et al., 2000 The large-scale organisation of metabolicnetworks. Nature 407, 651-654

Armstrong, 2007

Using metabolic substrates asnodes

archae

all 43eukaryote

bacteria

=scale free!!!

Armstrong, 2007

Random mutations in metabolicnetworks

• Simulate the effect of random mutations ormutations targeted towards hub nodes.– Measure network diameter– Sensitive to hub attack– Robust to random

Armstrong, 2007

Consequences for scale freenetworks

• Removal of highly connected hubs leads to rapid increasein network diameter– Rapid degeneration into isolated clusters– Isolate clusters = loss of functionality

• Random mutations usually hit non hub nodes– therefore robust

• Redundant connectivity (many more paths between nodes)

Armstrong, 2007

Network Motifs

• Do all types of connections exist innetworks?

• Milo et al studied the transcriptionalregulatory networks in yeast and E.Coli.

• Calculated all the three and four genecombinations possible and looked at theirfrequency

Armstrong, 2007

Milo et al. 2002 Network Motifs: Simple Building Blocks of ComplexNetworks. Science 298: 824-827

Biological Networks

Three node possibilities

Armstrong, 2007

Gene sub networks

Heavy bias in both yeast and E.coli towards these two subnetwork architectures

Armstrong, 2007

Gene networks and network inference

Armstrong, 2007

What is a gene network

• Genes do not act alone.• Gene products interact with other genes

– Inhibitors– Promoters

• The nature of genetic interactions in complex– Takes time– Can be binary, linear, stochastic etc– Can involve many different genes

Armstrong, 2007

What makes boys boys and girls girls?

Sugar, Spice and synthetic Oestrogens?

Armstrong, 2007

Sex determination: a genecascade (in flies…)

RuntSisterlessScute

DaughterlessDeadpanExtramachrochaete

6 Genes detect X:A ratioFemales Males

Armstrong, 2007

Sex determination(in flies…)

RuntSisterlessScute

6 Genes regulate ‘Sexlethal’

Sexlethal

+ effect

- effect

Armstrong, 2007

RuntSisterlessScute

Sexlethal can then regulate itself...

Sexlethal

Armstrong, 2007

Downstream cascade builds...

RuntSisterlessScute

Sexlethal transformer doublesex

Armstrong, 2007

Gene expression and time1 Runt2 Sisterless3 Scute

4 Daughterless5 Deadpan6 Extramachrochaete

7 Sexlethal 8 transformer 9 doublesex

Armstrong, 2007

Gene microarrays

Armstrong, 2007

Gene Network Inference

• Gene micro-array data• Learning from micro-array data• Unsupervised Methods• Supervised Methods• Edinburgh Methods

Armstrong, 2007

Gene Network Inference

• Gene micro-array data– Time Series array data– Tests under ranges of conditions

• Unlike example - 1000s genes• Lots of noise• Clustering would group many of these genes

together• Aim: To infer as much of the network as possible

Armstrong, 2007

Learning from Gene arrays

• Big growth industry but difficult problem• Initial attempts based on unsupervised

methods:– Basic clustering analysis - related genes– Principal Component Analysis– Self Organising Maps– Bayesian Networks

Armstrong, 2007

Bayesian ‘gene’ networks

• Developed by Nir Friedman and Dana Pe’er• Can be easily adapted to a supervised

method

Armstrong, 2007

Learning Gene Networks

• The field is generally moving towards moresupervised methods:– Bayesian networks can use priors– Support Vector machines– Neural Networks– Decision Trees

Armstrong, 2007

• Scale free architecture– Chance of new edges is proportional to existing ones– Highly connected nodes may well be known to be

lethal

• Network motifs– Constrain the types of sub networks

• Prior Knowledge– Many sub networks already known

Can we combine networkknowledge with gene inference?

Armstrong, 2007

Conclusions

• Gene network analysis is a big growth area• Several promising fields starting to converge

– Complex systems analysis– Using prior knowledge– Application of advance machine learning algorithms– AI approaches show promise

Bioinformatics 2 - The University of Edinburgh › teaching › courses › bio2 › lectures...1...

Documents

Transcript of Bioinformatics 2 - The University of Edinburgh › teaching › courses › bio2 › lectures...1...

Statistical Bioinformatics Genomics Transcriptomics Proteomics Systems Biology.

bioinformatics for proteomics - GitHub Pages

Bio2 · 2006-01-25 · 1 Armstrong, 2006 Bioinformatics 2 Bio2 Lecture 3 Heuristics, Databases ; Multiple Sequence Alignment Armstrong, 2006 Bioinformatics 2 Heuristic Methods •FASTA

Bio2 - The University of Edinburgh · 1 Armstrong, 2007 Bioinformatics 2 Bio2 Heuristics, Databases ; Multiple Sequence Alignment ; Gene Finding Armstrong, 2007 Bioinformatics 2 Biological

Proteomics & Bioinformatics Part II...Bioinformatics Part II David Wishart 3-41 Athabasca Hal david.wishart@ualberta.ca 3 Kinds of Proteomics* • Structural Proteomics – High throughput

Bioinformatics 2 › teaching › courses › bio2 › ... · •‘wet lab’ and bioinformatics projects •Office in FH, Lab in HRB Armstrong, 2008 Bio2 lecture 1 The class (2008)

10. Standards in Proteomics MS bioinformatics analysis for proteomics Salvador Martínez de Bartolomé smartinez@proteored.org Bioinformatics support – ProteoRed.

Introduction to the Proteomics Bioinformatics Course 2016

Bio2 - The University of Edinburgh · Bio2 Lecture 3 Heuristics, Databases ; Multiple Sequence Alignment Armstrong, 2006 Bioinformatics 2 Heuristic Methods •FASTA •BLAST •Gapped

Proteomics & Bioinformatics: A Canadian Perspective

Journal of Proteomics & Bioinformatics Open Access of Proteomics & Bioinformatics - Open Access ... (DTX) and non-DTX or GSM Basic phases. ... Journal of Proteomics & Bioinformatics

Databases in bioinformatics II - Göteborgs universitetbio.lundberg.gu.se/courses/ht08/bio2/DB2_2008_toprint.pdfDatabases in bioinformatics 13 Sequencing methods 1954 Whitfeld PR.

Proteomics & Bioinformatics 2

Austin Journal of Proteomics, Bioinformatics Austin ...

Proteomics myriad Bioinformatics Industrial Applications in High Throughput Proteomics Bioinformatics Industrial Applications in High Throughput Proteomics.

Bioinformatics Proteomics Lecture 4Bioinformatics −Proteomics Lecture 4 Prof. Lászl óPoppe BME Department of Organic Chemistry and Technology Bioinformatics –Proteomics Lecture

DAILY PROGRAMME - HUPO€¦ · Drug & Biopharmaceutical Proteomics Methods, Systems & Bioinformatics Disease & Clinical Proteomics Precision & Personalised Proteomics Human Proteome

Journal of Proteomics & Bioinformatics - Open Access · 2019. 6. 24. · Journal of Proteomics & Bioinformatics - Open Access Research Article JPB/Vol.1/April 2 008 J Proteomics Bioinform

Proteomics Bioinformatics & Protein Structural Analysis ...oscar.iitb.ac.in/onsiteDocumentsDirectory/650... · Bioinformatics & Protein Structural Analysis Proteomics. The protein

Journal of Proteomics Bioinformatics - OMICS International · Journal of Proteomics & Bioinformatics encourages and also depends upon data interpretation, efforts and analysis of