Biological background: Gene Expression and Molecular Laboratory Techniques Class web site: ...

47
Biological background: Gene Expression and Molecular Laboratory Techniques Class web site: http://statwww.epfl.ch/davison/teaching/Microarr ays/ Statistics for Microarrays
  • date post

    19-Dec-2015
  • Category

    Documents

  • view

    216
  • download

    1

Transcript of Biological background: Gene Expression and Molecular Laboratory Techniques Class web site: ...

Page 1: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Biological background: Gene Expression and Molecular

Laboratory Techniques

Class web site: http://statwww.epfl.ch/davison/teaching/Microarrays/

Statistics for Microarrays

Page 2: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Basic principles in physics, chemistry and biology

Principles Known?

Physics

Matter

Chemistry

Compound

Biology

Organism

ElementaryParticles

Yes

Elements

Yes

Genes

No

Page 3: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Central Paradigm

Page 4: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

(RT)

Page 5: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.
Page 6: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Protein Synthesis

Page 7: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Transcription

• Transcription is a complex process involving several steps and many proteins (enzymes)

• RNA polymerase synthesizes a single strand of RNA against the DNA template strand (anti-sense strand), adding nucleotides to the 3’ end of the RNA chain

• Initiation is regulated by transcription factors, including promoters, usually an initiator element and TATA box, usually lying just upstream (at the 5’ end) of the coding region

• 3’ end cleaved at AAUAAA, poly-A tail added

Page 8: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Exons and Introns

• Most of the genome consists of non-coding regions

• Some non-coding regions (centromeres and telomeres) may have specific chomosomal functions

• Other non-coding regions have regulatory purposes

• Non-coding, non-functional DNA often called junk DNA, but may have some effect on biological functions

• The terms exon and intron refer to coding and non-coding DNA, respectively

Page 9: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Intron Splicing

Page 10: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Transcription Overview

Page 11: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.
Page 12: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Transcription Illustration

Page 13: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Translation

• The AUG start codon is recognized by methionyl-tRNAi

Met

• Once the start codon has been identified, the ribosome incorporates amino acids into a polypeptide chain

• RNA is decoded by tRNA (transfer RNA) molecules, which each transport specific amino acids to the growing chain

• Translation ends when a stop codon (UAA, UAG, UGA) is reached

Page 14: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Translation Illustrated

Page 15: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

From Primary Transcript to Protein

Page 16: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Alternative Splicing (of Exons)

• How is it possible that there are over 1,000,000 human antibodies when there are only about 30,000 genes?

• Alternative splicing refers to the different ways the exons of a gene may be combined, producing different forms of proteins within the same gene-coding region

• Alternative pre-mRNA splicing is an important mechanism for regulating gene expression in higher eukaryotes

Page 17: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.
Page 18: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Molecular Laboratory Techniques

• Hybridizing DNA

• Copying DNA

• Cutting DNA

• Probing DNA

Page 19: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Hybridization

• Hybridization exploits a potent feature of the DNA duplex – the sequence complementarity of the two strands

• Remarkably, DNA can reassemble with perfect fidelity from separated strands

• Strands can be separated (denatured) by heating

Page 20: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.
Page 21: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Polymerase Chain Reaction (PCR)

• PCR is used to amplify (copy) specific DNA sequences in a complex mixture when the ends of the sequence are known

• Source DNA is denatured into single strands • Two synthetic oligonucleotides complementary

to the 3’ ends of the segment of interest are added in great excess to the denatured DNA, then the temperature is lowered

• The genomic DNA remains denatured, because the complementary strands are at too low a concentration to encounter each other during the period of incubation, but the specific oligonucleotides hybridize with their complementary sequences in the genomic DNA

Page 22: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

PCR, ctd

• The hybridized oligos then serve as primers for DNA synthesis, which begins upon addition of a supply of nucleotides and a temperature resistant polymerase such as Taq polymerase, from Thermus aquaticus (a bacterium that lives in hot springs)

• Taq polymerase extends the primers at temperatures up to 72˚C

• When synthesis is complete, the whole mixture is heated further (to 95˚C) to melt the newly formed duplexes

• Repeated cycles (25—30) of synthesis (cooling) and melting (heating) quickly provide many DNA copies

Page 23: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.
Page 24: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

(BREAK)

Page 25: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Types of Viruses

Reverse transcriptase makes a complementary DNA copy from RNA.

A virus is a nucleic acid in a protein coat.

Page 26: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Reverse transcription

Clone cDNA strands, complementary to the mRNA

G U A A U C C U C

Reverse transcriptase

mRNA

cDNA

C A T T A G G A G C A T T A G G A G C A T T A G G A G C A T T A G G A G

T T A G G A G

C A T T A G G A G C A T T A G G A G C A T T A G G A G

C A T T A G G A G

C A T T A G G A G

Page 27: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

RT-PCR

Page 28: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Restriction Enzymes Cut DNA

Page 29: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Restriction Enzymes

• When a bacterium is invaded by a DNA-containing organism (e.g. virus), it can defend itself with restriction enzymes (REs; also called restriction endonucleases)

• REs recognize a specific short sequence of DNA and cut both strands

• The recognition sequence is typically a palindrome – i.e. the sequence in one strand is the same as in the other, read in the other direction (e.g. GAATTC)

• REs named after the bacteria in which they occur, plus sequence number (e.g. Eco RI)

Page 30: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

RE Example (Eco RI)

(cut)

5’ – GAATTC – 3’

3’ – CTTAAG – 5’

(cut)

Page 31: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Probing DNA

• One way to study a specific DNA fragment within a genome is to probe for the sequence of the fragment

• A probe is a labeled (usually radioactive or fluorescent) single-stranded oligonucleotide, synthesized to be complementary to the sequence of interest – probe sequence is known

• Attach single-stranded DNA to a membrane (or other solid support) and incubate with the probe so that it hybridizes

• Visualize the probe (e.g. by X-ray for radioactive probes)

Page 32: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

The Southern blotting technique

Page 33: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Sample Autoradiogragh (Gel)

Page 34: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Types of Blots

• Southern Blot – use DNA to probe DNA

• Northern Blot – use DNA to probe RNA

• Western Blot – use antibodies to probe

Protein

Page 35: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein would be more direct, but is currently harder.

Measuring Gene Expression

Page 36: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Microarrays provide a means to measure gene expression

Page 37: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Areas Being Studied with Microarrays

• Differential gene expression between two (or more) sample types

• Similar gene expression across treatments• Tumor sub-class identification using gene

expression profiles• Classification of malignancies into known

classes• Identification of “marker” genes that

characterize different tumor classes• Identification of genes associated with clinical

outcomes (e.g. survival)

Page 38: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

cDNA microarray experiments

mRNA levels compared in many different contexts

• Different tissues, same organism (brain v. liver) • Same tissue, same organism (ttt v. ctl, tumor v. non-

tumor) • Same tissue, different organisms (wt v. ko, tg, or

mutant)

• Time course experiments (effect of ttt, development)

• Other special designs (e.g. to detect spatial patterns).

Page 40: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Yeast genome on a chip

Page 41: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Brief outline of steps for producing a microarray

• cDNA probes attached or synthesized to solid support

• Hybridize targets

• Scan array

Page 42: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

cDNA microarrays

cDNA clones

Page 43: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

cDNA microarrays

Compare the genetic expression in two samples of cells

PRINTcDNA from one gene on each spot

SAMPLEScDNA labelled red/green

e.g. treatment / control

normal / tumor tissue

Page 44: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

HYBRIDIZE

Add equal amounts of labelled cDNA samples to microarray.

SCAN

Laser Detector

Page 45: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Quantification of expression

For each spot on the slide we calculate

Red intensity = Rfg - Rbg

(fg = foreground, bg = background) and

Green intensity = Gfg - Gbg

and combine them in the log (base 2) ratio

Log2( Red intensity / Green intensity)

Page 46: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Gene Expression Data On p genes for n slides: p is O(10,000), n is

O(10-100), but growing,

Genes

Slides

Gene expression level of gene 5 in slide 4

= Log2( Red intensity / Green intensity)

slide 1 slide 2 slide 3 slide 4 slide 5 …

1 0.46 0.30 0.80 1.51 0.90 ...2 -0.10 0.49 0.24 0.06 0.46 ...3 0.15 0.74 0.04 0.10 0.20 ...4 -0.45 -1.03 -0.79 -0.56 -0.32 ...5 -0.06 1.06 1.35 1.09 -1.09 ...

These values are conventionally displayed on a red (>0) yellow (0) green (<0) scale.

Page 47: Biological background: Gene Expression and Molecular Laboratory Techniques Class web site:  Statistics.

Biological questionDifferentially expressed genesSample class prediction etc.

Testing

Biological verification and interpretation

Microarray experiment

Estimation

Experimental design

Image analysis

Normalization

Clustering Discrimination

R, G

16-bit TIFF files

(Rfg, Rbg), (Gfg, Gbg)