Post on 24-Dec-2015

Making Sense of the ENCODE Project (ENCyclopedia Of DNA Elements) Data

Ansuman Chattopadhyay, PhD
Head, Molecular Biology Information Services
Health Sciences Library System
University of Pittsburgh
ansuman@pitt.edu

Topics

- Introduction
- Gene Regulation
- Epigenetics
- ENCODE Project
- Pluses and minuses
- UCSC ENCODE Browser
- Noteworthy Tools: RegulomeDB, NCBI Epigenomics, Genboree

Topics

- retrieve promoter sequences
- determine transcription factor occupancy
- browse through the epigenetic biochemical markers (histone modifications, DNA methylation, etc.)
- predict the location of enhancers, silencers and promoters

INTRODUCTION

Genomic achievements since the Human Genome Project

http://www.hsls.pitt.edu/guides/genetics

DNA Sequencing Cost

http://www.hsls.pitt.edu/molbio

Progress in Genomics (Technology, Time, Cost)

Year    Time          Cost
1990    6-8 years     1B
2003    3-4 months    10-50 M
2013    2-3 days      4-6 K

Source: Eric Green; HGP10 Symposium

Genome Biology: Time Line

1976: RNA Bacteriophage MS2
1995: Haemophilus influenzae
1996: Yeast
1998: C. elegans
2001: Human Genome Draft Seq
2002: Drosophila
2003: Published Complete Human Ref Genome
2007: Diploid Genome Seq of an Individual Human
2008: Jim Watson Genome
2011: Published Complete Genomes: 1863 organisms

Big Data Biology

Small Science: Single Gene, Single Protein, Single Lab

Big Science: Multi Gene, System Wide, High Throughput, Multi Institution

ENCODE: Epigenome and Encyclopedia of DNA Elements Project

An excellent movie on transcription

http://vcell.ndsu.edu/animations/transcription/index.htm

Promoter, Enhancer and Silencer

Source: http://www.cbs.dtu.dk/dtucourse/cookbooks/dave/Lekt03bkg.html

Retrieve promoter sequence for a gene

UCSC Genome Browser

http://genome.ucsc.edu/cgi-bin/hgGateway

Gene of Interest

EGFR BDNF
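As a rough illustration of what "retrieving a promoter sequence" involves, the sketch below computes a promoter window around a transcription start site (TSS) and builds a request URL for the UCSC REST API sequence endpoint. The window sizes (1000 bases upstream, 100 downstream), the function names, and the TSS coordinate in the usage note are assumptions chosen for illustration; they are not taken from the slides, and the real TSS of EGFR or BDNF should be looked up in the browser.

```python
from urllib.parse import urlencode

# Translation table used to complement DNA bases.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def promoter_window(tss, strand, upstream=1000, downstream=100):
    """Return (start, end) in 0-based, half-open coordinates.

    tss is the 0-based position of the first transcribed base.
    The default window sizes are illustrative assumptions.
    """
    if strand == "+":
        start, end = tss - upstream, tss + downstream
    elif strand == "-":
        # On the minus strand, "upstream" lies at higher coordinates.
        start, end = tss - downstream + 1, tss + upstream + 1
    else:
        raise ValueError("strand must be '+' or '-'")
    return max(start, 0), end


def reverse_complement(seq):
    """Reverse-complement a sequence (needed for minus-strand genes)."""
    return seq.translate(COMPLEMENT)[::-1]


def ucsc_sequence_url(genome, chrom, start, end):
    """Build a URL for the UCSC REST API getData/sequence endpoint."""
    query = urlencode(
        {"genome": genome, "chrom": chrom, "start": start, "end": end}
    )
    return "https://api.genome.ucsc.edu/getData/sequence?" + query
```

For a hypothetical plus-strand TSS at position 5,000,000, `promoter_window(5_000_000, "+")` gives the window to pass to `ucsc_sequence_url("hg38", "chr7", start, end)`. For a minus-strand gene, the returned DNA would additionally be passed through `reverse_complement` so that it reads in the direction of transcription.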

BIOBASE TransPro

Promoter Sequence

Generic Promoter Seq:
- UCSC Genome Browser

Human Curated Promoter Seq:
- Biobase TransPro
- CSH TRED
- Eukaryotic Promoter Database (EPD)

Epigenome Data

Link to the video tutorial:
http://media.hsls.pitt.edu/media/clres2705/sequence.swf
http://media.hsls.pitt.edu/media/clres2705/sequence_2.swf

Resources

UCSC Genome Browser: http://genome.ucsc.edu/
NCBI Entrez Gene: http://www.ncbi.nlm.nih.gov/gene

Find sequence information for a gene

- genomic
- promoter
- intron-exon coordinates
- mRNA
- protein

Spatiotemporal gene expression

TP53

EGFR

A movie on regulated transcription

http://vcell.ndsu.edu/animations/regulatedtranscription/index.htm

Epigenetic mechanisms

Source: NCBI http://www.ncbi.nlm.nih.gov/books/NBK45788/#epi_sci_bkgrd.About_Epigenetics

Chromatin Immunoprecipitation Sequencing (ChIP-Seq)

Epigenetic Markers

Landmark Paper:

http://www.nature.com/ng/journal/v39/n3/full/ng1966.html

NCBI-Epigenomics

http://www.ncbi.nlm.nih.gov/epigenomics

Histone Modifications

http://goo.gl/GQ9V8

Encode Project

http://www.genome.gov/10005107

http://www.nature.com/encode/#/threads

ENCODE Data

- 30 papers
- 1640 data sets: a matrix of assay vs. cell types
- 74.7% of the genome is transcribed
- 56.1% is associated with modified histones
- 15.2% is found in open-chromatin areas
- 8.5% binds transcription factors
- 4.6% consists of methylated CpG dinucleotides
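Genome-coverage percentages of this kind are, in general, obtained by merging overlapping annotated intervals and dividing the covered bases by the genome length. The sketch below shows that calculation on toy data; it is a generic illustration under that assumption, not a reconstruction of how the ENCODE consortium computed its figures, and the function names are made up for this example.

```python
def merge_intervals(intervals):
    """Merge overlapping or touching (start, end) half-open intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps (or touches) the previous interval: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [(s, e) for s, e in merged]


def percent_covered(intervals, genome_length):
    """Percentage of a genome of the given length covered by the intervals."""
    covered = sum(end - start for start, end in merge_intervals(intervals))
    return 100.0 * covered / genome_length
```

With the toy intervals `[(0, 10), (5, 20), (30, 40)]` on a 100-base "genome", the first two intervals merge into `(0, 20)`, so 30 of 100 bases are covered. Merging first matters: summing the raw interval lengths would count the overlapping bases twice.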

ENCODE Project

http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001046

Encode Cell Types

http://genome.ucsc.edu/ENCODE/cellTypes.html

UCSC ENCODE BROWSER

Sec61g and EGFR

human chr7:54,801,956-55,305,954
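A region written in this browser notation can be parsed and turned into a UCSC Genome Browser link programmatically. The sketch below is a minimal illustration: the helper names are hypothetical, and the assembly passed as `db` (for example `hg19`) is an assumption, since the slide does not state which assembly the coordinates refer to.

```python
import re
from urllib.parse import urlencode

# Matches strings such as "chr7:54,801,956-55,305,954".
REGION_RE = re.compile(r"^(chr\w+):([\d,]+)-([\d,]+)$")


def parse_region(text):
    """Parse 'chrN:start-end' (commas allowed) into (chrom, start, end)."""
    match = REGION_RE.match(text.strip())
    if not match:
        raise ValueError(f"not a valid region: {text!r}")
    chrom = match.group(1)
    start = int(match.group(2).replace(",", ""))
    end = int(match.group(3).replace(",", ""))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def region_length(start, end):
    """Length in bases; browser positions are 1-based and inclusive."""
    return end - start + 1


def browser_url(db, chrom, start, end):
    """Build an hgTracks link; db is the assembly name (an assumption here)."""
    query = urlencode({"db": db, "position": f"{chrom}:{start}-{end}"})
    return "http://genome.ucsc.edu/cgi-bin/hgTracks?" + query
```

Parsing `"chr7:54,801,956-55,305,954"` yields `("chr7", 54801956, 55305954)`, a window of 503,999 bases, which is wide enough to show both genes named on the slide together with the sequence between them.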

http://goo.gl/QVsvN

EGFR and Sec61g

Role of the Sec61 translocon in EGF receptor trafficking to the nucleus and gene expression. Liao HJ, Carpenter G. Mol Biol Cell. 2007 Mar;18(3):1064-72. Epub 2007 Jan 10.

Link to the video tutorial:
http://media.hsls.pitt.edu/media/molbiovideos/encode1-ac0212.swf
http://media.hsls.pitt.edu/media/molbiovideos/encode2-ac0212.swf
http://media.hsls.pitt.edu/media/molbiovideos/encode3-ac0212.swf

Resource

UCSC Genome Browser: http://genome.ucsc.edu/

Identify promoter, enhancer and silencer sequences by browsing the epigenomic markers generated by the ENCODE project

Cell Lines

K562

NHLF

UCSC browser link-genes: http://goo.gl/QVsvN
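When browsing histone-mark tracks, a common rule of thumb is that H3K4me3 peaks mark promoters, H3K4me1 marks enhancers (with H3K27ac suggesting an active one), and H3K27me3 marks repressed chromatin. The sketch below encodes that heuristic as a toy classifier. The rules, labels, and function name are simplifying assumptions for illustration, not an ENCODE algorithm, and silencers in particular have no single agreed chromatin signature.

```python
def classify_region(marks):
    """Assign a rough label to a region from the histone marks seen there.

    marks is any iterable of mark names, e.g. ["H3K4me1", "H3K27ac"].
    The rules are a deliberately simplified heuristic.
    """
    marks = set(marks)
    if "H3K4me3" in marks:
        return "promoter-like"
    if "H3K4me1" in marks:
        if "H3K27ac" in marks:
            return "active enhancer-like"
        return "enhancer-like"
    if "H3K27me3" in marks:
        return "repressed"
    return "unclassified"
```

Because marks differ between cell types, the same region may receive different labels in K562 and NHLF. Comparing the two cell lines side by side in the browser is therefore more informative than a single call.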

Video Tutorials

Browse the region of human chromosome 7 part 1: http://media.hsls.pitt.edu/media/clres2705/ucsc_genes.swf

Browse the region of human chromosome 7 part 2: http://media.hsls.pitt.edu/media/clres2705/ucsc_snp.swf

NCBI Mapviewer: http://media.hsls.pitt.edu/media/clres2705/ncbimapviewer.swf

Place a mRNA or peptide sequence into the human genome: http://media.hsls.pitt.edu/media/clres2705/blat.swf

ENCODE Criticisms

ENCODE Summary

http://goo.gl/0IfZ9

Latest Paper

http://goo.gl/3rJC7

Noteworthy Tools

RegulomeDB, HaploReg and Genboree

http://goo.gl/jhBvS

http://goo.gl/oP5gj

Regulome

HaploReg

ENCODE Tutorials

http://www.genome.gov/27553901

NCBI Roadmap Epigenomics Page

Thank you! Any questions?

Ansuman Chattopadhyay
ansuman@pitt.edu
412-648-1297

http://www.hsls.pitt.edu/guides/genetics