UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11,...

36
UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010

Transcript of UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11,...

Page 1: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICSHIGH THROUGHPUT FACILITY

Institute for Genomics and BioinformaticsReview

October 11, 2010

Page 2: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Personnel• Director, Suzanne Sandmeyer, Ph.D.• Manager, Robert Chadwick, Ph.D.• Staff Research Associates

Seung Ah Chung, SRAIII

Valentina Ciobanu, SRAII

Sharmila Mallya, SRAII• Co-Director for Bioinformatics, Pierre Baldi,

Ph.D. • Staff postdoctoral fellow/programmer

Christophe Magnan, Ph.D.

Page 3: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

HT Processing and Affymetrix GeneChip Analysis

Hybridization Oven 640Fluidics Station 450 (3 each)GeneArray Scanner 7GArrayPlex/Beckman FX BioRobot

Page 4: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Illumina HiSeq Next Gen Sequencer

Illumina GAIIX Next Gen Sequencer

Cbot cluster StationPaired-end Module

MJ Research Tetrad ThermalcyclerAgilent Bioanalyzer 2100Covaris shearer

Page 5: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Genomics ServicesAffymetrix microarray analysisSample preparation and QCExpression analysisSNP and Copy Number VariationRe-sequencingChIP-Chip

Illumina HT DNA sequence analysisCovaris DNA shearingDe novo & Re-Sequencing of DNARNA-seq (Digital Expression)ChIP-seq (Chromatin sequencing)library preparation (coming soon)

Sample evaluationAgilent Bioanalyzer RNA and DNA analysisNanoDrop RNA & DNA quantificationqPCR of Illumina libraries

Page 6: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

WEB REDESIGN

Page 7: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITYGHTF Analysis PackagesBasic consulting on data analysis approaches (free) & perform defined data analysis routines (recharge basis)

Recharged access to users on 2 facility computers dedicated for data analysis with the following software:•Partek’s Genomic Suite•Affymetrix Expression Consol•Affymetrix Genotyping Console & Integrated Genome Browser•SAS JMP Genomics•Various Web-based & Freeware Application Solutions•Genomics Workbench from CLC Bio

Web based statistical program Cyber-T on Linux Server: http://genomics.biochem.uci.edu/genex/cybert/

Work with the Institute for Genomics and Bioinfomatics: http://www.igb.uci.edu/

Page 8: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Experimental design consultation

Investigator

Institute for Genomics and Bioinformatics

Pierre Baldi/Christophe MagnanRichard LathropPadhraic SmythXiaohui Xie

UCI Statistical CoreRobert NewcombeHal SternRita Petersen

CFCCC Bioinformatics Core

Chad GarnerJenny Wu

Experiment

Analysis

Page 9: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Fixed expenses:PersonnelTotal recharges $2009-2010: $586,4262010-2011 (est.): $698,085

Page 10: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Grants:NIH Shared Instrument Grant $500,000 (SBS PI; PB co-PI)

Campus cost sharing:Year 1: $425,000Year 2: $85,000 salariesYear 3-5 $100,000 salaries plus service contract

contribution

Stem Cell Center$360,000 toward HiSeq2000 Annual campus supportCFCCC: $35,000

Page 11: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 12: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

•llumina pipeline software for raw reads, quality scoring, and alignment (ELAND)

•Genomics Workbench can be found here: http://www.clcbio.com/

•MAQ information can be found here: http://maq.sourceforge.net/ •VELVET information can be found here: http://www.ebi.ac.uk/~zerbino/velvet/ •Bowtie information can be found here: http://bowtie-bio.sourceforge.net/tutorial.shtml •A discussion comparing some aligners can be found here:http://massgenomics.wordpress.com/2008/05/14/short-read-aligners-maq-eland-and-others/or http://www.sanger.ac.uk/Users/lh3/NGSalign.shtml

Page 13: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Financial Support

Affymetrix Use

CCSG

Charge Backs

InstitutionalSupport

Illumina Use

CC Members

Non-centerMembers

Non-centerMembers

Page 14: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

LeadershipSuzanne Sandmeyer, Ph.D., Biological Chemistry, DirectorRobert Chadwick, Ph.D., Manager GHTFPierre Baldi, Ph.D., Computer Science, Co-Director for Bioinformatics GHTFChad Garner, Ph.D., Director CFC Cancer Center Bioinformatics

Advisory BoardDan Mercola, M.D., Pathology, Director CFCCC Personal GenomicsHung Fan, Ph.D., Director, Cancer Research Institute, Dept. Molecular Biology and

BiochemistryAnthony Long, Ph.D., Dept. Ecology and Evolutionary BiologyPadhraic Smyth, Ph.D., Dept. Computer SciencesBogi Andersen, M.D., Dept. Medicine, Div. of EndocrinologyRobert Newcomb, Ph.D., Dept. Statistics Peter Donovan, Ph.D., Director, Stem Cell CenterLisa Dahm, Ph.D., Director, Clinical InformaticsKyoko Yokomori, Ph.D., Dept. Biological Chemistry

Page 15: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 16: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 17: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Reduced Cluster Density Variability Using cBot vs Cluster Station

Page 18: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITYImproved Library Quantitation by qPCR Reduces Cluster Generation Variability

Page 19: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITYGAIIX versus HiSeq 2000 at the Broad Institute

Page 20: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Accuracy of HiSeq 2000 at the Broad Institute

Page 21: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

NEBNext paired-end genomic library $350 for 10 libraries (primers not included)NuGen Encore paired-end genomic library $680 for 8 librariesEpicentre Nextera paired-end genomic library $625 for 5 librariesIllumina RNA-Seq library $3500 for 8 libraries (Superscript not included)Ambion/NuGen RNA-Seq library $1480 for 8 libraries Epicentre ScriptSeq RNA-Seq library $1200 for 6 libraries

In processNuGen Barcoded RNA-Seq Libraries $3200 for 4 libraries (8 barcodes x 4 RNA)Epicentre Barcoded ScriptSeq Libraries $1400 for 4 libraries (12 barcodes x for Paired End Genomic Libraries take 4 to 6 hours to makeRNA-Seq Libraries take 6 hours (Epicentre) to 3 partial days (Illumina) to make RNA

Library Methods Tested in the GHTF

Page 22: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITYStandard Curve : Standards from KAPA Library qPCR Kit

Efficiency of the standard qPCR reaction is 98%

Page 23: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

              

Sample Nanodrop   QPCR

  ng/ul   ng/ul

       

Nugen DNA lib: SM 65.81   48.77

       

Nugen DNA lib :VC 60.8   33.66

       

PhiX control     3.12

       

RNA seq Illumina 35.51   0.00007

       

RNA seq 1 Ambion_Nugen Lib kit 9.67   1.2

       

RNA seq 2 AmbionSecond cycle cDNA_Nugen lib kit 27.22   18.99

Comparison Between Nanodrop and qPCR Quantification

Note that by Nanodrop the Illumina RNA-Seq library gives a good yield but by qPCR the yield was very poor indicating the library is mostly primer artifacts.

Page 24: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 25: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 26: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 27: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 28: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 29: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 30: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 31: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 32: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 33: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

UCI GENOMICS

HIGH THROUGHPUT FACILITY

Page 34: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

* Whole genome re-sequencing and targeted re-sequencing of genomes of any size and type – from bacteria and viral to humans * De novo assembly of an unlimited number of reads, for genomes up to human size (3 billion bp) * SNP/DIP detection and identification of genomic rearrangements * Digital Gene Expression based on RNA-Seq, including a wide range of downstream gene expression analyses * Discovery of novel transcripts/exons * Ability to work with Expression Arrays and RNA-seq results at the same time, enabling comparison of results * Interactive views of assemblies and derived gene expression data * Small RNA-seq * Tag Profiling, including SAGE screen analyses * Chromatin immunoprecipitation sequencing (ChIP-seq) analysis * Peak finding and peak refinement * Graph and table of background distribution and false discovery rate * Peak table and annotations

Genomics Workbench (CLC Bio)

Page 35: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

The Broadcom Distributed Unified Cluster (BDUC) is, as the name suggests, a distributed group of clusters unified by running under a single Sun Grid Engine (SGE) Resource Manager. BDUC consists of 2 subclusters of 40 2core AMD64 Opteron252/2.6GHz nodes running 64bit/CentOS 5. One group of 40 nodes is in the NACS Academic Data Center and another of 40 nodes is in the ICS data center, for a total of 80 nodes / 160 cores. The nodes are interconnected with 1Gb ethernet and have the MPICH and MPICH2 environments for parallel jobs.

BDUC

Page 36: UCI GENOMICS HIGH THROUGHPUT FACILITY Institute for Genomics and Bioinformatics Review October 11, 2010.

Sequence Capture Microarrays Can Be Used to Enrich Genomic DNA for Specific Regions of the Genome

Following enrichment and DNA amplification, the enriched DNA regions can be directly sequenced using next-gen sequencers.This can greatly expedite the discovery of causative genes ingenetic linkage association studies.