GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the...
-
Upload
carmella-byrd -
Category
Documents
-
view
217 -
download
0
Transcript of GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the...
![Page 1: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/1.jpg)
geWorkbench
John Watkinson
Columbia University
![Page 2: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/2.jpg)
geWorkbench
The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic and Cellular Networks (MAGNet).
Also, part of the NCI’s cancer Biomedical Informatics Grid (caBIG) initiative. The project was formerly called caWorkbench.
![Page 3: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/3.jpg)
geWorkbench (cont.)
A desktop application for integrative genomics.
Runs on Windows, Linux and Macintosh. Includes a variety of informatics tools, but
specializes in microarray analysis. Open-source and free for non-commercial
use. Includes an API for plugin development.
![Page 4: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/4.jpg)
![Page 5: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/5.jpg)
geWorkbench (cont.)
![Page 6: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/6.jpg)
Integrative Genomics
Increasingly, researchers need to combine several data sources (microarray assays, DNA/RNA/protein sequences, protein structure, gene ontology, clinical data, etc.)
geWorkbench attempts to move past simple microarray analysis to include integrative methods.
Plugin framework allows geWorkbench to interact with other major software packages, including BioConductor, GenePattern and Cytoscape.
![Page 7: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/7.jpg)
Data Support
Microarray assays (one-color and two-color, as well as caARRAY assays).
Sequence files. BLAST queries. Gene-Gene interaction networks
(Interactomes). Gene Ontology Terms. caBIO pathways and annotations. Protein structure files (PDB).
![Page 8: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/8.jpg)
Components
geWorkbench has a plugin interface for the development of 3rd-party components.
Documentation and developer support is available from the geWorkbench team.
All visualizations and analyses have been written using the API. Several groups at Columbia are developing for the platform.
![Page 9: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/9.jpg)
Microarray Analysis
Summarization of raw chip data (via BioConductor).
Normalization and Filtering. Differential expression analysis. Clustering (Hierarchical and Self-Organizing
Maps). Classification (SVM and SMLR). Many visualization tools.
![Page 10: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/10.jpg)
Hierarchical Clustering
![Page 11: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/11.jpg)
Scatter Plot Visualization
![Page 12: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/12.jpg)
caBIO Pathway Viewer
![Page 13: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/13.jpg)
Sequence Analysis
BLAST and HMM search interface. Pattern discovery. Synteny analysis. Promoter region analysis. A variety of sequence viewers.
![Page 14: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/14.jpg)
Pattern Discovery Viewer
![Page 15: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/15.jpg)
Promoter Viewer
![Page 16: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/16.jpg)
GO Term Enrichment
Traditional t-tests on microarray data determine differentially expressed genes between two different phenotypes.
Gene Ontology (GO) term enrichment can determine which functional or structural categories show significant differentiation.
Supported in geWorkbench’s GO Panel component. A similar technique can be applied to other gene
sets, such as KEGG pathways.
![Page 17: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/17.jpg)
GO Terms (cont.)
![Page 18: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/18.jpg)
Reverse Engineering
Microarray data can be used to infer biological pathways.
geWorkbench’s Reverse Engineering component uses the ARACNE algorithm to build gene-gene interaction networks.
These can be compared and combined with an online database of interactions, curated by Columbia.
![Page 19: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/19.jpg)
Reverse Engineering (cont.)
![Page 20: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/20.jpg)
Reverse Engineering (cont.)
![Page 21: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/21.jpg)
Matrix REDUCE
Given microarray data and upstream sequences for genes, transcription factor binding sites can be inferred.
The Matrix REDUCE component in geWorkbench provides this analysis and tools to visualize the results.
![Page 22: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/22.jpg)
For More Information
http://www.geworkbench.org Mailing List:
[email protected] John Watkinson: [email protected]
![Page 23: GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.](https://reader036.fdocuments.net/reader036/viewer/2022062804/5697bf731a28abf838c7edab/html5/thumbnails/23.jpg)
Acknowledgements
ARACNE algorithm by Califano et al. Matrix REDUCE algorithm by Bussemaker, et
al. geWorkbench team: Aris Floratos, Eileen
Daly, Kenneth Smith, Kiran Keshav, Xiaoqing Zhang, Manjunath Kustagi, Matthew Hall, Bernd Jagla, Mary VanGinhoven, John Watkinson.