BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas...
Transcript of BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas...
![Page 1: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/1.jpg)
Andrea de Souza
Director, Informatics, Data Analysis & Finance
Center for the Science of Therapeutics
May 29, 2013
BioAssay Research Database
![Page 2: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/2.jpg)
Direct Contributors
NIH Molecular Libraries – Glenn McFadden, Ajay Pillai
NIH Chemical Genomics Center – Chris Austin (PI), John Braisted, Marc
Ferrer, Rajarshi Guha, Ajit Jadhav, Dac-Trung Nguyen, Tyler Peryea, Noel
Southall, Henrike Veith
Broad Institute – Benjamin Alexander, Jacob Asiedu, Kay Aubrey, Joshua
Bittker, Steve Brudz, Simon Chatwin, Paul Clemons, Vlado Dancik, Siva
Dandapani, Andrea de Souza, Dan Durkin, David Lahr, Jeri Levine, Judy
McGloughlin, Phil Montgomery, Jose Perez, Stuart Schreiber (PI), Gil
Walzer, Xiaorong Xiang
University of New Mexico – Cristian Bologa, Steve Mathias, Tudor Oprea,
Larry Sklar (PI), Oleg Ursu, Anna Waller, Jeremy Yang
University of Miami – Saminda Abeyruwan, Hande Küküc, Vance
Lemmon, Ahsan Mir, Magdalena Przydzial, Kunie Sakurai, Stephan
Schürer, Uma Vempati, Ubbo Visser
Vanderbilt University – Eric Dawson, Bill Graham, Craig Lindsley (PI),
Shaun Stauffer
Sanford-Burnham Medical Research Institute – “T.C.” Chung, Jena
Diwan, Michael Hedrick, Gavin Magnuson, Siobhan Malany, Ian Pass,
Anthony Pinkerton, Derek Stonich, John Reed (PI)
Scripps Research Institute – Yasel Cruz, Mark Southern,
Hugh Rosen (PI)
![Page 3: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/3.jpg)
BARD: BioAssay Research Database
Mission: Enable biomedical researchers and cheminformatic scientists to effectively use MLP data to generate new hypotheses
• Unique collaboration amongst 7 NIH & academic centers
• Develop and adopt an Assay Definition Standard (ADS)
• Provide tools for assay registration, querying & visualization o Deploy predictive models o Foster new methods to interpret chemical biology data o Enable private data sharing
• Developed as an open-source, industrial-strength platform to support public translational research
![Page 4: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/4.jpg)
BARD: BioAssay Research Database
Mission: Enable biomedical researchers and cheminformatic
scientists to effectively use MLP data to generate new
hypotheses
Team Science
• Provide tools for assay registration and data querying &
visualization o Deploy predictive models
o Foster new methods to interpret chemical biology data
o Enable private data sharing
• Developed as an open-source, industrial-strength platform to
Research Data Management
Technology
Predictive Models
The BARD platform will support public translational research
![Page 5: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/5.jpg)
Research Data Management
The Value of Context
![Page 6: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/6.jpg)
The Value of Context
Research Data Management
![Page 7: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/7.jpg)
PubChem BioAssay
![Page 8: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/8.jpg)
PubChem BioAssay and BARD
structure the data
![Page 9: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/9.jpg)
PubChem BioAssay and BARD
PubChem BARD
Missing or fuzzy assay definitions,
experiments and project concepts
Introduce assay definitions,
experiments and projects
‘Column header’ centric with
concentration details embedded
Result types and concentrations as
experimental variables
Extensive use of unstructured text Transition to structured use of
common language
PubChem
MLP-BioAssay structure
the data
![Page 10: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/10.jpg)
Entrez
Uniprot
Gene Ontology Gene Ontology
Disease Ontology
BioAssay Ontology BioAssay Ontology BioAssay Ontology BioAssay Ontology
Unit Ontology
Uniprot Uniprot
Unit Ontology
BARD Dictionary & Term Hierarchy
Chemical Ontology
BARD Assay Definition Hierarchy
• Annotate all assays to a minimum standard
• Integrate and extend ontologies
• Enable assay registration
• Represent assays, results, experiments using ADS
• Exchange information in ADS via ADF
Structuring the Data
![Page 11: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/11.jpg)
BARD Technology Components
Define & Register
Assays Data Dictionary – std terms
Catalog of Assay Protocols
High Quality Data &
Result Deposition Calculations & Results
Project-experiment association
Query & Interpret
Information Intuitive Guided Queries
Cross Assay & SAR centric views
Advance applications
En
ab
le H
yp
oth
esis
Ge
ne
ratio
n
Novice Expert
![Page 12: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/12.jpg)
BARD Technology Components
Define & Register
Assays Data Dictionary – std terms
Catalog of Assay Protocols
High Quality Data &
Result Deposition Calculations & Results
Project-experiment association
Query & Interpret
Information Intuitive Guided Queries
Cross Assay & SAR centric views
Advance applications
En
ab
le H
yp
oth
esis
Ge
ne
ratio
n
Novice Expert
![Page 13: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/13.jpg)
Web Client
Filter on annotations, such as detection method type
Google-like searching of: 4,000+ assays, 35M+ compounds, 300+ projects
Save items of interest for further analysis
Amazon-like Query Cart
![Page 14: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/14.jpg)
Web Client - Project Specific Views
![Page 15: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/15.jpg)
Web Client – Probe Development Workflow
![Page 16: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/16.jpg)
Sunburst Visualization
Molecular activity against target classes
Target classifications from PantherDB
PANTHER in 2013: modeling the evolution of gene function,
and other gene attributes, in the context of phylogenetic trees.
Huaiyu Mi, Anushya Muruganujan and Paul D. Thomas
Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118
![Page 17: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/17.jpg)
Jersey
D3.js
Web Query & Desktop Clients Data Warehouse & REST API Catalog of Assay Protocols
Commercial License
MySQL support for CAP coming soon
As open source as possible
JGoodies
![Page 18: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/18.jpg)
Chemaxon Usage in BARD
UNM Promiscuity Plugin JChem for scaffold decomposition
REST API & Warehouse JChem for rendering structures and molecule fingerprint generation
http://bard.nih.gov/api/latest/compounds/6915727/image?s=200
http://bard.nih.gov/api/latest/compounds/?filter=n1cccc2ccccc12%5Bstructure%5D&type=sim&cutoff=0.9&expand=true
http://bard.nih.gov/api/latest/plugins/badapple/prom/cid/6915727?expand=true
![Page 19: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/19.jpg)
Chemaxon Usage in BARD
Web Query Client JChem for rendering structures
Desktop Client JChem for rendering structures, molecule import & export Marvin for drawing query structures
![Page 20: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/20.jpg)
• BioActivity Data Associative
Promiscuity Pattern Learning Engine
• Associations via scaffolds for chemical
space navigation
Example URI* description
<base>/badapple/prom/cid/752424
For compound with specified ID, return scaffold IDs and scores.
<base>/badapple/prom/cid/752424?expand=true
Additional statistics, scaffold smiles, and inDrug flag.
<base>/badapple/prom/scafid/233
For scaffold with specified ID, return statistics and smiles.
Predictive Models
![Page 21: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/21.jpg)
Predictive Models
• Predicts CYP450 isoforms
metabolism sites with 2D
structures
• Patrik Rydberg et. al
• Released under LGPL
• BARD plugin
– Summary HTML view
– Data view
![Page 22: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/22.jpg)
Navigating the Maze
![Page 23: BioAssay Research Database - · PDF fileHuaiyu Mi, Anushya Muruganujan and Paul D. Thomas Nucl. Acids Res. (2012) doi: 10.1093/nar/gks1118 . Jersey D3.js Data Warehouse & REST API](https://reader033.fdocuments.net/reader033/viewer/2022051320/5aad3f207f8b9a9c2e8dfa6a/html5/thumbnails/23.jpg)
Long-Term Path Forward
MLP
TBD
NCI-60
TBD
Datasets
CAP Web Query
Desktop APIs
Tools
BAD Apple
CYP450
TBD
TBD
Methods Data Analysis
Workflow 1
Workflow 2
Workflow 3
as a Platform
Sustained Community Engagement
ADS