Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia...
-
Upload
howard-hood -
Category
Documents
-
view
216 -
download
0
Transcript of Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia...
![Page 1: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/1.jpg)
Analysis of environmental genomes using Pathway Tools
Steven Hallam | University of British ColumbiaSRI International, 2013
![Page 2: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/2.jpg)
Overview
• Through the looking glass…
• Environmental Pathway/Genome Databases
• MetaPathways Pipeline Development
2
![Page 3: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/3.jpg)
Metabolism
• Metabolism, or the synthesis and decomposition of chemicals in a cell can be organized into pathways represented by graphs.
3
Vertex = chemical [substrate, product]
Edge = enzyme
![Page 4: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/4.jpg)
Cellular Pathways
• Our genetic and biochemical understanding of metabolism is based largely on the study of complete pathways within cells.
4
Genome Management Information System, Oak Ridge National Laboratory
![Page 5: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/5.jpg)
Distributed Pathways
• However, microbial communities form distributed metabolic pathways directing matter and energy exchange. 5
![Page 6: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/6.jpg)
Community Metabolism
• The goal is to predict and compare distributed pathways to better understand biogeochemical cycling and community metabolism in the environment.
6
![Page 7: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/7.jpg)
7
Predicting Community Metabolism
Plurality Sequencing Single-Cell Sequencing
Fragment Recruitment, SOM, PCA
Environmental PGDB (ePGDB)with Taxonomic Binning
Simulated ePGDB
![Page 8: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/8.jpg)
Metagenome Distributed Pathways
From Genomes to Biomes
8
Biogeochemical Cycles
• “The regulation of the pools and fluxes in biogeochemical cycles have their origins in the genetic inventory of individual microbes, and the regulation of these genes within the organism is determined by the environment. As such, one can look at the microbial food web as a collection of genomes whose expression and replication is coordinated through complex feedback loops at the organismal, population, and ecosystem level. “Chisholm
Falkowski et al., (2008) Science 320, 1034-1038
![Page 9: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/9.jpg)
Foundational Questions
• What is the taxonomic and functional structure of the ecosystem?
• How does this structure change in response to environmental perturbation?
• What are the ecological consequences of this change?
• What are relevant units of selection, conservation or utilization for ecological genomic resources?
9
![Page 10: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/10.jpg)
Overview
• Through the looking glass…
• Environmental Pathway/Genome Databases
• MetaPathways Pipeline Development
10
![Page 11: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/11.jpg)
Gene Products
Genes/ORFs
Genomic Map
11
Gene Products
Genes/ORFs
Genomic Map
Pathologic*
Compunds
Reactions
Genomic Map
Genes/ORFs
Gene Products
Pathways
PGDB
Organisms
Pathways
Reactions
Compounds
Inference of Metabolic Pathways
* Integrates genome and pathway data to identify putative metabolic networks
PGDB Navigator
![Page 12: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/12.jpg)
Pathway/Genome Navigator
*http://ecocyc.org/META/new-image?type=PATHWAY&object=GLYCOLYSIS
PGDB*
Homepage
Pathway Viewer
Evidence Glyph
Pathway Information Gene Information
MetaboliteEnzyme Found
Unique Enzyme
12
![Page 13: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/13.jpg)
Gene Products
Genes/ORFs
Genomic Map
13
Gene Products
Genes/ORFs
Genomic Map
Pathologic*
Compounds
Reactions
Genomic Map
Genes/ORFs
Gene Products
Pathways
ePGDB
Pathways
Reactions
Compounds
* Integrates genome and pathway data to identify putative distributed metabolic networks
???
Environmental PGDB
![Page 14: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/14.jpg)
14
ePGDB Navigation
![Page 15: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/15.jpg)
15
http://engcyc.org/
![Page 16: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/16.jpg)
Overview
• Through the looking glass…
• Environmental Pathway/Genome Databases
• MetaPathways Pipeline Development
16
![Page 17: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/17.jpg)
17
MetaPathways
• A modular pipeline for constructing Pathway/Genome Databases from environmental sequence information
• MetaPathways currently supports four “data products” including i) GenBank submission, ii) LCA, iii) MLTreeMap, and iv) ePGDBs with associated feature summary tables and GFF files
• MetaPathways externalizes compute-intensive processes onto a user defined cluster using Sun Grid Engine or the Amazon elastic cloud
![Page 18: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/18.jpg)
18
MetaPathways• ePGDBs facilitate pathway-centric
exploration of environmental sequence information using Pathway Tools and the MetaCyc web interface
• Provides inference-based approach to metabolic reconstruction based on explicit computational rules to predict presence or absence of distributed metabolic networks
• MetaPathways can be used with multi-molecular data sets (DNA, RNA or protein) sourced from cultured isolates, single-cells and natural or human engineered ecosystems
http://www.github.com/hallamlab/MetaPathways http://hallam.microbiology.ubc.ca/MetaPathways
![Page 19: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/19.jpg)
19
ePGDB Navigation
![Page 20: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/20.jpg)
20
ePGDB Validation
![Page 21: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/21.jpg)
21
EcoCyc Pathways
• The number of E. coli pathways identified using the MetaCyc blast database decreases with increasing blast score ratio (BSR) cut-off while the others stay relatively constant. From this an optimal BSR between 0.4-0.6 can be inferred.
![Page 22: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/22.jpg)
22
MetaSim Pathways
![Page 23: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/23.jpg)
23
Synthetic Ecology
• The pathway (S-adenosyl-L-methionine cycle II) was identified by Pathway Tools in the simulated metagenome based on the combined contribution of two genomes (a + b).
![Page 24: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/24.jpg)
24
Infering Trophic Interactions
• An ePGDB constructed for the Mealybug symbionts Tremblaya princeps and Moranella endobia predicted interpathway complementarity in essential amino acid biosynthetic pathways.
McCutcheon, J.P. and von Dohlen, C.D. “An interdependent metabolic patchwork in the nested symbiosis of mealybugs.” Current Biology, 2011, DOI: 10.1016/j.cub.2011.06.051
![Page 25: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/25.jpg)
25c1988-2012
Hawaii Ocean Time Series (HOT)
DeLong et al. Community Genomics Among Stratified Assemblages in the Ocean’s Interior. (2006) Science 311T. Danhorn, C. R. Young, E. F. Delong, Comparison of large-insert, small-insert and pyrosequencing libraries for metagenomic analysis, ISME J (2012), doi:10.1038/ismej.2012.35.
![Page 26: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/26.jpg)
26
Environmental Sequence Information
HOT Sample Depth (m) Description Information Sequencing
PlatformNumber of Sequences
Average Sequence
Length
Protein Coding
Sequences
Annotated Coding
SequencesMetaCyc Reactions
MetaCyc Pathways
25 upper euphotic DNA Roche 454 623559 257 405613 214149 4138 864
75 upper euphotic DNA Roche 454 673674 244 430689 222572 4052 854
110 chlorophyll max DNA Roche 454 473166 270 336035 165775 4133 860
500 mesopelagic DNA Roche 454 995747 276 714743 361193 4464 949
25 upper euphotic RNA Roche 454 561821 248 234404 85781 3433 723
75 upper euphotic RNA Roche 454 557718 239 203359 66855 3208 669
110 chlorophyll max RNA Roche 454 398436 228 135107 36912 2549 532
500 mesopelagic RNA Roche 454 479661 266 207465 71400 3034 641
• ePGDBs were generated for environmental sequence information (DNA and RNA) sourced from the HOT water column.
![Page 27: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/27.jpg)
27
Core Pathways
Top 50
![Page 28: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/28.jpg)
28
Cellular Overview
• Comparison of DNA (Blue) and RNA +DNA (Red) pathway predictions
![Page 29: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/29.jpg)
29
• Comparison of genetic potential and gene expression data in photic and dark ocean waters
Pathway Partitioning
![Page 30: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/30.jpg)
30
Diagnostic Pathways
![Page 31: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/31.jpg)
31
• For each depth interval, a small number of cryptic pathways were predicted in RNA that were not predicted in DNA data sets
• These pathways showed depth distributions consistent with niche-partitioning between sunlit and dark ocean waters
Cryptic Pathways
![Page 32: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/32.jpg)
32
Known Hazards
• Missing ATP citrate lyase indicates false positive for rTCA
![Page 33: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/33.jpg)
Things to Keep in Mind…
• Pathologic cannot predict pathways not present in MetaCyc
• Evidence for short pathways is hard to interpret
• False positives due to shared enzymes in multiple pathways or incorrect annotations create hazards
• Currently no taxonomic assignment or coverage information is mapped onto identified pathways
• Limited functional validation for pathways in metagenomes
33
![Page 34: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/34.jpg)
34
“One gene is many hypotheses”Anonymous
![Page 35: Analysis of environmental genomes using Pathway Tools Steven Hallam | University of British Columbia SRI International, 2013.](https://reader037.fdocuments.net/reader037/viewer/2022110206/56649d145503460f949e8cc4/html5/thumbnails/35.jpg)
Joint Genome Institute
Susannah TringeTijana Glavina del Rio
Institute for Ocean Sciences
Marie RobertRobin Brown
University of British Columbia
Pacific Northwest National Laboratory
Angela NorbeckLjiljana Pasa-TolicHeather Brewer
35
Sam KheirandishKishori Konwar
Keith MewisAntoine Page
Melanie ScofieldYoung Song
Nicole SukdeoJody Wright
Elena Zaikova
Maya BhatiaMonica Torres Beltran
Annie CoxEvan DurnoDiane FairlyEsther Geis
Alyse HawleyAria Hahn
Niels Hansen
SRI
Peter KarpTomer Altman