Informatics in the Manchester Centre for Integrative Systems Biology
-
Upload
carly-armstrong -
Category
Documents
-
view
21 -
download
0
description
Transcript of Informatics in the Manchester Centre for Integrative Systems Biology
![Page 1: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/1.jpg)
Informaticsin the
Manchester Centre for Integrative Systems Biology
Daniel Jameson, Neil SwainstonManchester Centre for Integrative Systems Biology
SysMO-DB Workshop – Connecting Models and Data, Berlin23 November 2009
![Page 2: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/2.jpg)
The MCISB
• Currently employs 9.5 multidisciplinary people– All share same office, lab
• Pioneer the development of new experimental and computational technologies in systems biology
• Develop an annotated, kinetic model of yeast metabolism
![Page 5: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/5.jpg)
Definition of the problem
• Experimentalists generate data• Modellers require data• How do we pass data from the experimentalist
to the modeller?
• Traditional method– Experimentalist analyses data, produces spreadsheet– Experimentalists e-mails spreadsheet to modeller– Modeller cuts-and-pastes data into modelling tool– Do the experimentalist and the modeller speak the
same language?
![Page 6: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/6.jpg)
Informatics challenges
• How do we map experimental data to models?– How do we know what data applies to what molecule
or reaction?– How do we identify molecules or reactions?
• (Same problem in merging models)
• Use names…?
![Page 7: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/7.jpg)
Computers don’t like names
…because they are non-unique / ambiguous / imprecise / etc.
![Page 8: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/8.jpg)
(3R,4R,5S,6S)-6-(hydroxymethyl)
oxane-2,3,4,5-tetrol
Biochemists like names a little too much…
GlucoseGlcAnhydrous dextrose
Cerelose 2001TraubenzuckerStaleydex 95M
![Page 9: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/9.jpg)
Solution
• Utilise unique, public identifiers for identifying molecules– Don’t re-invent your own…– Use ChEBI terms to uniquely identify metabolites– Use UniProt terms to uniquely identify enzyme
![Page 10: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/10.jpg)
![Page 11: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/11.jpg)
Solution
• Further advantage:• Using links into existing databases (ChEBI, UniProt)
provide additional information immediately• Chemical formulae, structures• Protein sequences, phosphorlyation sites, SNPs
• Use unique, public IDs
![Page 12: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/12.jpg)
But names are still important
• Names are for humans (human-ish)• Unique ids (e-mail addresses, bank account
numbers) are for computers (geek-ish)
• BOTH are needed
![Page 14: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/14.jpg)
Models
• Useful to have a standard to allow models to be shared / re-used• Use SBML• Very well developed / supported• Tool set increasing all the time
• Identifying metabolites / proteins in models?• Use MIRIAM standards• http://www.ebi.ac.uk/miriam/• Allows unique, public IDs to be embedded into SBML
as annotations (along with human-readable names)
![Page 15: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/15.jpg)
Models
• Genome-scale SBML model of yeast metabolism• Annotated model
– All >2000 molecules have unique database references– MIRIAM standards have been followed– Should be entirely unambiguous for third party users– Should be usable in third party tools– Should allow data to be imported “easily”
![Page 16: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/16.jpg)
![Page 17: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/17.jpg)
SBML annotation
<species id=”glc" name="D-Glucose">
<annotation>
<rdf:li rdf:resource="urn:miriam:obo.chebi:CHEBI:17634"/>
</annotation>
</species>
![Page 18: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/18.jpg)
Solution on the experimental side
• Ensure that unique identifiers are captured and associated with data at the time of the experiment– BUT… this is all a bit geek-ish for biologists
• So… generate intuitive tools to do this by stealth
![Page 20: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/20.jpg)
Project overview
Enzyme kineticsQuantitativemetabolomics
Quantitativeproteomics
SBML Model
Parameters(KM, Kcat)
Variables(metabolite, proteinconcentrations)
PRIDE XML MeMo SABIO-RK
Web serviceWeb serviceWeb service
MeMo-RK
Web service
![Page 21: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/21.jpg)
![Page 22: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/22.jpg)
![Page 23: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/23.jpg)
![Page 25: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/25.jpg)
But…
• …MCISB has to manage “only” three types of experiment• Proteomics, metabolomics, enzyme kinetics
• Informatics team share office with experimentalists and modellers
• We’ve been doing this for years…• Lots of time, lots of people, lots of resource• Infrastructure development is part of our remit
![Page 26: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/26.jpg)
And…
• …SYSMO projects are far more diverse
• Informatics team separated from experimentalists, who are separated from modellers
• Less informatics resource
• Heavyweight approach of MCISB (bespoke tools for each experiment) probably not applicable
![Page 27: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/27.jpg)
So…
• …lightweight approach may be more suitable
• Store only secondary data necessary for modelling• Not raw data
• Daniel…
![Page 28: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/28.jpg)
Einfach Klasse!
![Page 33: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/33.jpg)
Model construction
Input: list of ORFs
Output: SBML file
1. Get reaction info
3. Create species
2. Create compartments
4. Create reactions
Get
ann
otat
ions
![Page 34: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/34.jpg)
Model construction
![Page 35: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/35.jpg)
Model parameterisation
• Data requirements• SBML model• Starting concentrations for enzymes and source
metabolites• Key results database• Enzyme kinetics• SABIO-RK database web service
![Page 36: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/36.jpg)
SABIO-RK web service
![Page 37: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/37.jpg)
Model parameterisation
![Page 38: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/38.jpg)
Model calibration
• Data requirements• Parameterised SBML model• Experimental data• Metabolite concentrations from key results database• Calibration by COPASI web service
![Page 39: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/39.jpg)
COPASI web service
Design and Architecture of Web Services for Simulation of Biochemical Systems. Dada JO, Mendes P. Data Integration in the Life Sciences, Manchester, UK (2009).
![Page 40: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/40.jpg)
Model calibration
![Page 41: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/41.jpg)
Model simulation
• Using COPASI web service
![Page 42: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/42.jpg)
Conclusion
• Integrating experimental data with models is “easy” and can be automated– If we adopt some standards
• Data can be shared “easily” between groups– If we all adopt some standards
• Lightweight approach more achievable• Key Results Database
![Page 43: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/43.jpg)
Thanks…
![Page 44: Informatics in the Manchester Centre for Integrative Systems Biology](https://reader036.fdocuments.net/reader036/viewer/2022062517/56813507550346895d9c589d/html5/thumbnails/44.jpg)
Informaticsin the
Manchester Centre for Integrative Systems Biology
Daniel Jameson, Neil SwainstonManchester Centre for Integrative Systems Biology
SysMO-DB Workshop – Connecting Models and Data, Berlin23 November 2009