Cardiff School of Computer Science & Informatics
Biodiversity Informatics at COMSC
Andrew Jones & Richard White
School of Computer Science & Informatics
[email protected]@cs.cf.ac.uk
Richard White’s interests
• Design and construction of database systems to deliver biodiversity data
• Methods for making these systems
  – interoperable with other systems
  – adaptable for multiple uses
  – capable of following concept changes
    • deducing and maintaining information on changes
• (Extracting numerical information from images, e.g. in “Morphidas” project, not described here)
Premise
• Bioinformaticians want to use information about the species whose genetic material is being studied to understand their development
• Biodiversity scientists (including taxonomists, ecologists, etc.) want to use molecular data to enhance their classifications, phylogenies and models
Biodiversity informatics
Therefore
• Bioinformatic and biodiversity data need to be linked together in many analyses
• Links often involve the species name as the key linking element
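A minimal sketch of what "the species name as the key linking element" can look like in practice. The record layouts, field names and accession values are hypothetical and chosen only for illustration; real sources would need name normalisation first.

```python
# Hypothetical records: field names and values are illustrative only.
sequence_records = [
    {"species": "Vicia faba", "accession": "SEQ-001"},
    {"species": "Corylus avellana", "accession": "SEQ-002"},
]
occurrence_records = [
    {"species": "Vicia faba", "country": "GB"},
    {"species": "Vicia faba", "country": "FR"},
]


def link_by_species(sequences, occurrences):
    """Group both kinds of record under the species name they share."""
    linked = {}
    for rec in sequences:
        entry = linked.setdefault(rec["species"], {"sequences": [], "occurrences": []})
        entry["sequences"].append(rec["accession"])
    for rec in occurrences:
        entry = linked.setdefault(rec["species"], {"sequences": [], "occurrences": []})
        entry["occurrences"].append(rec["country"])
    return linked


linked = link_by_species(sequence_records, occurrence_records)
```

The join only works when both sources spell the name identically and mean the same taxon by it, which is why names, synonyms and taxon concepts matter.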
Species naming in a nutshell (Corylus avellana L.)
• Common (vernacular) names
• Latin descriptive phrases
• Linnaeus: binomial nomenclature
• Adanson: rules for precedence etc.
• Accepted names and synonyms
• Checklists (e.g. the Catalogue of Life …)
• Data (in different formats, e.g. Buffie …) is usually linked to species names
• Taxon concepts (including species and higher taxa such as genera, families, etc.)
• Tracking changes in taxon concepts …
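As a rough illustration of accepted names and synonyms in a checklist, the sketch below resolves any listed name to its accepted name. The checklist content, and in particular which name is treated as accepted, is an assumption made for the example and not a taxonomic statement.

```python
# Illustrative checklist: accepted name -> list of synonyms.
# Which name counts as "accepted" here is an assumption for the example.
CHECKLIST = {
    "Vicia faba": ["Faba faba"],
}


def build_index(checklist):
    """Map every name (accepted or synonym) to its accepted name."""
    index = {}
    for accepted, synonyms in checklist.items():
        index[accepted] = accepted
        for synonym in synonyms:
            index[synonym] = accepted
    return index


def resolve(name, index):
    """Return the accepted name for `name`, or None if it is not listed."""
    return index.get(name)


INDEX = build_index(CHECKLIST)
```

Data attached to a synonym can then be looked up under the accepted name, e.g. `resolve("Faba faba", INDEX)`.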
Species 2000 & ITIS
• International programme to assemble data from “Global Species Databases” (GSDs) and deliver the Catalogue of Life (CoL)
• Authoritative up-to-date checklist of all the world’s species (1.3 million out of 1.8 million)
• Reference list of taxon concepts (with unique identifiers) to aid indexing and cross-referencing of species data sources
• Available on DVD, through the Web (www.sp2000.org) and by using electronic (“web”) services
The Catalogue of Life
4D4Life project
• “Distributed Dynamic Diversity Databases for Life”, EU project 2009 – 2012
• Carry the Catalogue of Life forward with improved sustainable infrastructure
• In COMSC we are designing a new architecture and will deliver a working prototype
• Service-oriented, re-usable components
Re-usable components
1. GSD editors create a data resource “GSD1”
2. CoL partners create the Catalogue of Life from such resources
3. A user creates a new product using the Catalogue of Life
Interoperability
• Catalogue of Life
  – GSDs are heterogeneous in
    • Content
    • Access methods
• More generally
  – Multiple data representations & exchange formats
  – Changing concepts of taxa (and geography)
ENBI project and BUFFIE
• “European Network for Biodiversity Information”, EU project 2003-2006
• Mostly reporting on standards, practices and recommendations
• In COMSC, R. Sundaravadivelu developed a prototype interoperability demonstrator (BUFFIE, “Biodiversity Users Framework For Information Exchange”)
• Accepts data sources using different protocols and XML formats
• Provides a merged response in an XML format and protocol of the user’s choice
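The general idea of accepting differently formatted sources and returning a merged response in a format the user picks can be sketched as below. This is not BUFFIE's actual design: the two source formats, their element names and the two output layouts are invented for illustration, and transport protocols are left out entirely.

```python
import xml.etree.ElementTree as ET

# Two hypothetical source formats; element and attribute names are invented.
SOURCE_A = "<records><rec><name>Vicia faba</name><place>GB</place></rec></records>"
SOURCE_B = "<data><item species='Vicia faba' country='FR'/></data>"


def parse_a(xml_text):
    """Format A keeps values in child elements."""
    root = ET.fromstring(xml_text)
    return [{"species": r.findtext("name"), "country": r.findtext("place")}
            for r in root.findall("rec")]


def parse_b(xml_text):
    """Format B keeps values in attributes."""
    root = ET.fromstring(xml_text)
    return [{"species": i.get("species"), "country": i.get("country")}
            for i in root.findall("item")]


def render_elements(records):
    """Output layout 1: one child element per field."""
    root = ET.Element("merged")
    for rec in records:
        node = ET.SubElement(root, "record")
        for key, value in rec.items():
            ET.SubElement(node, key).text = value
    return ET.tostring(root, encoding="unicode")


def render_attributes(records):
    """Output layout 2: one attribute per field."""
    root = ET.Element("merged")
    for rec in records:
        ET.SubElement(root, "record", rec)
    return ET.tostring(root, encoding="unicode")


RENDERERS = {"elements": render_elements, "attributes": render_attributes}


def merged_response(output_format):
    """Normalise both sources to a common record shape, then render as requested."""
    records = parse_a(SOURCE_A) + parse_b(SOURCE_B)
    return RENDERERS[output_format](records)
```

Converting every source to one internal record shape means each new input or output format needs only one adapter, not one per pairing.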
THIS SLIDE INTENTIONALLY LEFT NOT QUITE BLANK
A world of resources
• Imagine a digital world full of biodiversity data and analytical resources like these, just as there is in bioinformatics
• How will users be able to find out what resources there are and how to use them in combination to answer scientific questions?
The cross-mapping problem
Taxonomy 1
Vicia faba
Caesalpinia crista L.
Taxonomy 2
Faba faba
Caesalpinia crista L.
Caesalpinia bonduc (L.) Roxb.
Caesalpinia crista L., p.p.
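One way to hold such a cross-map in code is as a list of (taxonomy 1 name, relationship, taxonomy 2 name) triples. The pairing below is only one possible reading of the example, since the original diagram is not reproduced here, and the relationship labels `congruent` and `includes` are hypothetical.

```python
# One possible reading of the example; relationship labels are hypothetical.
CROSS_MAP = [
    ("Vicia faba", "congruent", "Faba faba"),
    ("Caesalpinia crista L.", "includes", "Caesalpinia bonduc (L.) Roxb."),
    ("Caesalpinia crista L.", "includes", "Caesalpinia crista L., p.p."),
]


def targets(name, cross_map):
    """Return (relationship, taxonomy 2 name) pairs for a taxonomy 1 name."""
    return [(rel, t2) for t1, rel, t2 in cross_map if t1 == name]


def is_one_to_one(name, cross_map):
    """True when the name maps to exactly one congruent concept."""
    matches = targets(name, cross_map)
    return len(matches) == 1 and matches[0][0] == "congruent"
```

Under this reading, data filed under a one-to-one name can be transferred automatically, while data filed under a name that maps to several concepts needs further evidence to decide where it belongs.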
i4Life
4D4Life
Constraints and checklists
• (From Litchi 1)
• “A full name which is not a pro-parte name may not appear as both an accepted name and a synonym in the same checklist”
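The quoted constraint can be checked mechanically. The sketch below is not how Litchi implemented it; the entry layout (full name, status, pro-parte flag) is an assumption, and the names are placeholders, not real taxa.

```python
from collections import defaultdict

# Placeholder names, not real taxa. Each entry: (full_name, status, is_pro_parte).
SAMPLE_CHECKLIST = [
    ("Aus bus Smith", "accepted", False),
    ("Aus bus Smith", "synonym", False),   # breaks the constraint
    ("Aus cus Jones", "accepted", True),
    ("Aus cus Jones", "synonym", True),    # allowed: pro-parte name
    ("Aus dus Brown", "accepted", False),
]


def find_violations(entries):
    """Return full names that are not pro-parte yet appear both as an
    accepted name and as a synonym in the same checklist."""
    statuses = defaultdict(set)
    for full_name, status, is_pro_parte in entries:
        if not is_pro_parte:
            statuses[full_name].add(status)
    return sorted(name for name, seen in statuses.items()
                  if {"accepted", "synonym"} <= seen)


violations = find_violations(SAMPLE_CHECKLIST)
```

Running such checks before a checklist is merged into a larger catalogue flags conflicts for an editor to resolve.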
Persistent identifiers and change
In i4Life we need to
• Use persistent identifiers for taxon concepts
  – (started in TDWG-TIP project)
• Link taxonomies and track change
  – create and maintain “cross-maps”
Joining things up: workflow systems
Workflow problems addressed
• Incorporation of biodiversity services in workflows (BiodiversityWorld)
• Authentication in a workflow environment (ASMIMA)
• Rich annotation of services; discovery (Ewen Orme’s PhD)
• Knowledge-based assistance for workflow creators (Russell McIver’s PhD)
• Improving the User Experience (ACJ’s main contribution to BioVeL proposal)
Andrew Jones’ interests
• Naming & concepts
  – Accurately identifying concepts
  – Tracking change
• Making scientific workflow systems usable by non-computer scientists
  – Hiding “programming” complexity
  – Helping to find resources & build workflows
• Environments to support collaborative scientific research
  – E.g. “doing” taxonomy
Future projects
• We research solutions for data-handling problems faced by biologists and bioinformaticians
• If you think you might have an interesting and challenging problem, please get in touch