Knowledge Extraction and Semantic Linking in the Encyclopedia of Life
Uploaded by anne-thessen
Performance Metrics
• 1631 URIs assigned to 487 text objects from 21 test species
• 83% were correct
• 20% of the text objects were not assigned a URI
• 239 keys in the dictionary
• Precision 0.89, Recall 1, F1 Score 0.942
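The F1 score on this slide follows from the reported precision and recall as their harmonic mean. A minimal sketch of that arithmetic (the function name `f1_score` is chosen here for illustration and is not from the slides):

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported on the slide: precision 0.89, recall 1.
f1 = f1_score(0.89, 1.0)
print(round(f1, 3))  # 0.942
```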
Challenges and Errors
• Many ways to say the same thing
  – Uterine cannibalism = oophagy
• Negation (9%)
• Describing related taxa (30%)
• Word/phrase part (27%)
• Generalities (15%)
• Homonym (13%)
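Three of these error classes (synonyms, word/phrase parts, negation) can be illustrated with a small dictionary-based annotator. This is only a sketch under stated assumptions, not the method used in the project: the dictionary entries, the `example.org` URIs, the negation cue list, and the three-word look-back window are all hypothetical.

```python
import re

# Hypothetical dictionary: surface phrase -> placeholder URI.
# Synonyms ("uterine cannibalism", "oophagy") share one URI.
DICTIONARY = {
    "oophagy": "http://example.org/term/oophagy",
    "uterine cannibalism": "http://example.org/term/oophagy",
    "nocturnal": "http://example.org/term/nocturnal",
}

# Assumed negation cues; a real system would need a richer treatment.
NEGATION_CUES = {"not", "no", "never", "without"}


def annotate(text):
    """Return (phrase, uri, negated) for each dictionary phrase found in text."""
    lowered = text.lower()
    hits = []
    for phrase, uri in DICTIONARY.items():
        # Word boundaries avoid matching a key inside a longer word.
        pattern = r"\b" + re.escape(phrase) + r"\b"
        for match in re.finditer(pattern, lowered):
            # Look at the three words before the match for a negation cue.
            preceding = lowered[:match.start()].split()[-3:]
            negated = any(tok.strip(".,;") in NEGATION_CUES for tok in preceding)
            hits.append((phrase, uri, negated))
    return hits


print(annotate("Embryos practice uterine cannibalism."))
print(annotate("This species is not nocturnal."))
```

The `negated` flag lets a caller drop or down-weight a match instead of assigning the URI outright; the other error classes on the slide (related taxa, generalities, homonyms) need context beyond simple string matching.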
Finding Taxonomic Names
Challenges
Koko
Горилла
Guerilla
Eastern Lowland Gorilla
Gorilla graueri
Gorilla berengei
Gorilla beringei
Matschie
Gorilla beringei mikenensis
King kong
Gorilla gorilla
Virunga
Gorila
Gorille
Mountain gorilla
大猩猩
ゴリラ
Challenges
Aotus trivirgatus
Aotus Illiger 1811
Aotus
Aotus Smith 1805
Aotus ericoides
Contextual data
Primate
Monkey
Eyes
Food
Panama
Aotus nancymaae
Disambiguate by authority, species, contextual data
Contextual data
Legume
Plant
Flower
Mirbeliea
Australia
Aotus mollis
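Disambiguating the homonym Aotus by contextual data can be sketched as a keyword-overlap score. This is an illustrative assumption about how such a step might work, not the project's implementation: the keyword sets are taken from the two "Contextual data" slides, while the scoring rule and the function name `disambiguate` are hypothetical.

```python
import re

# Context keywords for each homonym, keyed by genus name plus authority.
CONTEXT = {
    "Aotus Illiger 1811": {"primate", "monkey", "eyes", "food", "panama"},
    "Aotus Smith 1805": {"legume", "plant", "flower", "australia"},
}


def disambiguate(text):
    """Pick the homonym whose context keywords overlap the text most.

    Returns None when no keyword matches or when the top score is tied.
    """
    tokens = set(re.findall(r"[a-z]+", text.lower()))
    scores = {name: len(tokens & keywords) for name, keywords in CONTEXT.items()}
    best = max(scores, key=scores.get)
    top = scores[best]
    if top == 0 or list(scores.values()).count(top) > 1:
        return None
    return best


print(disambiguate("Aotus is a monkey found in Panama."))
print(disambiguate("Aotus mollis is a legume with a yellow flower, native to Australia."))
```

Returning `None` on a tie or on zero overlap leaves the name unresolved, so that authority or species information can be tried instead of guessing.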