Dante al tempo del web semantico
Valentina Bartalesi Lenzi ISTI-CNR
Pisa, 3 December 2014
• “Per una Enciclopedia Dantesca Digitale” is an Italian National project (PRIN, 2013-2016) that aims at building a digital library endowed with services supporting scholars in creating, evolving and consulting a digital encyclopaedia of Dante Alighieri’s works
• In collaboration with the Dipartimento di Filologia, Letteratura e Linguistica of the University of Pisa
Data set of Dante’s works along with their commentaries
Current focus: citations to primary sources
• Statistics
• Evolution of Dante’s knowledge of primary sources
• Impact of primary sources on Dante’s thought
1. Collecting and organizing literary texts and knowledge about them, for the purposes of:
• disseminating
• producing new knowledge
3. Developing an ontology for representing the knowledge of texts, used for implementing the following functions:
• storing
• accessing
• processing
The means: Semantic Web technologies
"The Semantic Web provides a common framework that allows data to be shared and reused across application, enterprise, and community boundaries" (Tim Berners-Lee, 2001)
Standard: URI (Uniform Resource Identifier)
E.g. Aeneid: http://dbpedia.org/page/Aeneid
• RDF (Resource Description Framework), the standard language defined by the W3C for knowledge representation
RDF triple: an assertion in the form of
– subject: resource
– predicate: property (of the resource)
– object: value (of the property)
• RDF model: a set of RDF triples
An RDF model is a graph!
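A minimal Python sketch of the idea that a set of triples is a graph. The URIs are the Aeneid examples used in this deck; which predicate joins which pair of nodes is an assumption, and the helper names `build_graph` and `objects` are hypothetical, not part of any RDF library.

```python
from collections import defaultdict

# Triples as (subject, predicate, object) tuples.
# Assumption: the Aeneid node is the subject of both statements.
TRIPLES = [
    ("http://dbpedia.org/resource/Eneide", "dc:creator",
     "http://dbpedia.org/resource/Virgilio"),
    ("http://dbpedia.org/resource/Eneide", "dc:subject",
     "http://purl.org/bncf/tid/21865/Epica_latina"),
]


def build_graph(triples):
    """Index triples as a directed, labelled graph: subject -> [(predicate, object)]."""
    graph = defaultdict(list)
    for subject, predicate, obj in triples:
        graph[subject].append((predicate, obj))
    return dict(graph)


def objects(graph, subject, predicate):
    """Follow every edge labelled `predicate` that leaves `subject`."""
    return [obj for pred, obj in graph.get(subject, []) if pred == predicate]


graph = build_graph(TRIPLES)
creators = objects(graph, "http://dbpedia.org/resource/Eneide", "dc:creator")
```

Subjects and objects are the nodes, predicates are the edge labels; a real application would use an RDF library rather than plain tuples.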
[Figure: RDF graph]
http://dbpedia.org/resource/Eneide dc:creator http://dbpedia.org/resource/Virgilio
http://dbpedia.org/resource/Eneide dc:subject http://purl.org/bncf/tid/21865/Epica_latina
The Web Ontology Language (OWL) is an extension of RDF that allows a larger number of axioms and rules to be defined
Axioms
- hasBrother(Federico, Paolo)
- hasFather(Valentina, Livio)
Rules
- Symmetric Property: e.g., hasBrother. “Federico hasBrother Paolo” implies “Paolo hasBrother Federico”
- Functional Property: e.g., hasFather. “Valentina hasFather Livio”
AIM: to extract new knowledge from data
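The two rules can be sketched in plain Python as follows. This is an illustration of the inference pattern only, not an OWL reasoner: the function names are hypothetical, and the second individual `Livio_B` is invented purely to trigger the functional rule. In OWL, a functional property with two values lets a reasoner conclude that the two values denote the same individual.

```python
from itertools import combinations


def infer_symmetric(facts, symmetric_properties):
    """Add (o, p, s) for every (s, p, o) whose property is declared symmetric."""
    inferred = set(facts)
    for subject, prop, obj in facts:
        if prop in symmetric_properties:
            inferred.add((obj, prop, subject))
    return inferred


def infer_same_as(facts, functional_properties):
    """For a functional property, two values on one subject must be the same individual."""
    values = {}
    for subject, prop, obj in facts:
        if prop in functional_properties:
            values.setdefault((subject, prop), set()).add(obj)
    same = set()
    for objs in values.values():
        for a, b in combinations(sorted(objs), 2):
            same.add((a, "owl:sameAs", b))
    return same


facts = {
    ("Federico", "hasBrother", "Paolo"),
    ("Valentina", "hasFather", "Livio"),
}
closed = infer_symmetric(facts, {"hasBrother"})
```

Here `closed` contains the new statement "Paolo hasBrother Federico", which was never asserted explicitly.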
• Linked Open Data describes a method of publishing structured data so that it can be interlinked and become more useful
• Standard Web technologies, such as HTTP, RDF and URIs, allow information to be shared in a way that computers can read automatically
• Data from different sources can be connected and queried
An ontology is a formal framework for representing knowledge. This framework names and defines the classes and relationships in a domain of discourse using axioms
[Figure: graph with nodes John, Bill and Mary and edges labelled "knows"]
<foaf:Person>
  <foaf:name>John</foaf:name>
  <foaf:knows>
    <foaf:Person>
      <foaf:name>Susan</foaf:name>
    </foaf:Person>
  </foaf:knows>
</foaf:Person>
Data: some parts of Convivio in Excel format
…Dante’s other works and commentaries are in Word!
Information added to the Excel data:
• Entire text of the note
• Kind of reference:
– Explicit reference: «Sì come dice lo Filosofo nel principio della Prima Filosofia» ("As the Philosopher says at the beginning of the First Philosophy")
– Strict reference: SI MANUCA: il pane degli angeli, nella tradizione veterotestamentaria è la manna (cfr. Ps. 77, 25 «Panem angelorum manducavit homo») ("SI MANUCA: the bread of angels, in the Old Testament tradition, is manna")
– Generic reference: «Dante affronta il tema degli ostacoli che allontanano di fatto gli uomini dal filosofare seguendo uno schema presente nei commenti agli scritti aristotelici» ("Dante addresses the theme of the obstacles that in fact keep men from philosophizing, following a scheme found in the commentaries on Aristotle's writings")
Tool (in Java) to transform the data set into an annotated corpus, using a text processing approach
1. number of the book containing the text fragment to which the note refers
2. number of the chapter
3. number of the paragraph
4. the text fragment to which the note applies (e.g., “Sì come dice lo filosofo nel principio della Prima Filosofia”)
5. the entire text of the note
1. author (e.g. Aristotle)
2. title (e.g. Metaphysics)
3. thematic area (e.g. Aristotelianism)
4. fragment of the text of the note citing a primary source (e.g. «Queste sono le parole con cui si apre la Metafisica di Aristotele», "These are the words with which Aristotle's Metaphysics opens")
5. kind of reference (explicit, strict or generic reference)
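One way to picture a row of the annotated corpus is as a record with the ten fields listed above. The sketch below is in Python for brevity (the actual tool is written in Java); the class name, field names and CSV layout are assumptions, and the location numbers and note text in the example are illustrative stand-ins.

```python
import csv
import io
from dataclasses import dataclass, asdict, fields

REFERENCE_KINDS = {"explicit", "strict", "generic"}


@dataclass
class NoteRecord:
    book: int
    chapter: int
    paragraph: int
    fragment: str          # text fragment the note applies to
    note_text: str         # entire text of the note
    author: str
    title: str
    thematic_area: str
    citing_fragment: str   # part of the note citing the primary source
    reference_kind: str    # "explicit", "strict" or "generic"

    def __post_init__(self):
        if self.reference_kind not in REFERENCE_KINDS:
            raise ValueError(f"unknown kind of reference: {self.reference_kind!r}")


def to_csv(records):
    """Serialize records to CSV text with one header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=[f.name for f in fields(NoteRecord)])
    writer.writeheader()
    for record in records:
        writer.writerow(asdict(record))
    return buffer.getvalue()


example = NoteRecord(
    book=1, chapter=1, paragraph=1,  # illustrative location values
    fragment="Sì come dice lo filosofo nel principio della Prima Filosofia",
    # Stand-in: only this fragment of the note is quoted in the slides.
    note_text="Queste sono le parole con cui si apre la Metafisica di Aristotele",
    author="Aristotle", title="Metaphysics", thematic_area="Aristotelianism",
    citing_fragment="Queste sono le parole con cui si apre la Metafisica di Aristotele",
    reference_kind="explicit",
)
csv_text = to_csv([example])
```

Validating the kind of reference at construction time keeps the three categories closed, so a typo cannot silently create a fourth one.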
Soggettario Nazionale conforms to the standards of the International Federation of Library Associations and Institutions (IFLA)
E.g. Scolastica, Aristotelismo, Bibbia, Astronomia, Epica latina, Medicina, Matematica, Teologia
http://thes.bncf.firenze.sbn.it/
RDF format
• We extended the tool with a Java program (using the Jena library) that transforms the CSV file into an RDF/XML file
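The CSV-to-RDF/XML step could look roughly like the sketch below. It uses Python's standard library instead of Java and Jena, so it is not the project's implementation; the column names, the base URI `http://example.org/dante/note/` and the choice of Dublin Core properties are all assumptions made for illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"
BASE = "http://example.org/dante/note/"  # hypothetical base URI

ET.register_namespace("rdf", RDF_NS)
ET.register_namespace("dc", DC_NS)

# Hypothetical mapping from CSV column to Dublin Core property.
COLUMN_TO_PROPERTY = {
    "author": "creator",
    "title": "title",
    "thematic_area": "subject",
}


def csv_to_rdfxml(csv_text):
    """Turn each CSV row into one rdf:Description with Dublin Core properties."""
    root = ET.Element(f"{{{RDF_NS}}}RDF")
    rows = csv.DictReader(io.StringIO(csv_text))
    for number, row in enumerate(rows, start=1):
        description = ET.SubElement(
            root,
            f"{{{RDF_NS}}}Description",
            {f"{{{RDF_NS}}}about": f"{BASE}{number}"},
        )
        for column, prop in COLUMN_TO_PROPERTY.items():
            ET.SubElement(description, f"{{{DC_NS}}}{prop}").text = row[column]
    return ET.tostring(root, encoding="unicode")


sample = "author,title,thematic_area\nAristotle,Metaphysics,Aristotelianism\n"
rdf_xml = csv_to_rdfxml(sample)
```

Each row becomes one resource and each mapped column becomes one triple about it.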
• We adopted the same approach to transform the content of Dante’s works (text of the Convivio, Monarchia, etc.) from Word to RDF format
…ontology expressed in the Semantic Web languages:
• RDF, OWL
… following the Linked Data recommendations:
• HTTP URIs for denoting resources
• a description for every resource
• links to (and hopefully from) other datasets
… re-using popular ontologies:
• CIDOC-CRM, FRBR, FRBRoo, Dublin Core, SKOS, FOAF, DoCO, FaBiO, CiTO, The Open Annotation Core Data Model
• To maximize interoperability we have re-used existing ontologies WHERE POSSIBLE, adding our own classes and relationships IF NECESSARY
• CIDOC-CRM, FRBR, FRBRoo, SAWS, DM2E Model, Dublin Core, SKOS, FOAF, DoCO, FaBiO, CiTO, Annotation Ontology, Open Annotation Core Data Model, NEPOMUK
• Model population
• Knowledge base in RDF stored in the triple store Virtuoso
• Currently, we are translating our ontology from RDF to OWL in order to add axioms that allow new knowledge to be inferred
• Researchers can add classes and relationships to our ontology, thereby refining it
• Our ontology can be linked to other ontologies to extend the represented domain
• Any user can download and use our model freely, using the paradigm of Linked Data
• We developed a web application (Java, JavaScript, Ajax) to extract and display the knowledge stored in our semantic model
• We used the SPARQL query language to extract knowledge
• The extracted knowledge is represented as charts (Highcharts library) and tables, as well as in CSV format
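As a rough illustration of this kind of extraction, the sketch below pairs a SPARQL aggregate query with an in-memory Python equivalent that produces the CSV behind a chart or table. The query, the `dc:creator` predicate, the `ex:` identifiers and the sample triples are assumptions; the real queries depend on the project's ontology, which is not shown in these slides.

```python
import csv
import io
from collections import Counter

# Hypothetical query: how many cited sources does each author have?
QUERY = """
PREFIX dc: <http://purl.org/dc/elements/1.1/>
SELECT ?author (COUNT(?source) AS ?n)
WHERE { ?source dc:creator ?author }
GROUP BY ?author
ORDER BY DESC(?n)
"""


def count_by_object(triples, predicate):
    """In-memory counterpart of the GROUP BY / COUNT query above."""
    counts = Counter(obj for _, pred, obj in triples if pred == predicate)
    # Highest count first; ties broken alphabetically for a stable result.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


def rows_to_csv(rows, header):
    """Serialize result rows as CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample data, only to exercise the functions.
sample_triples = [
    ("ex:source1", "dc:creator", "Aristotle"),
    ("ex:source2", "dc:creator", "Aristotle"),
    ("ex:source3", "dc:creator", "Virgil"),
    ("ex:source3", "dc:subject", "Epica latina"),
]
ranking = count_by_object(sample_triples, "dc:creator")
report = rows_to_csv(ranking, ["author", "sources"])
```

The same rows can feed a chart series or a table; only the serialization step changes.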
• At the moment, the application allows the user to see the following in Dante’s works:
– The cited primary sources
– The cited authors
– The cited thematic areas
• The data refer to the following works by Dante:
– Convivio
– Monarchia
– De vulgari eloquentia
– Vita Nova
Commentaries
• Dante Alighieri, De vulgari eloquentia, a cura di Mirko Tavoni, in Giunta C., Gorni G., Tavoni M., Opere. Vol. I: Rime, Vita Nova, De vulgari eloquentia, Milano, Mondadori, 2011
• Dante Alighieri, Vita Nova, a cura di Guglielmo Gorni, in Giunta C., Gorni G., Tavoni M., Opere. Vol. I: Rime, Vita Nova, De vulgari eloquentia, Milano, Mondadori, 2011
• Dante Alighieri, Monarchia, a cura di Diego Quaglioni, in AA.VV. (a cura di), Opere. Vol. II: Convivio, Monarchia, Epistole, Egloghe, Milano, Mondadori, 2014
• Dante Alighieri, Convivio, a cura di Giancarlo Fioravanti, in AA.VV. (a cura di), Opere. Vol. II: Convivio, Monarchia, Epistole, Egloghe, Milano, Mondadori, 2014
• Our web application is compliant with the W3C guidelines for usability and accessibility
• Having this information available in digital format makes scholars’ research on primary sources more efficient
• Currently running application: http://dante1.isti.cnr.it:8080/perunaenciclopediadantescadigitale
An event refers to an action taking place at a certain time at a specific location
Current information services offered to the user have limitations
http://www.europeana.eu/portal/
…introducing a new search functionality for DLs that returns not just a list of objects but a narrative
• Aristotle
• Russian formalists
• Computational narratology: studies narratives from a computational perspective and develops narratological models
• Artificial Intelligence field: development of storytelling systems using narrative as an interactive method
• In our computational narratology perspective, a narrative is a semantic representation telling a story
Our proposed search functionality for DLs will use Semantic Web technologies (RDF, OWL etc.) to:
1. Create narratives that link objects by means of events, using a computational narrative approach
2. Enable digital curators and web users to create their own narratives for the DL objects they would like to promote
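A toy Python sketch of what "linking objects by means of events" could mean in data terms. It follows the definition of an event given earlier (an action, a time, a location) and adds a hypothetical list of related DL objects; the class, the function names and the `ex:object/...` identifiers are all assumptions, since the narrative ontology is still to be developed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    action: str
    year: int          # time, simplified to a year
    location: str
    objects: tuple     # identifiers of DL objects related to the event


def narrative_for(events, obj):
    """Events involving `obj`, in chronological order."""
    return sorted((e for e in events if obj in e.objects), key=lambda e: e.year)


def linked_objects(events, obj):
    """Other objects reachable from `obj` through a shared event."""
    linked = set()
    for event in events:
        if obj in event.objects:
            linked.update(event.objects)
    linked.discard(obj)
    return linked


events = [
    Event("death of Dante", 1321, "Ravenna", ("ex:object/1", "ex:object/2")),
    Event("birth of Dante", 1265, "Florence", ("ex:object/1",)),
]
story = narrative_for(events, "ex:object/1")
```

A search for one object then returns an ordered sequence of events, plus the other objects those events lead to, instead of a flat result list.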
We analyzed various ontologies for representing events on the Semantic Web. E.g. Event Ontology, CIDOC-CRM, Europeana Data Model, Linking Open Descriptions of Events, F-Model, ABC ontology
The events that the ontologies describe include both historical events (births, deaths, wars…) and events concerning the collected objects (deposit, changes of ownership, restoration)
We are working to create an ontology that represents knowledge about how an author’s cultural background is formed and how this background influences their works
1. We are searching for a standard methodology, if one exists, used by scholars to describe and outline the cultural background of an author
2. We would like to formalize this methodology into an ontology using the Semantic Web languages
• Improved usability of the DLs: more informative results using narratives
• Digital curators or web users can create their own narratives
• Enrichment of the DLs by digital curators or web users, who can create narratives adding their own objects to the DLs
• Developing an ontology (RDF, OWL) for representing the narratives
• Extending the ontology that represents knowledge of literary texts with narratives
• Enriching the current web application with software allowing users to create narratives
• Evaluating the usability and accessibility of our web application
International Conferences with Peer Review
• V. Bartalesi, C. Meghini, M. Tavoni, P. Andriani, E. Locuratolo, L. Versienti: "Towards a Semantic Network of Dante’s Works and their Contextual Knowledge", in Proceedings of Digital Humanities 2014, July 2014
• C. Meghini, V. Bartalesi: "Steps towards Enhancing the User Experience in Accessing Digital Libraries", in Proceedings of HCI International 2014, Springer, LNCS, vol. 8522, pp. 555-566, June 2014
• V. Bartalesi, C. Meghini, E. Locuratolo, L. Versienti: "A Preliminary Study on the Semantic Representation of the Notes to Dante Alighieri’s Convivio", in Proceedings of the 1st International Workshop on Collaborative Annotations in Shared Environment: metadata, vocabularies and techniques in the Digital Humanities (DH-CASE '13), ACM, New York, NY, USA, Article 4, 7 pages, September 2013
• Collaboration in extending the tool that transforms the CSV files into RDF/XML files so that it complies with the ontology of narratives
• Collaboration on the development of the software to create narratives
• Collaboration in developing a new usable and accessible Web interface
• Evaluation of the usability and accessibility of the Web interface (user tests, survey…)