Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil...
-
Upload
kaylee-barton -
Category
Documents
-
view
217 -
download
1
Transcript of Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil...
![Page 1: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/1.jpg)
Ontological Infrastructure for a Semantic Newspaper
Roberto García1, Ferran Perdrix1,2, Rosa Gil1
1GRIHO – Human Computer Interaction Research Group Universitat de Lleida, Spain2SEGRE Media Group, Spain
![Page 2: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/2.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Contents
Introduction Proposal Ontological framework Integration framework Conclusions Future Work
![Page 3: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/3.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Contents
Introduction Proposal Ontological framework Integration framework Conclusions Future Work
![Page 4: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/4.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Introduction
Press and Media companies getting digital and Web Segre: newspaper, radio, television and web portal.
Multiple kinds of media text, photo, video,…
Heterogeneous sources agencies, journalists, partners, institutions,…
Heterogeneity: difficult to integrate and manage.
![Page 5: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/5.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Introduction
Related standards: International Press
NewsCodes, subjects reference system, taxonomy NITF, news documents structure NewsML, model news as multimedia packages
Multimedia MPEG-7, descriptive multimedia metadata TV-Anytime, multimedia taxonomies
Common aspect: non formal semantics, XML-based
![Page 6: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/6.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Introduction
Journalists
News Agencies
LegacyNews+Media
ReceiverNews+Photos
Custom XML
NITF, NewsCodes, NewsML,…
Archivist
User
![Page 7: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/7.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Contents
Introduction Proposal Ontological framework Integration framework Conclusions Future Work
![Page 8: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/8.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Proposal
Semantic Metadata and Ontology facilitate management and integration.
Related previous work: ELIN (Electronic Newspaper Initiative) NEPTUNO (Semantic Web Technologies for Digital Newspaper) NewMARS (Multimedia Advanced Redistribution Surveillance)
![Page 9: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/9.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Proposal
Journalists
News Agencies
Legacy
Receiver
SemanticRepository
Ontologies Framework
User
![Page 10: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/10.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Contents
Introduction Proposal Ontological framework Integration framework Conclusions Future Work
![Page 11: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/11.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Ontological Framework
NewsML, NITF, NewsCodes, MPEG-7, TVAnytime XML Semantic Web
“XML Semantics Reuse Methodology”. ReDeFer implementation XSD2OWL: schema to ontology. XML2RDF: XML instance data to RDF instances. CS2OWL: classification scheme to ontology
![Page 12: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/12.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Ontological Framework ReDeFer
XSD2OWLMappings:
owl:intersectionOfowl:unionOf
sequencechoice
owl:maxCardinalityowl:minCardinality
@maxOccurs@minOccurs
rdfs:subClassOfextension@base|restriction@base
owl:RestrictioncomplexType//element
owl:ClasscomplexType|group|attributeGroup
rdfs:rangeelement@type
rdfs:subPropertyOfelement@substitutionGroup
rdf:Propertyowl:DatatypePropertyowl:ObjectProperty
element|attribute
OWLXML Schema
![Page 13: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/13.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Ontological Framework NewsCodes Subjects Ontology
Subjects taxonomy NITF 3.3 Ontology
Structure concepts (paragraph, subheadline,…) Metadata properties (copyright, authorship, issue date,…)
NewsML 1.2 Ontology News multimedia structure (envelope, component, item,…)
MPEG-7 Ontology Complete ontology (2372 classes and 975 properties)
TVAnytime Ontologies Content and Format CSs
![Page 14: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/14.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Ontological Framework: MPEG-7
Validation, compare to other MPEG-7 Ontologies: Hunter02: not complete, RDF+DAML. Tsinaraki04: not complete, semantic part of MDS. Troncy03: not complete, from an ontology to MPEG-7.
![Page 15: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/15.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Ontological Framework: MPEG-7
Hunter02 MPEG-7 Ontology
![Page 16: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/16.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Ontological Framework: MPEG-7
MPEG-7 Ontology
![Page 17: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/17.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Ontological Framework: MPEG-7
Tsinaraki04
MPEG-7 Ontology
<complexType name="AudioType"><complexContent>
<extension base="mpeg7:MultimediaContentType">
<sequence><element name="Audio"
type="mpeg7:AudioSegmentType"/></sequence>
</extension></complexContent>
</complexType>
Class (AudioType partial
restriction(Audio cardinality(1))
MultimediaContentType)
Class (AudioType partial
restriction(Audio cardinality(1))restriction(Audio
allValuesFrom(AudioSegmentType)))MultimediaContentType)
![Page 18: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/18.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Ontological Framework: Instances
ReDeFer XML2RDF: XML tree RDF graph.
Deduce blank node types from XSD2OWL ontologies restrictions.
Root
elem elemelem
elem elem
Empty Text
elemattr
Empty Text Text Text
Blank nodes
rdf:Properties
XML tree model RDF graph model
![Page 19: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/19.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Ontological Framework: Instances
XML2RDF example
![Page 20: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/20.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Contents
Introduction Proposal Ontological framework Integration framework Conclusions Future Work
![Page 21: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/21.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Signal ProcessingAudio
Video
MPEG-7 XML
Content-based metadata
XML2RDF
NewsML Ontology
RDF
RDFContext-based
metadata
MPEG-7 Ontology
XML
Integration
Retrieval
Higher-level metadata
DL Classifier
SWRL Engine
XSD2OWL
XMLSchemas: NewsML, NITF, MPEG-7...
RDFS / OWL: IPTC SRS...
Integration Framework
Load Ontological Framework
![Page 22: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/22.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Integration Framework
NITF packaged in NewsML container IPTC’s NITF-to-NewsML Metadata Mapping Stylesheet
<NewsML><NewsItem>
<NewsComponent><DescriptiveMetadata>
<SubjectCode><Subject FormalName="04000000"/>
</SubjectCode></DescriptiveMetadata><ContentItem>
<DataContent><nitf><body>…</body></nitf>
</DataContent></ContentItem>
</NewsComponent></NewsItem>
</NewsML>
![Page 23: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/23.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Integration Framework
NewsML multimedia itemscontext and content-based MPEG-7 metadata
XML2RDF: RDF for NewsML-NITF instances Bridge subjects to NewsCodes ontology RDF for MPEG-7 metadata
![Page 24: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/24.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Integration Framework
Signal ProcessingAudio
Video
MPEG-7 XML
Content-based metadata
XML2RDF
NewsML Ontology
RDF
RDFContext-based
metadata
MPEG-7 Ontology
XML
Integration
Retrieval
Higher-level metadata
DL Classifier
SWRL Engine
XSD2OWL
XMLSchemas: NewsML, NITF, MPEG-7...
RDFS / OWL: IPTC SRS...
![Page 25: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/25.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Signal ProcessingAudio
Video
MPEG-7 XML
Content-based metadata
XML2RDF
NewsML Ontology
RDF
RDFContext-based
metadata
MPEG-7 Ontology
XML
Integration
Retrieval
Higher-level metadata
DL Classifier
SWRL Engine
XSD2OWL
XMLSchemas: NewsML, NITF, MPEG-7...
RDFS / OWL: IPTC SRS...
Integration Framework
![Page 26: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/26.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Contents
Introduction Proposal Ontological framework Integration framework Conclusions Future Work
![Page 27: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/27.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Conclusions
Signal ProcessingAudio
Video
MPEG-7 XML
Content-based metadata
XML2RDF
NewsML Ontology
RDF
RDFContext-based
metadata
MPEG-7 Ontology
XML
Integration
Retrieval
Higher-level metadata
DL Classifier
SWRL Engine
XSD2OWL
XMLSchemas: NewsML, NITF, MPEG-7...
RDFS / OWL: IPTC SRS...
![Page 28: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/28.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Conclusions
Press and Media domain: heterogeneous and metadata intensive
Semantic Web and Ontology facilitate management and integration
Existing workNewsML, NITF, NewsCodes, MPEG-7, TVAnytime,…
![Page 29: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/29.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Conclusions
XSD2OWL: take profit from XML Schema hidden semantics We formalise them when building ontologies, but also
implicitly when we make XML Schemas. XML2RDF:
reuse existing XML metadata to add momentum to the Semantic Web
![Page 30: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/30.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Contents
Introduction Proposal Ontological framework Integration framework Conclusions Future Work
![Page 31: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/31.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Future Work
Generate ontology for legacy system XML Map legacy ontology to NewsML-NITF ontologies Integrate automatic and assisted MPEG-7 metadata
multimedia annotation Complete the integration framework
![Page 32: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/32.jpg)
Semantic Integration and Retrieval of Multimedia Metadata
Future Work
User Interface: Rhizomik Media MPEG-7, TVAnytime, DC, Copyright Ontology… Rhizomer-based semantic portal
Rhizomer
![Page 33: Ontological Infrastructure for a Semantic Newspaper Roberto García 1, Ferran Perdrix 1,2, Rosa Gil 1 1 GRIHO – Human Computer Interaction Research Group.](https://reader036.fdocuments.net/reader036/viewer/2022070305/5514969c550346b0158b6337/html5/thumbnails/33.jpg)
Thank you for your attention
More at:
http://rhizomik.net …/redefer …/semanticnewspaper …/ontologies/mpeg7ontos
Contact:
{fperdrix,rgil}@diei.udl.es