MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA...
-
Upload
julianna-flowers -
Category
Documents
-
view
219 -
download
0
Transcript of MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA...
![Page 2: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/2.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
2
Outline
IntroductionMLIF [ISO WD 24616]Ongoing activitiesActionsDate Extension RequestConclusion
![Page 3: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/3.jpg)
Introduction
![Page 4: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/4.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
4
Introduction
The “Multi Lingual Information Framework” MLIFISO AWI 24616 (TC37/SC4 WG3)Nasredine Semmar (CEA, France), WG3 ConvenorSamuel Cruz-Lara (LORIA / INRIA, France), MLIF Project Leader
![Page 5: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/5.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
5
New Work Item Proposal
ISO TC37/SC4 MeetingChina, Beijing August 2006
ISO AWI 24616
![Page 6: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/6.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
6
Scope
This standard aims at proposing a specification platform for a computer-oriented representation of multilingual data within a large variety of applications such as translation memories, localization, computer-aided translation, multimedia, or electronic document management.
![Page 7: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/7.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
7
Scope
As with the “Terminological Markup Framework”, used in terminology [ISO 16642], the MLIF will introduce a metamodel in combination with chosen data categories that will be integrated within the TC37 Data Category Registry [ISO/DIS 12620.2] in order to allow the description of any specific domain.
![Page 8: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/8.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
8
Scope
The standard will thus provide a way to validate any instance of this metamodel, as well as, interoperability principles with numerous translation and localization standards.
![Page 9: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/9.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
9
Purpose and Justification
The extremely fast evolution of the technological development in the sector of Communication and Information Technologies, and in particular, in the field of natural language processing, makes particularly acute the question of standardization.The issues related to this standardization are of an industrial, economic and cultural nature.
![Page 10: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/10.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
10
Purpose and Justification
The control of the interoperability between the industrial standards currently used for localization (XLIFF) , translation memory (TMX) , or any other Multi Lingual Markup Language (ML2), constitutes a major objective for a coherent and global management of these data.
![Page 11: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/11.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
11
Purpose and Justification
The MLIF could be associated to several multimedia standards such as MPEG-4 [ISO/IEC 14496], MPEG-7 [ISO/IEC 15938], and W3C SMIL, in order to handle multilingual data within several multimedia applications such as, interactive TV, video conferencing, subtitling, karaoke and accessibility.
![Page 12: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/12.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
12
Purpose and Justification
The MLIF may also be used in cultural heritage related activities such as, digital museums, e-learning and electronic document management.
![Page 13: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/13.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
13
Purpose and Justification
As with the “Terminological Markup Framework” (TMF), used in terminology, the MLIF will introduce a metamodel in combination with chosen data categories.
![Page 14: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/14.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
14
Purpose and Justification
These data categories will be derived as a subset of a Data Category Registry (DCR) [ISO/DIS 12620.2], in order to ensure interoperability between several multilingual applications and corpora.
![Page 15: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/15.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
15
Purpose and Justification
A Data Category Specification (DCS) will define, in combination with the metamodel, the various constraints that apply to a given domain-specific information structure or interchange format.A DCS and a metamodel represent the organization of an individual application and the organization of a specific domain.
![Page 16: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/16.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
16
Purpose and Justification
The MLIF should be considered as a unified conceptual representation of multilingual content.The MLIF is not meant to replace or to compete with any other existing standard such as TMX or XLIFF.
![Page 17: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/17.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
17
Purpose and Justification
Rather, the MLIF is being designed with the objective of providing a common conceptual model and a platform allowing interoperability among several translation and localization standards, and by extension, their committed tools.
![Page 18: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/18.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
18
Purpose and Justification
The asset of MLIF is the interoperability which allows experts to gather, under the same conceptual unit, various tools and representations related to multilingual data.In addition, MLIF will also make it possible to evaluate and to compare these multilingual resources and tools.
![Page 19: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/19.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
19
Purpose and Justification
The description of all different XML elements will be done by using RelaxNG [ISO/IEC 19757-2] with the help of ODD, which is the creation and documentation language for XML schemas proposed by the TEI (Text Encoding Initiative).This follows a recent decision taken by the World Wide Web Consortium.
![Page 20: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/20.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
20
Purpose and Justification
MLIF will also provide a variety of filters allowing transformations between several MLIF-compatible formats, as well as, several automatic validation tools.The adoption of this new work item proposal may allow to instantiate the business plan proposal of TC 37/SC4 in the perspective of creating a work specification on multilingual content (WG 3).
![Page 21: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/21.jpg)
MLIF
ISO WD 24616
![Page 22: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/22.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
22
Introduction
The scope of research and development in the localization and translation memory (TM) process development is very large, and numerous independent groups are working on these aspects, such as LISA, OASIS, W3C, ISO, etc.
![Page 23: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/23.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
23
Introduction
Under the guidance of the above-mentioned groups, many formats have been developed.Some of the major formats of specific interest for localization and TM are TMX (LISA/OSCAR) and XLIFF (OASIS).
![Page 24: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/24.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
24
Introduction
There are many identical requirements for all the formats irrespective of the differences in final output.For example, all the formats aim at being user-friendly, easy-to-learn, and at reusing existing databases or knowledge.
![Page 25: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/25.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
25
Introduction
All these formats work well in the specific field they are designed for, but they lack a synergy that would make them interoperable when using one type of information in a slightly different context, giving rise to the fear of competition between them.
![Page 26: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/26.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
26
Scope
As with “Terminological Markup Framework” TMF [ISO 16642] in terminology, MLIF will introduce a metamodel in combination with chosen data categories [ISO/DIS 12620.2], as a means of ensuring interoperability between several multilingual applications and corpora.
![Page 27: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/27.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
27
Scope
MLIF deals with multilingual corpora, multilingual fragments, and the translation relations between them.
![Page 28: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/28.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
28
Scope
In each domain where MLIF can be used, we may consider a specific granularity of segmentation and description, built on MAF [ISO CD 24611], Synaf [ISO CD 24615], TMF [ISO 16642] or LAF [ISO CD 24612] respectively, for morphological description, syntactical annotation, terminological description, and linguistic annotation.
![Page 29: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/29.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
29
Scope
MLIF will thus describe elementary linguistic segments (i.e. sentence, syntactical component, word, …).
![Page 30: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/30.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
30
Scope
Supporting the construction and the interoperability of localization and “Translation Memories” (TM) resources, MLIF deals with the description of a metamodel for multilingual content.
![Page 31: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/31.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
31
Scope
MLIF will not propose a closed list of description features.Rather, it will provide a list of Data Categories, which is much easier to update and extend.This list represents a point of reference for multilingual information in the context of various application scenarios.
![Page 32: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/32.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
32
Normative References
ISO 16642 Computer applications in terminology -- Terminological Markup Framework.ISO CD 24615 Syntactical Annotation Framework.ISO/DIS 12620.2 Computer applications in terminology -- Data categories.
![Page 33: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/33.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
33
Normative References
ISO/IEC 639-1, Information technology - ISO 639:1988, Code for the representation of names of languages.ISO 639-2 Code for the representation of names and languages-part 2:Alpha-3 code.ISO 8601 Data elements and interchange formats - Information interchange - Representation of dates and times.
![Page 34: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/34.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
34
Starting point of MLIF
The MLIF promotes the use of a common framework for the future development of several different formats: TMX, XLIFF, …The MLIF can be considered as a parent for all these formats, since all of them deal with multilingual data expressed in the form of segments or text units. They all can be stored, manipulated and translated in a similar manner.
![Page 35: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/35.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
35
MLIF Metamodel
![Page 36: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/36.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
36
MLIF Metamodel
Multi Lingual Data Collection (MLDC)Represents a collection of data containing global information and several multilingual units.
Global Information (GI)Represents technical and administrative information applying to the entire data collection. Example: title of the data collection, revision history, …
![Page 37: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/37.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
37
MLIF Metamodel
Multi Lingual Component (MULTI)This component represents a unique multilingual entry.
Mono Lingual Component (MONO)Part of a multilingual component containing information related to one language.
Segment Component (SEG)Textual content itself (may be “decorated” with several attributes as in SynAF [ISO CD 24615])
![Page 38: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/38.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
38
MLIF Metamodel
History ComponentThis generic component allows to trace modifications on the component it is anchored to (i.e. versioning).
![Page 39: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/39.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
39
MLIF Metamodel
In order to provide a larger description of the linguistic content, the MLIF metamodel allows anchoring of other metamodels, such as MAF (morphological description), SynAF (syntactical annotation), TMF (terminological description), LAF (linguistic annotation), or any other metamodel based on ISO 12620.2.
![Page 40: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/40.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
40
MLIF Metamodel
![Page 41: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/41.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
41
Data Categories for MLIF
All models have very similar hierarchical structure but they have different terms and methods of storing metadata relevant to them in particular.The MLIF provides a generic structure that can establish basic foundation for all other models.
![Page 42: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/42.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
42
Data Categories for MLIF
![Page 43: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/43.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
43
Data Categories for MLIF
Global Information (GI)/domain/
Specifies the domain on which the MLDC is dependent.
/project/Specifies a project within the domain on which the MLDC is dependent.
/source/ “A complete citation of the bibliographic information pertaining to a document or other resource. “[ISO12620] Reference to a resource from which the present resource is derived.
![Page 44: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/44.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
44
Data Categories for MLIF
Global Information (GI)/sourceType/
“In multilingual and translation-oriented language resource or terminology management, the kind of text used to document the selection of lexical or terminological equivalents, collocations, and the like. “[ISO12620]“Both parallel and background texts serve as sources for information used in documenting multilingual terminology entries. “[ISO12620]
![Page 45: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/45.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
45
Data Categories for MLIF
Global Information (GI)/sourceLanguage/
“In a translation-oriented language resource or terminology database, the language that is taken as the language in which the original text is written.” [ISO12620]
/note/This is an optional descriptor providing further information on any part of a content.
![Page 46: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/46.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
46
Data Categories for MLIF
Multilingual Component (MULTI)/identifier/
A unique name [source:IMDI_Source_Tag]Dublin Core equivalent: DC:Identifier [source: IMDI_Source_Tag]XML equivalent “xml:id” [source: http://www.w3.org/TR/xml-id]
/class/A hierarchical high level description of the component it is anchored to.
![Page 47: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/47.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
47
Data Categories for MLIF
Multilingual Component (MULTI)/subclass/
A hierarchical low level description of the component it is anchored to.
/note/This is an optional descriptor providing further information on any part of a content.
![Page 48: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/48.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
48
Data Categories for MLIF
Monolingual Component (MONO)/languageIdentifier/
A unique identifier in a language resource entry that indicates the name of a language. [source:ISO12620]XML equivalent “xml:lang”
/languageLevel/ Specifies the language level of the unique language identifier associated to the monolingual component (e.g. adults, children, scientist, slang, …)
![Page 49: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/49.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
49
Data Categories for MLIF
Monolingual Component (MONO)/identifier/
A unique name [source:IMDI_Source_Tag]Dublin Core equivalent: DC:Identifier [source: IMDI_Source_Tag]XML equivalent “xml:id” [source: http://www.w3.org/TR/xml-id]
![Page 50: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/50.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
50
Data Categories for MLIF
Monolingual Component (MONO)/class/
A hierarchical high level description of the component it is anchored to.
/subclass/A hierarchical low level description of the component it is anchored to.
![Page 51: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/51.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
51
Data Categories for MLIF
Monolingual Component (MONO)/xlink/: Is a data category refinement composed by several data categories. It is used to identify a name or a resource:
/uri/: can be represented by an xlink:href attribute./type/: can be represented by an xlink:type attribute./label/: can be represented by an xlink:label attribute./title/: can be represented by an xlink:title attribute./from/: can be represented by an xlink:from attribute./to/: can be represented by an xlink:to attribute.
![Page 52: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/52.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
52
Data categories for MLIF
Segment Component (SEG)/identifier/
A unique name [source:IMDI_Source_Tag]Dublin Core equivalent: DC:Identifier [source: IMDI_Source_Tag]XML equivalent “xml:id” [source: http://www.w3.org/TR/xml-id]
![Page 53: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/53.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
53
Data Categories for MLIF
Segment Component (SEG)/class/
A hierarchical high level description of the component it is anchored to.
/subclass/A hierarchical low level description of the component it is anchored to.
![Page 54: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/54.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
54
Data Categories for MLIF
Segment Component (SEG)/xlink/: Is a data category refinement composed by several data categories. It is used to identify a name or a resource:
/uri/: can be represented by an xlink:href attribute./type/: can be represented by an xlink:type attribute./label/: can be represented by an xlink:label attribute./title/: can be represented by an xlink:title attribute./from/: can be represented by an xlink:from attribute./to/: can be represented by an xlink:to attribute.
![Page 55: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/55.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
55
Data Categories for MLIF
The HistoryComponent is a generic component allowing to trace modifications on the component it is anchored to (e.g., creation, modification, validation). It can be anchored onto any component of the metamodel.In MLIF metamodel, the HistoryComponent may be anchored to the GlobalInformation or to the MonoLingualComponent.
![Page 56: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/56.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
56
Data Categories for MLIF
In the GlobalInformation component, the HistoryComponent keeps all information related to any modification on the context or on the domain;In the MonoLingualComponent, the HistoryComponent allows keeping all evolutions or any enhancement of the content.
![Page 57: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/57.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
57
Data Categories for MLIF
HistoryComponent/transaction/
One of the steps involved in the creation, approval, and use of a specific component (approval, check, exportation, importation, input, modification, origination, standardization, userAccess, withdrawal).
![Page 58: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/58.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
58
Data Categories for MLIF
HistoryComponent/date/
A date. The date is encoded according to a profile of [ISO8601] as described in [W3CDTF] and follows the YYYY-MM-DD format.
/author/The person responsible for the creation of the content.
/note/This is an optional descriptor providing further information on any part of a content.
![Page 59: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/59.jpg)
MLIF example 1
![Page 60: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/60.jpg)
MLIF example 2
![Page 61: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/61.jpg)
MLIF example 3
![Page 62: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/62.jpg)
Ongoing Activities
![Page 63: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/63.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
63
Ongoing activities
MLIF can be used in e-learning, interactive television programs and any other application having a user interface. It may be very helpful for future interactive television broadcasting.Within ITEA’s “Jules Verne” and “Passepartout” projects, we have identified several potential implementations of MLIF in association with interactive TV and multimedia.
![Page 64: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/64.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
64
Ongoing activities
The MLIF and multimedia association presents ample opportunity for giving value to different languages and cultures, as is the case in Europe and Asia.
![Page 65: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/65.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
65
Ongoing activities
Within the framework of ITEA “Passepartout” project, we have experimented with some basic scenarios by using:XMT (“eXtensible MPEG4 Textual format”) and,W3C SMIL (“Synchronized Multimedia Integration Language”).
![Page 66: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/66.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
66
Ongoing activities
We are currently collaborating with the W3C SYMM (SYnchronized MultiMedia) Working GroupSMIL (Synchronized Multimedia Integration Language)
SMILTextNew text container element
Multilinguality (yes)Linguistic granularity (no)
![Page 67: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/67.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
67
SMILText
This module defines new functionality for SMIL 3.0. It extends the media types available for SMIL, but does not alter any other existing functionality from SMIL 2.1 or earlier versions.
Editors: Dick Bulterman (CWI, The Netherlands), Sjoerd Mullender (CWI, The Netherlands), Samuel Cruz-Lara (LORIA / INRIA, France)TEXT IS BEAUTIFUL !
![Page 68: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/68.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
68
SMILText
The SMILText modules provide a text container element with an explicit content model for defining in-line text, and a set of additional elements and attributes to control explicit in-line text rendering.
![Page 69: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/69.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
69
SMILText
Since the SMILText elements and attributes are defined in a series of modules, designers of other markup languages can reuse these modules when they need to include a simple form of timed text functionality into their language.
![Page 70: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/70.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
71
Ongoing activities
We have recently joint four new projects where MLIF should be used:ITEA2 SEMbySEM (INRIA - S. Cruz-Lara)ITEA2 METAVERSE1 (INRIA - S. Cruz-Lara)FP7-ICT MEDAR (CEA - N. Semmar)ANR WEBCROSSLING (CEA - N. Semmar)
![Page 71: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/71.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
72
SEMbySEM
This project will provide an innovative and comprehensive Semantic Services Management System, based on a common open infrastructure, to allow management of tomorrow mixed systems (made of thousands elementary software and hardware components) with facilities to build ad-hoc dynamic visualisations of the managed systems for information, management and SLA verification purposes.
![Page 72: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/72.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
73
SEMbySEM
To do so, it will extensively use semantic description with the help of a « new standard » that it will provide.Keywords: Software Engineering, UML, OMG (SBVR, MOF, KDM, …Countries: Finland, France, and TurkeyINRIA’s contribution:
Multilingual Ontologies
![Page 73: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/73.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
74
METAVERSE1
The Metaverse1 (global standards among real and virtual worlds) project will provide a standardized global framework that enables the interoperability between virtual worlds (as for example Second Life, World of Warcraft, IMVU, Google Earth and many others) and the real world (sensors, actuators, vision and rendering, social and welfare systems, banking, insurance, travel, real estate and many others).
QuickTime™ et undécompresseur
sont requis pour visionner cette image.
![Page 74: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/74.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
75
METAVERSE1
The ‘Metaverse for all’ will be a special attention point aiming at the e-Inclusion of minorities in the society.Countries: Belgium, France, Germany, Greece, Israel, Luxembourg, The Netherlands, and SpainINRIA’s contribution:
Standardise the management and the representation of multilingual textual data
QuickTime™ et undécompresseur
sont requis pour visionner cette image.
![Page 75: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/75.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
76
FP7-ICT MEDAR
Supporting the development of tools and resources in Machine Translation and MultiLingual Information Retrieval on the basis of other partners technologies and open source codeCEA’s contribution:
Multilingual Information Retrieval
![Page 76: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/76.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
77
ANR WebCrossling
Developing a Machine Translation prototype based on MultiLingual Information Retrieval technologyCEA’s contribution:Multilingual Information Retrieval
![Page 77: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/77.jpg)
Actions
![Page 78: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/78.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
79
Actions
MLIF is an ISO “Working Draft”[ISO WD 24616].
A Committee of Experts for MLIF has been constitutedThe proposal we have just presented needs comments, remarks, … so it will be shortly sent to the Committee of Experts
![Page 79: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/79.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
80
MLIF Experts Committee
Member bodiesAnnelies Kesting - NEN (The Netherlands)Bettina Seitl - ON (Austria)Young-Shik Kang - KATS (Republic of Korea)Marketa Jindrakova - CNI (Czech Republic)Roberto Ravaglia - UNI (Italy)Surayuth Boonmatat - TISI (Thailand)Toni Hittema - AFNOR (France)
![Page 80: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/80.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
81
MLIF Experts Committee
ExpertsDewi Bryn Jones - Canolfan Bedwyr (UK)Elena Montiel - UPM (Spain)Elsa Sklavounou - SYSTRAN (France)Emmanuel Planas - Lingua et Machina (France)Felix Sasaki - W3CGerhard Budin - University of Vienna (Austria)Guadalupe Aguado de Cea - UPM (Spain)Harry Bunt - Tilburg University (The Netherlands)
![Page 81: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/81.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
82
MLIF Experts Committee
ExpertsKey-Sun Choi - KORTERM/KAIST (Korea)Kiyong Lee - Korea University (Korea)Laurent Romary - Max Planck Digital Library (Germany), INRIA (France)Mourad Amine - Université de Montréal (Canada)Nasredine Semmar CEA (France)Nicoletta Calzolari ILC-CNR (Italy)
![Page 82: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/82.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
83
MLIF Experts Committee
ExpertsSamuel Cruz-Lara - LORIA / INRIA (France)Stéphane Albucher - Business Objects (France)Wim Peters - University of Sheffield (UK)Yves Savourel ENLASO - (USA)Julien Ducret - SAFARI Consulting (France)
![Page 83: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/83.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
84
Actions
The basis of the discussion must be the new proposalMetamodel & Data Categories
Do we need to modify them?Do we need to take into account any new aspect?
How can we progress?Use cases!!!
![Page 84: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/84.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
85
Actions
Use CasesInteroperabilityPublishingLinguistic GranularitySegmentationRelated StandardsMultimediaMultilingual Ontologies…
![Page 85: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/85.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
86
Interoperability
TMX“the sentence contains different formatting information”
![Page 86: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/86.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
87
Interoperability
![Page 87: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/87.jpg)
Interoperability TMX & MLIF
![Page 88: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/88.jpg)
Interoperability TMX & MLIF
![Page 89: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/89.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
90
Publishing
![Page 90: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/90.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
91
Interoperability & Publishing
![Page 91: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/91.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
92
Linguistic Properties
Sentence, word, lemma, POS, …Time related issues?
![Page 92: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/92.jpg)
Linguistic Properties
Example by: Nasredine Semmar
![Page 93: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/93.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
94
Segmentation
How segmentation issues will be taken into account by MLIF?
![Page 94: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/94.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
95
Related Standards
TEI (Text Encoding Initiative)The description of all different XML elements will be done by using RelaxNG [ISO 19757-2] with the help of ODD, which is the creation and documentation language for XML schemas proposed by the TEI.
![Page 95: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/95.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
96
Related Standards
W3C ITS (International Tag Set)ITS is a set of rules, expressed in elements, that provide information on how parts of a given DTD or XML Schema are related to specific internationalization & localization properties.Should ITS may be used inside MLIF (as ITS may be used in SMILText)?
![Page 96: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/96.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
97
Multimedia
W3C SMILSMILText
Multilinguality (yes)Linguistic Granularity (no)
![Page 97: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/97.jpg)
Multimedia
![Page 98: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/98.jpg)
Multimedia
![Page 99: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/99.jpg)
MultimediaTimed, Multilingual Textual DescriptionsTimed, Multilingual Textual Descriptions
W3C SMIL Standardization- Development of Interactive TV Profile- Integration of Annotation Support- Definition of Temporal Text Processing
ISO MLIF Standardization- Development of MLIF format- Development of a multilingual processing pipeline- Interaction with SMIL and MPEG standards
multilingual componentmultilingual component
multilingual DBmultilingual DB
linguistic segmentlinguistic segment
l’histoire du courage d’une femme pour l’histoire du courage d’une femme pour démasquer un mystèredémasquer un mystère
Monolingual componentMonolingual component
linguistic segmentlinguistic segment
la historia da la valentía de una mujer para la historia da la valentía de una mujer para desenmascarar undesenmascarar un misteriomisterio
Monolingual componentMonolingual component
![Page 100: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/100.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
101
Multilingual Ontologies
In what way can MLIF be related to Multilingual Ontologies?ITEA2 SEMbySEM
![Page 101: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/101.jpg)
Date Extension Request
![Page 102: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/102.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
103
MLIF [ISO CD 24616]
Current state:WarningUrgent actions to do?
![Page 103: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/103.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
104
Date Extension Request
Current Proposed
DIS 2008-10-13
2009-10-13
FDIS 2010-04-13
2011-04-13
IS 2010-10-13
2011-10-13
![Page 104: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/104.jpg)
Conclusion
![Page 105: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/105.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
106
Conclusion
We suggest to develop several use cases in order to test and to validate the new metamodel and related data categoriesEach use case should be leaded by one or several members of the MLIF Experts Committee
![Page 106: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/106.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
107
Conclusion
A new working draft of MLIF, taken into account all proposed use cases, should be submitted to the Committee of Experts soon
![Page 107: MLIF: The Multi Lingual Information Framework ISO WD 24616 Samuel CRUZ-LARA Samuel.Cruz-Lara@loria.fr LORIA / INRIA, France.](https://reader036.fdocuments.net/reader036/viewer/2022062518/56649e915503460f94b95c39/html5/thumbnails/107.jpg)
Pisa, 26/09/2008 ISO TC37/SC4 MLIF (S. Cruz-Lara) (c) ISO 2008
All rights reserved
108
Thank you!
Thank you for your attentionAny question?Mailing [email protected]
Web sitehttp://mlif.loria.fr