MPEG-7 Multimedia Content Description Standard

20
MPEG-7 Multimedia Content Description Standard January 8, 2003 John R. Smith Pervasive Media Management Group IBM T. J. Watson Research Center 19 Skyline Drive Hawthorne, NY 10532 USA [email protected]

description

MPEG-7 Multimedia Content Description Standard. January 8, 2003. John R. Smith Pervasive Media Management Group IBM T. J. Watson Research Center 19 Skyline Drive Hawthorne, NY 10532 USA [email protected]. Digital Media Value Chain (MPEG-7 Metadata Management). Content value chain: - PowerPoint PPT Presentation

Transcript of MPEG-7 Multimedia Content Description Standard

Page 1: MPEG-7 Multimedia Content Description Standard

MPEG-7 Multimedia Content Description Standard

January 8, 2003

John R. SmithPervasive Media Management Group

IBM T. J. Watson Research Center19 Skyline Drive

Hawthorne, NY 10532 [email protected]

Page 2: MPEG-7 Multimedia Content Description Standard

Digital Media Value Chain (MPEG-7 Metadata Management)

• Content value chain:– Life-cycle:

• Increase content asset value through repurposing

– Metadata layer:• First-class role of

metadata (smart bits) throughout digital media lifecycle

• Central role of metadata management (XML schemas, catalog records, ontologies)

• MPEG-7 for content description

• MPEG-21 for packaging, rights mgmt, transactions

– Digital media metadata access functions:

• Create: search, annotate, extract

• Manage: index, annotate, collate

• Transact/Distribute: search, package, adapt

Sell

Deliver

Distribute

Store

Maintain

Produce

OrganizeAcquire

Create

Plan MetadataMgmt.

Search

Collate

Package

Extract

Annotate

Adapt

Annotate

Search

Author

Tran

sact

Create

Man

age

Index

Page 3: MPEG-7 Multimedia Content Description Standard

User

N etwor k

M PE G- 7Pr oc essing

M PE G- 7S ear ch

Per vasiveU sage

E nvir onment

S ound s like ...L ooks like ...

D igit al M ed ia Respositor y

SemanticsQuery

MPEG-7S emanticsMPEG-7

DescriptorsMPEG-7Model

Model SimilarityQuery

Descript ions Descriptions

S earch

MPEG-7 Sear ch Engine(X ML Met adata)

MPEG-7SCHEMA

MPEG-7MetadataStorage

I BM Content Manager(Library Server &Object S erver)

MPEG-7 Multimedia Indexing and Searching

• MPEG-7 Indexing & Searching:– Semantics-based (people, places,

events, objects, scenes)– Content-based (color, texture, motion,

melody, timbre)– Metadata (title, author, dates)

• MPEG-7 Access & Delivery:– Media personalization– Adaptation & summarization– Usage environment (user

preferences, devices, context)

Page 4: MPEG-7 Multimedia Content Description Standard

MPEG-7 Overview (XML for Multimedia Content Description)

• MPEG-7 Normative elements:– Descriptors and Description Schemes– DDL for defining Description Schemes– Extensible for application domains

• Rich, highly granular multimedia content description:– Video segments, moving regions, shots, frames,

…– Audio-visual features: color, texture, shape, …– Semantics: people, events, objects, scenes, …

AcquisitionAuthoringEditing

BrowsingNavigation

FilteringManagement

TransmissionRetrievalStreaming

CodingCompression

SearchingIndexing

MPEG-1,-2,-4

MPEG-7

Reference Region Reference Region

Motion Motion

Reference Region

Motion

Segment Tree

Shot1 Shot2 Shot3

Segment 1

Sub-segment 1

Sub-segment 2

Sub-segment 3

Sub-segment 4

segment 2

Segment 3

Segment 4

Segment 5

Segment 6

Segment 7

Event Tree

• Introduction

• Summary

• Program logo

• Studio

• Overview

• News Presenter

• News Items

• International

• Clinton Case

• Pope in Cuba

• National

• Twins

• Sports

• Closing

TimeAxis

DDL

DS DS

D

D D D

D

DS

Applicationdomain

MPEG-7Standard

i.e., Medical imagingRemote-sensing imagesSurveillance videoComputer animations and graphics

Page 5: MPEG-7 Multimedia Content Description Standard

Harmonization of Multimedia Metadata Standards

MPEG-7, SMEF, P/Meta, SMPTE, Dublin Core, TV-Anytime, Indecs

• MPEG-7: Moving Picture Experts Group– Infrastructure standard for Multimedia Metadata– Supports interpretation of the information’s

meaning– Supports broad range of applications

• SMEF – Standard Media Exchange Framework:

– BBC developed data models for information involved in the Production, Development, Use, and Management of media assets

• P/Meta – EBU P/Meta Project:– Exchange of program content between high-level

business functions of EBU members: Production, Delivery/Broadcast, & Archive

• SMPTE – Metadata dictionary & MXF:– Addresses Program Interchange independent of

format

• Dublin Core Metadata Initiative:– Interoperable online metadata standards

supporting broad range of purposes and business models.

• TV-Anytime – TV-Anytime Metadata:– Attractors/descriptors used e.g. in Electronic

Program Guides (EPG), or in Web pages to describe content.

• Indecs – Indecs Metadata Framework– An international initiative of rights owners

creating metadata standards for e-commerce.

P/Meta

SMEF

TVAnytime

SMPTE

MPEG-7Descriptors

DublinCore

MPEG-7Harmonized

Model

• MPEG-7 Harmonized Model:– Harmonized elements with other

standards and existing practices– Extensible framework– Registration authority for

classification schemes, controlled terms, ontologies

Page 6: MPEG-7 Multimedia Content Description Standard

MPEG-7 MDS: Free Text Annotation Example

• The following example gives an MPEG-7 description of a car that is depicted in an image:<Mpeg7> <Description xsi:type="SemanticDescriptionType"> <Semantics> <Label> <Name> Car </Name> </Label> <Definition> <FreeTextAnnotation> Four wheel motorized vehicle </FreeTextAnnotation> </Definition> <MediaOccurrence> <MediaLocator> <MediaUri> image.jpg </MediaUri> </MediaLocator> </MediaOccurrence> </Semantics> </Description></Mpeg7>

Page 7: MPEG-7 Multimedia Content Description Standard

MPEG-7 MDS: Event DS Example

• The following example gives an MPEG-7 description of the event of handshake between people:

<Mpeg7> <Description xsi:type="SemanticDescriptionType"> <Semantics> <Label> <Name> Shake hands </Name> </Label> <SemanticBase xsi:type="AgentObjectType" id="A"> <Label href="urn:example:acs"> <Name> Person A </Name> </Label> </SemanticBase> <SemanticBase xsi:type="AgentObjectType" id="B"> <Label href="urn:example:acs"> <Name> Person B </Name> </Label> </SemanticBase> <SemanticBase xsi:type="EventType"> <Label><Name> Handshake </Name></Label> <Definition> <FreeTextAnnotation> Clasping of right hands by two people </FreeTextAnnotation> </Definition> <Relation type="urn:mpeg:mpeg7:cs:SemanticRelationCS:2001:agent" target="#A"/> <Relation type="urn:mpeg:mpeg7:cs:SemanticRelationCS:2001:accompanier“ target="#B"/> </SemanticBase> </Semantics> </Description></Mpeg7>

Page 8: MPEG-7 Multimedia Content Description Standard

MPEG-7 Classification Schemes

MPEG-7 Standard Schema (Syntax)

MPEG-7 Description

Schemes (DS) & Descriptors

(D)

3rd Party MPEG-7

Extensions (DS & D)

MPEG-7 Registration

Authority (Terms)

MPEG-7 Classification Schemes (CS) & Controlled

Terms

3rd Party Classification

Schemes (CS) &

Controlled Terms

MPEG-7 Description

STANDARD

MPEG-7 SchemaExamples:• Creation DS• Agent DS• Semantic DS

MPEG-7 CSExamples:• Genre CS• Role CS• Format CS

EXTENSION

MPEG-7 ExtensionExamples:• Broadcast

DS• Rhythm DS• Graphics DS

MPEG-7 CSRegistrationExamples:• Sports CS• News CS• TGM-I CS

Page 9: MPEG-7 Multimedia Content Description Standard

Some additional slides not by John R. Smith

Page 10: MPEG-7 Multimedia Content Description Standard

MPEG-7 Terminology

• Data– Audiovisual information that will be described using MPEG-7

• Feature– A distinctive part or characteristic of data (ex. Color, shape, ...)

• Descriptor– Associates a representation value to one or more features.

• Description Scheme– Defines a structure and semantics of descriptors and their

relationships to model data content.

• Description Definition Language (DDL)– A language to specify Description Scheme

• Coded description– A representation of description allowing efficient storage and

transmission

Page 11: MPEG-7 Multimedia Content Description Standard

MPEG-7

Description Definition Language

(DDL)

MPEG-7

Description Scheme

Bitstream Syntax

Description Generation

MPEG-7 Description

MM content

EncoderMPEG-7

Coded Description

Descriptors

Decoder

Search/Query Engine

Filter Agents

User

DDL

DS

D DD D

DS

An abstract representation of possible applications using MPEG-7

Page 12: MPEG-7 Multimedia Content Description Standard

Components of MPEG-71) MPEG-7 Systems2) MPEG-7 Description Definition

Language 3) MPEG-7 Visual4) MPEG-7 Audio5) MPEG-7 Multimedia DSs6) MPEG-7 Reference Software7) MPEG-7 Conformance

Page 13: MPEG-7 Multimedia Content Description Standard

MPEG 7 – standard parts• MPEG-7 Systems - the binary format for encoding MPEG-7 descriptions and

the terminal architecture.• MPEG-7 Description Definition Language - the language for defining the

syntax of the MPEG-7 Description Tools and for defining new Description Schemes.

• MPEG-7 Visual – the Description Tools dealing with (only) Visual descriptions.• MPEG-7 Audio – the Description Tools dealing with (only) Audio descriptions.• MPEG-7 Multimedia Description Schemes - the Description Tools dealing

with generic features and multimedia descriptions.• MPEG-7 Reference Software - a software implementation of relevant parts

of the MPEG-7 Standard with normative status.

• MPEG-7 Conformance Testing - guidelines and procedures for testing conformance of MPEG-7 implementations (under development).

• MPEG-7 Extraction and use of descriptions – informative material about the extraction and use of some of the Description Tools (under development).

Page 14: MPEG-7 Multimedia Content Description Standard

MPEG 7 – main elements 1

Description definition Description definition languagelanguage

D1D1D2D2

D4D4

D5D5D6D6

D3D3

DS1DS1

DS2DS2

DS4DS4

DS3DS3

D1D1

D2D2 D3D3

D4D4 D5D5 D6D6

structuring

<scene id=1><time>...<camera>..<annotation>

</scene>

Encoding and Encoding and deliverydelivery

10110101101011010110

Definition Tags

Page 15: MPEG-7 Multimedia Content Description Standard

MPEG 7 – main elements 2• Descriptors (Ds):

Bounding Box, Editing Effect, Camera Motion,Object Trajectory

• Description Schemes (DSs): – Region and segment hierarchy, object and event hierarchy– Entity relation graph– Media, meta, syntactic, and semantic features

• Description Definition Language (DLL): – The XML Schema structural language components – The XML Schema datatype language components – The MPEG-7 specific extensions.

Page 16: MPEG-7 Multimedia Content Description Standard

Description Definition Language (DDL)

• “…a language that allows the creation of new Description Schemes and, possibly, Descriptors.”

• “It also allows the extension and modification of existing Description Schemes.”

MPEG-7 Requirement Documents V.13

Page 17: MPEG-7 Multimedia Content Description Standard

DDL (2)

• It is based on XML Schema Language• Consists of

– XML Schema Structural Components– XML Schema Data Types– MPEG-7 Specific Extensions

• Array and matrix data types• Multiple media types

– Audio, video, and audiovisual presentations• Enumerated data types for

– MimeType, CountryCode, RegionCode, CurrencyCode and CharacterSetCode

Page 18: MPEG-7 Multimedia Content Description Standard

DDL (3)

• A Simplified Example:

Page 19: MPEG-7 Multimedia Content Description Standard

CSS DDL representation syntax

<complexType name=”ContourShapeType” final=”#all”>

<complexContent>

<extension base=”mpeg7:VisualDType”>

<element name=”GlobalCurvatureVector”

type=”mpeg7:curvatureVectorType”/>

<element name=”PrototypeCurvatureVector”

type=”mpeg7:curvatureVectorType” minOccurs=”0”/>

<element name=”HighestPeak” type=”mpeg7:unsigned7”/>

<element name=”Peak” maxOccurs=”62”>

<complexType>

<element name=”xpeak” type=”mpeg7:unsigned6”/>

<element name=”ypeak” type=”mpeg7:unsigned3”/>

</complexType>

</element>

<attribute name=”numberOfPeaks” type=”mpeg7:unsigned6”/>

</extension>

</complexContent>

</complexType>

<simpleType name=”curvatureVectorType”>

<restriction base=”mpeg7:listOfUnsigned6Type”>

<length value=”2”/>

</restriction>

</simpleType>

Page 20: MPEG-7 Multimedia Content Description Standard

CSS Descriptor components semantics

numberOfPeaks

This attribute specifies the number of peaks in the CSS image used for the shape definition. For convex contours there are no peaks in the CSS image and this attribute takes value 0.

GlobalCurvatureVector

This element specifies global parameters of the contour, namely the Eccentricity and Circularity.

The calculation of Circularity is as follows:

etc.