MPEG 4
-
Upload
tvutech -
Category
Technology
-
view
1.807 -
download
0
description
Transcript of MPEG 4
![Page 1: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/1.jpg)
MPEG 4: The ultimate low bit rate format
MPEG 4: The ultimate low bit rate format
www.chiariglione.org/mpeg/
![Page 2: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/2.jpg)
Overview: ISO/IEC 14496 Coding of audio-visual objects Overview: ISO/IEC 14496 Coding of audio-visual objects • Low bit rate multimedia system -
typically less than MPEG 2 video• Object based - each element is coded
separately• Open ended system that can continue
to develop in the future• Scalable & Interactive
• Version 1: October 1998
• Low bit rate multimedia system - typically less than MPEG 2 video
• Object based - each element is coded separately
• Open ended system that can continue to develop in the future
• Scalable & Interactive
• Version 1: October 1998
![Page 3: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/3.jpg)
VersionsVersions
Version 2
Version 1
![Page 4: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/4.jpg)
ProfilesProfiles
QualityQuality
ComplexityComplexity
DVD
Video CD
Mobiles
MPEG-1
MPEG-2
HDTV
Digital cinema
Advanced Simple Profile
Simple Profile
MPEG 4
![Page 5: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/5.jpg)
Current uses:Current uses:
• 3G mobile phones,• Portable devices, PDAs, video iPod• Interactive television / IPTV• New interactive multimedia formats• Web pages• Interactive music format• Security systems
• 3G mobile phones,• Portable devices, PDAs, video iPod• Interactive television / IPTV• New interactive multimedia formats• Web pages• Interactive music format• Security systems
![Page 6: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/6.jpg)
Basics:Basics:
• Object based system: using Natural and/or Synthetic objects
• Makes use of local processing power to recreate sounds and images
• This makes it one of the most efficient compression systems
• Object based system: using Natural and/or Synthetic objects
• Makes use of local processing power to recreate sounds and images
• This makes it one of the most efficient compression systems
![Page 7: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/7.jpg)
Basics: Object TypesBasics: Object Types
• Photos - JPE, GIF, PNG,• Video - MPEG-2, Divx, AVI, H.264,
QuickTime• Speech - CELP, HVXC, Text to Speech• Music - AAC, MP3, surround• Synthetic music• Graphics - Java code• Text• Animated objects, e.g., talking heads
• Photos - JPE, GIF, PNG,• Video - MPEG-2, Divx, AVI, H.264,
QuickTime• Speech - CELP, HVXC, Text to Speech• Music - AAC, MP3, surround• Synthetic music• Graphics - Java code• Text• Animated objects, e.g., talking heads
![Page 8: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/8.jpg)
Basics:Basics:
• The selected objects are put together into a 2D or 3D scene
• In 3D the viewer can change the shape of the image and view it from other positions in the 3D space. Similar to VRML
• Each object is compressed using the best and most optimum method for that type of data
• The selected objects are put together into a 2D or 3D scene
• In 3D the viewer can change the shape of the image and view it from other positions in the 3D space. Similar to VRML
• Each object is compressed using the best and most optimum method for that type of data
![Page 9: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/9.jpg)
Basics: Virtual StudioBasics: Virtual Studio
• Virtual production techniques are increasingly used in TV production
• Well known chroma-key method using a blue/green screen background
• Actors are overlaid on to a ‘virtual studio’ background image
• The composition of the screen image and the sound can now take place in the decoder at home
• Virtual production techniques are increasingly used in TV production
• Well known chroma-key method using a blue/green screen background
• Actors are overlaid on to a ‘virtual studio’ background image
• The composition of the screen image and the sound can now take place in the decoder at home
![Page 10: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/10.jpg)
![Page 11: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/11.jpg)
BIFS:BIFS:
• Binary Format for Scene descriptor• New description language like HTML
but written in binary not in English • Has scalable levels for audio and video
which can be set by access rights or by interrogation of the receiver to set the best ‘Quality of Service’ (QoS)
• Binary Format for Scene descriptor• New description language like HTML
but written in binary not in English • Has scalable levels for audio and video
which can be set by access rights or by interrogation of the receiver to set the best ‘Quality of Service’ (QoS)
![Page 12: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/12.jpg)
Synchronised StreamingSynchronised Streaming
• Each element can be time stamped to synchronise with other objects in the frame
• Flexi Time: The viewer can vary the time for playback
• There are 3 types, set by the producer, minimum, maximum and optimal
• Audio can be set to change pitch or stay fixed
• Each element can be time stamped to synchronise with other objects in the frame
• Flexi Time: The viewer can vary the time for playback
• There are 3 types, set by the producer, minimum, maximum and optimal
• Audio can be set to change pitch or stay fixed
![Page 13: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/13.jpg)
Compression: SpeechCompression: Speech
• HVXC - Harmonic Vector Excitation Coding• CELP - Code Excited Linear Prediction• 2 - 24 K bit/sec• Synthesised speech: • Text to speech synthesis, 200-1200 bit/sec• Very low delay, 20 ms, for video phone use
MP3 takes too long to encode/decode
• HVXC - Harmonic Vector Excitation Coding• CELP - Code Excited Linear Prediction• 2 - 24 K bit/sec• Synthesised speech: • Text to speech synthesis, 200-1200 bit/sec• Very low delay, 20 ms, for video phone use
MP3 takes too long to encode/decode
![Page 14: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/14.jpg)
Compression: Natural AudioCompression: Natural Audio
• MPEG - AAC (Advanced Audio Coding)
• MP3, AAC, 5.1 surround
• 6 - 380 K bit/sec
• MPEG - AAC (Advanced Audio Coding)
• MP3, AAC, 5.1 surround
• 6 - 380 K bit/sec
![Page 15: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/15.jpg)
Parametric audio codingParametric audio coding• Synthesised audio: Spectral noise re-synthesised • This process separates unique audio sounds
from predictable noise shapes which can then be re-synthesised locally
• Signal is represented by three objects:– Transients: localized in time,– Sinusoids: localized in frequency,– Noise: no strict localization…
• Synthesised audio: Spectral noise re-synthesised • This process separates unique audio sounds
from predictable noise shapes which can then be re-synthesised locally
• Signal is represented by three objects:– Transients: localized in time,– Sinusoids: localized in frequency,– Noise: no strict localization…
![Page 16: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/16.jpg)
Parametric audio codingParametric audio coding
Transients: CastanetsTransients: Castanets
![Page 17: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/17.jpg)
Parametric audio codingParametric audio coding
Sinusoids: HarpsichordSinusoids: Harpsichord
![Page 18: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/18.jpg)
Parametric audio codingParametric audio coding
Noise: Heavy metalNoise: Heavy metal
![Page 19: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/19.jpg)
Compression: Structured Audio Compression: Structured Audio
SAOL - Structured Audio Orchestra Language (pronounced sail)
• Down loadable sound fonts• Wavetable synth + GM2 type spec.• Any kind of virtual instruments• Virtual effects algorithms and mixers• MIDI data rates e.g. 300 bit/sec
SAOL - Structured Audio Orchestra Language (pronounced sail)
• Down loadable sound fonts• Wavetable synth + GM2 type spec.• Any kind of virtual instruments• Virtual effects algorithms and mixers• MIDI data rates e.g. 300 bit/sec
![Page 20: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/20.jpg)
Interactive Audio Interactive Audio
• Download and remix tracks• Flash interface and compressed
audio loops
• Download and remix tracks• Flash interface and compressed
audio loops
www.yourspins.com
![Page 21: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/21.jpg)
Compression: VideoCompression: Video
• Overall MPEG-4 supports a wide range of standards from very small, poor quality, pictures up to HDTV
• MPEG 1 & 2• MPEG 4 - part 2• MPEG 4 - part 10, 'Advanced Video Coding’
AVC H.264 - high quality video codec• Developed jointly by MPEG and the ITU
• Overall MPEG-4 supports a wide range of standards from very small, poor quality, pictures up to HDTV
• MPEG 1 & 2• MPEG 4 - part 2• MPEG 4 - part 10, 'Advanced Video Coding’
AVC H.264 - high quality video codec• Developed jointly by MPEG and the ITU
![Page 22: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/22.jpg)
Compression: Video - H.264Compression: Video - H.264
• Half to one quarter of the normal bit rate of MPEG-2
• Scalable from 3G to HD• More advanced B frame operation where
the frame can link to any frame in the video sequence
• Smaller 4X4 grids of pixels• 4 Profiles and 16 Levels• Bit rates from 64 Kbps - 240 Mbps
• Half to one quarter of the normal bit rate of MPEG-2
• Scalable from 3G to HD• More advanced B frame operation where
the frame can link to any frame in the video sequence
• Smaller 4X4 grids of pixels• 4 Profiles and 16 Levels• Bit rates from 64 Kbps - 240 Mbps
![Page 23: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/23.jpg)
Compression: VideoCompression: Video
• Mixture of pixel based and vector graphics
• Video is no longer a rectangular shape, it can be any shape
• Synthetic images with bit rates from 5 kbit/sec - 10 Mbit/sec
• Supports the mapping of video textures onto moving objects and meshes
• Mixture of pixel based and vector graphics
• Video is no longer a rectangular shape, it can be any shape
• Synthetic images with bit rates from 5 kbit/sec - 10 Mbit/sec
• Supports the mapping of video textures onto moving objects and meshes
![Page 24: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/24.jpg)
2D mesh model of a fish2D mesh model of a fish
By deforming the mesh the fish can be animated
By deforming the mesh the fish can be animated
![Page 25: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/25.jpg)
Animated Objects:Animated Objects:
• The Animation Framework eXtension, AFX• Face animation: The face models are not
part of MPEG-4 only the movement codes for the expressions, eye movement, etc.
• Body animation works in the same way and can be used in games
• The Animation Framework eXtension, AFX• Face animation: The face models are not
part of MPEG-4 only the movement codes for the expressions, eye movement, etc.
• Body animation works in the same way and can be used in games
![Page 26: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/26.jpg)
Future Options:Future Options:
• MPEG-4 is still being developed and all new parts will work with the old formats
• Studio quality versions for HDTV• Digital cinema 45 - 240 Mbit/sec H.264• Home video cameras with MPEG-4
output straight to the web from the hard drive
• MPEG-4 is still being developed and all new parts will work with the old formats
• Studio quality versions for HDTV• Digital cinema 45 - 240 Mbit/sec H.264• Home video cameras with MPEG-4
output straight to the web from the hard drive
![Page 27: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/27.jpg)
Future Options:Future Options:
• Integrated Service Digital Broadcast (ISDB)• Newspaper + TV + data• Integration with MPEG 7 databases• Games with 3D texture mapping
• Integrated Service Digital Broadcast (ISDB)• Newspaper + TV + data• Integration with MPEG 7 databases• Games with 3D texture mapping
![Page 28: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/28.jpg)
Future Options:Future Options:
• TeleVision Modelling Language (TVML)• Computer generated TV programs +
presenters - Max Headroom??
• TeleVision Modelling Language (TVML)• Computer generated TV programs +
presenters - Max Headroom??
![Page 29: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/29.jpg)
Future Options:Future Options:
• Information booths• Talking objects - fridge, cars, toaster?• Security cameras over the web• Interactive manuals and training materials• New downloadable interactive music format,
SAOL
• Information booths• Talking objects - fridge, cars, toaster?• Security cameras over the web• Interactive manuals and training materials• New downloadable interactive music format,
SAOL
![Page 30: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/30.jpg)
MPEG 7MPEG 7
![Page 31: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/31.jpg)
MPEG 7MPEG 7Multimedia Content Descriptor Standard
Database system to automatically define, organise and search for text, pictures, sound FX, graphics, video clips, songs, music, etc.
• On-line Music library• Automatic identification of music• Uses XML to store metadata
Multimedia Content Descriptor Standard
Database system to automatically define, organise and search for text, pictures, sound FX, graphics, video clips, songs, music, etc.
• On-line Music library• Automatic identification of music• Uses XML to store metadata
![Page 32: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/32.jpg)
MPEG 7MPEG 7Proposed uses:• Live broadcast monitoring, radio output• Digital libraries e.g., image catalogue,
musical dictionary, bio-medical imaging, sound FX, film, video and radio archives
• Cultural services history museums, art galleries, etc.
Proposed uses:• Live broadcast monitoring, radio output• Digital libraries e.g., image catalogue,
musical dictionary, bio-medical imaging, sound FX, film, video and radio archives
• Cultural services history museums, art galleries, etc.
![Page 33: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/33.jpg)
MPEG 7MPEG 7• Home entertainment e.g., systems for the
management of personal multimedia collections, e.g. music, home video, searching a game, karaoke
• E-Commerce e.g., personalised advertising, on-line catalogues, directories of e-shops
• Education e.g., repositories of multimedia courses, multimedia search for support material
• Home entertainment e.g., systems for the management of personal multimedia collections, e.g. music, home video, searching a game, karaoke
• E-Commerce e.g., personalised advertising, on-line catalogues, directories of e-shops
• Education e.g., repositories of multimedia courses, multimedia search for support material
![Page 34: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/34.jpg)
MPEG 7MPEG 7
• Investigation services e.g., human characteristics recognition, forensics
• Journalism e.g. searching speeches of a certain politician using their name, voice or face
• Multimedia directory services e.g. Yellow Pages, tourist information, geographical information systems
• Investigation services e.g., human characteristics recognition, forensics
• Journalism e.g. searching speeches of a certain politician using their name, voice or face
• Multimedia directory services e.g. Yellow Pages, tourist information, geographical information systems
![Page 35: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/35.jpg)
MPEG 7MPEG 7
• Multimedia editing e.g., personalised electronic news service, media authoring
• Social e.g. on-line dating services
• Surveillance e.g., traffic control
• Multimedia editing e.g., personalised electronic news service, media authoring
• Social e.g. on-line dating services
• Surveillance e.g., traffic control
http://www.eptascape.com/products/demoflv.htm
![Page 36: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/36.jpg)
MPEG 21MPEG 21
![Page 37: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/37.jpg)
MPEG 21MPEG 21
• An infrastructure for the delivery and consumption of multimedia content
• Users are seen as either creators, consumers, rights holders, content providers, or distributors
• An infrastructure for the delivery and consumption of multimedia content
• Users are seen as either creators, consumers, rights holders, content providers, or distributors
![Page 38: MPEG 4](https://reader035.fdocuments.net/reader035/viewer/2022081421/5557a698d8b42a696c8b47b9/html5/thumbnails/38.jpg)
MPEG 21MPEG 21
• Every media element is defined as a ‘Digital Item’
• Metadata defines what media we can use, what we can do with it and who owns it
• Designed to work with MPEG 4 files and MPEG 7 database
• Every media element is defined as a ‘Digital Item’
• Metadata defines what media we can use, what we can do with it and who owns it
• Designed to work with MPEG 4 files and MPEG 7 database