Overview of technologies for immersive visual experiences ......Basic representations used by MPEG...

25
Overview of technologies for immersive visual experiences: capture, processing, compression, standardization and display 1 Poznań University of Technology Chair of Multimedia Telecommunications and Microelectronics Poznań, Poland Marek Domański

Transcript of Overview of technologies for immersive visual experiences ......Basic representations used by MPEG...

Page 1: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Overview of technologies for immersive visual experiences:

capture, processing, compression, standardization and display

1

Poznań University of TechnologyChair of Multimedia Telecommunications and Microelectronics

Poznań, Poland

Marek Domański

Page 2: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Immersive visual experiences• Arbitrary direction of viewing

System : 3DoF

• Arbitrary location of viewer

- virtual navigation- free-viewpoint television

• BothSystem : 6DoF Virtual viewer

2

Page 3: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Immersive video content Computer-generated

Natural content Multiple camera around a scene

also depth cameras, light-field cameras Camera(s) located in a center of a scene

360-degree cameras Mixed

e.g.: wearable cameras

3

Page 4: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Video captureCommon technical problems

• Synchronization of camerasCamera hardware needs to enable synchronization

Shutter release error < Exposition interval

• Frame rateHigh frame rate needed – Head Mounted Devices

4

Page 5: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Common task for immersive video capture:

CalibrationCamera parameter estimation:

• Intrinsic – the parameters of individual cameras –

remain unchanged by camera motion

• Extrinsic – related to camera locations in real word 

– do change by camera motion (even slight !)

Out of scope of standardization

• Improved methods developed5

Page 6: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Depth estimation

• By video analysis – from at least 2 video sequences

– Computationally heavy

– Huge progress recently 

• By depth cameras – diverse products

– Infrared illuminate of the scene

– Limited resolution mostly

Out of scope of standardization6

Page 7: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

MPEG Test video sequences

• Large collection of video sequences with depth information

• Multiview, omnidirectional, lightfield, volumetric, etc.

• Used for experiments in standardization and algorithm testing

7

Page 8: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Immersive video pathCONTENT ACQUISITION

REPRESENTATION SERVER

EDGE SERVER

USER DEVICE

Video cameras, depth cameras, omnidirectional cameras, lightfield cameras

System calibration, video correction, depth estimation, scene representation , …

Data buffering and restorationBitstream personalizationRendering (partially?)

Viewport selection, Rendering (partially?),Display

Standardcompression

Standardcompression

Immersive videocompression

Immersive videocompression

8

Page 9: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Immersive video pathCONTENT ACQUISITION

REPRESENTATION SERVER

EDGE SERVER

USER DEVICE

Video cameras, depth cameras, omnidirectional cameras, lightfield cameras

System calibration, video correction, depth estimation, scene representation , …

Data buffering and restorationBitstream personalizationRendering (partially?)

Viewport selection, Rendering (partially?),Display

Standardcompression

Standardcompression

Immersive videocompression

Immersive videocompression

9

Page 10: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

•Multiview plus depth

View 1 Depth for view 1 View N Depth for view N

Time…

Scene representations Basic representations used by MPEG

10

Page 11: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

•360‐degree  +  360‐degree depth

Scene representationsBasic representations used by MPEG

11

Page 12: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Scene representationsBasic representations used by MPEG

•Point clouds

12

Page 13: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Immersive video compression

Available techniques+ standards under development 

mostly try to exploitgeneral compression techniques

13

Page 14: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Immersive video pathCONTENT ACQUISITION

REPRESENTATION SERVER

EDGE SERVER

USER DEVICE

Video cameras, depth cameras, omnidirectional cameras, lightfield cameras

System calibration, video correction, depth estimation, scene representation , …

Data buffering and restorationBitstream personalizationRendering (partially?)

Viewport selection, Rendering (partially?),Display

Standardcompression

Standardcompression

Immersive videocompression

Immersive videocompression

14

Page 15: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

AVC (H.264, MPEG-4 Part 10)Advanced Video Coding

HEVC (H.265, MPEG-H Part 2)High Efficiency Video Coding

1994 2003 2012 2021 Year

16

8

4

2

MPEG-2 (H.262)

Approximate bitrate for HD(1920×1080, 25 frame/s) [Mb/s]

VVCVersatile Video Coding(H.266, MPEG-I part 3)

2020? 15

Page 16: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Compressed video bitrates

16

MbpsVAB A – technology coefficient,

A = 4 for MPEG-2, A = 1 for HEVC,

A = 2 for AVC, A = 0.5 for VVC (expect).

V – format coefficient,

V=1 for SDTV (720×576, 25i),

V=4 for HDTV (1920×1080, 25i),

V=16 for UHDTV (3840×2160, 50p),

Page 17: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

AVC (H.264, MPEG-4 Part 10)Advanced Video Coding

HEVC (H.265, MPEG-H Part 2)High Efficiency Video Coding

17

1994 2003 2012 2021 Year

16

8

4

2

MPEG-2 (H.262)

Approximate bitrate for HD(1920×1080, 25 frame/s) [Mb/s]

VVCVersatile Video Coding(H.266, MPEG-I part 3)

This step costedseveral thousandsof man‐year labor !

2020?

Page 18: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Immersive video compression

Enormous cost of development of new video technology from scratch

18

Page 19: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Compression of multiview plus depth video

View 1 Depth for view 1

View N Depth for view N

Time…

Coding exploits inter-view redundancy : Multiview extensions of video coding standards

MPEG-2, AVC, HEVC (standardized)Additionally exploits depth information:3D extensions of video coding standards

AVC, HEVC (standardized)19

Page 20: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Compression of multiview plus depth video

View 1 Depth for view 1

View N Depth for view N

Time…

Efficiency of Multiview/3D coding

10 – 50 % bitrate reduction vs. simulcast

Efficient for dense camera locations on a line

20

Page 21: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Immersive video compression360-degree

video

2D video

En

co

di

ng

IMMERSIVE VIDEO

ENCODER

2D v

ideo HEVC or

MV‐HEVCVVCencoder

HEVC orMV‐HEVCVVCencoder B

it

st

re

am

sParameters Parameters

HEVC orMV‐HEVCVVCdecoder

HEVC orMV‐HEVCVVCdecoder

De

co

di

ng

IMMERSIVE VIDEO

DECODER

2D v

ideo

2D v

ideo

2D v

ideo... ...

360-degreevideo

2D video

+ depth+ camera

parameters

+ depth+ camera

parameters

21

Page 22: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Video examplesInputs to HEVC encoders

22

Page 23: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Similar approach

Video‐based Point Cloud

Audio:MPEG surroundMPEG-H 3D Audio

23

Page 24: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Edge server – user devicelink

24

Page 25: Overview of technologies for immersive visual experiences ......Basic representations used by MPEG 10 •360‐degree + 360‐degree depth Scenerepresentations Basic representationsusedby

Thank you

25