Download - Mm presentation bkk

Transcript
Page 1: Mm presentation bkk

MODELING THE QOE OF

RATE CHANGES IN

SKYPE/SILK VOIP CALLS

CHIEN-NAN CHEN CING-YU CHU

SU-LING YEH

HAO-HUA CHU

POLLY HUANG

UNIVERSITY OF

ILLINOIS, URBANA-

CHAMPAIGN

NATIONAL TAIWAN

UNIVERSITY

1

Page 2: Mm presentation bkk

OUTLINE

• Motivation

• Preliminary Experiment

• Proposed Model

• Large-Scale Experiment

• Evaluation

• Conclusion

2

Page 3: Mm presentation bkk

VOICE OVER IP

Internet

Bandwidth FluctuationPacket LossDelay Jitter

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

3

Page 4: Mm presentation bkk

RATE ADAPTATION

• Available bandwidth

Ramping up the sending rate

• Available bandwidth

Tuning down the sending rate

Is the quality improved proportionally?

Rate change Disturbing users?

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

4

Page 5: Mm presentation bkk

GOAL

• Investigating the relationship of

Sending rate vs. Perceived quality

• To explore the influence of

Rate change magnitude/frequency

• Methodology

• Synthesized VoIP calls

• User study experiments

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

5

Page 6: Mm presentation bkk

CONTRIBUTION

• Sending bitrate vs. user perception

Logarithmic Relationship

• Frequency of rate change

Logarithmic Relationship

• Magnitude of rate change

Complicated, but Interesting

• Closed-form models to predict user

perception under bandwidth fluctuation

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

6

Page 7: Mm presentation bkk

PRELIMINARY EXPERIMENT

• To confirm the influence of

• sending bitrate

• rate change magnitude

• rate change frequency

• 5-level MOS (Mean Opinion Score)

• 14 participants

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

7

Page 8: Mm presentation bkk

AUDIO TRACK PRODUCTION

• Skype/SILK audio codec

• 30s audio track

• sentences without contextual connection

• Fixed-rate tracks

• Variable-rate tracks Bitrate (kbps)

5.6 9.5 13.3 17.2 21.1 25.0 28.9 32.8 36.6 40.6

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

8

Page 9: Mm presentation bkk

RESULT

FIXED-RATE

• MOS vs. sending bitrate

User Variation

Logarithmic

Trend

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

9

Page 10: Mm presentation bkk

RESULT

VARIABLE-RATE

• MOS - ΔT plot

Rate change

matters!

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

10

Page 11: Mm presentation bkk

EFFECT OF RATE CHANGE

FREQUENCY

• When ΔT varies…

Logarithmic

Trend

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

11

Page 12: Mm presentation bkk

EFFECT OF RATE CHANGE

MAGNITUDE

• When sharing the same average bitrate…

Magnitude

MOS

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

12

Page 13: Mm presentation bkk

EFFECT OF RATE CHANGE

MAGNITUDE

• However, with the same magnitude…

Higher (hr + lr)

Lower (hr + lr)

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

13

Page 14: Mm presentation bkk

SHORT SUMMARY

• Fixed-rate

• MOS – bitrate logarithmic

• Variable-rate

• MOS – ΔT logarithmic

• MOS – (hr, lr)

• hr - lr up MOS down

• hr + lr up MOS up

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

14

Page 15: Mm presentation bkk

PROPOSED MODELS

• Fixed-rate model

• Variable-rate model

Massive Data Numerical Fitting

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

15

Page 16: Mm presentation bkk

LARGE-SCALE EXPERIMENT

• Same methodology

• 127 participants

• Each track is scored by 30 participants

• Rate selection

Bitrate (kbps)40.627.719.414.1

5.6 6.1 7.1 8.5

10.7

r1r2r3r4r5r9 r8 r7 r6

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

16

Page 17: Mm presentation bkk

SIGNIFICANCE OF FACTORS

• ANOVA tests

• MOS – sending bitrate

Significant

• Interaction between ΔT and (hr, lr)

Significant

• MOS - ΔT

Test p-value Test p-value Test p-value

r1r2 .31 r6r7 .31 r7r8 .26

r3r4 .42 r6r8 .11 r7r9 .34

r4r5 .31 r6r9 .09 r8r9 .32

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

17

Page 18: Mm presentation bkk

MODEL SPECIFICS

FIXED-RATE MODEL

• α=4.091, β=1.515, and γ=1.000

• with R-square = 0.96

Lower bound of user perception (?)

close to the lowest bitrate of SILK

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

18

Page 19: Mm presentation bkk

MODEL SPECIFICS

VARIABLE-RATE MODEL

• Logarithmic regression on each (hr, lr)

pair

(r1, r2): p12 x ln(ΔT) + q12

(r1, r3): p13 x ln(ΔT) + q13

(r1, r4): p14 x ln(ΔT) + q14

(r1, r5): p15 x ln(ΔT) + q15

:

:

:

SCALE() SHIFT()

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

19

Page 20: Mm presentation bkk

MODEL SPECIFICS

SCALE()

• Polynomial regression

• x = hr – lr , y = hr + lr

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

20

Page 21: Mm presentation bkk

MODEL SPECIFICS

SHIFT()

• Independent to ΔT

• Basic idea

• ΔT approaches the track duration

• Fluctuation diminishes

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

21

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

Page 22: Mm presentation bkk

EVALUATION

GOODNESS OF FIT

• Training data

• R-square = 0.86

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

22

Page 23: Mm presentation bkk

EVALUATION

ACCURACY OF PREDICTION

• 2 dataset independent to training data

• Dataset I: Preliminary experiment

• Dataset II: Additional (New) experiment

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

23

Page 24: Mm presentation bkk

PESQ

• Perceptual Evaluation of Speech Quality

• Limited spectrum

• Narrow-band: 8k Hz

• Wide-band: 16k Hz

(SILK: 8k, 12k, 16k and 24 k Hz)

• Requires both original and degraded

audio files

24

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

Page 25: Mm presentation bkk

COMPARISON WITH

PESQ – FIXED RATE

25

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

model Proposed PESQ

R-square 0.9601 0.7841

model Proposed PESQ

Avg. Err. Ratio 3.68% 14.59%

Page 26: Mm presentation bkk

COMPARISON WITH

PESQ – VARIABLE RATE

26

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

model Proposed PESQ

R-square 0.2512 -0.3491

model Proposed PESQ

Avg. Err. Ratio 8.03% 12.60%

Page 27: Mm presentation bkk

COMPARISON ON

AMR-WB

• AMR-WB audio codec

• Older Codec

• Widely used in 3G network

• 9 difference coding bitrates

• User study experiment

• Same methodology

• 14 participants

27

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

Page 28: Mm presentation bkk

COMPARISON ON

AMR-WB

28

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

model Proposed PESQ

R-square 0.7878 0.6289

model Proposed PESQ

Avg. Err. Ratio 2.18% 2.86%

Proposed

PESQ

Proposed

PESQ

Page 29: Mm presentation bkk

CONCLUSION

• The logarithmic relationship (Weber-Fechner

Law) is observed in the MOS-bitrate relation-

ship of Skype/SILK

• Rate change frequency (W-F Law) and

magnitude (complicated) have significant

influence on perceived quality

• We have established both fixed- (SIGCOMM’12

W-MUST) and variable-rate models

• User-centric rate adaptation for VoIP

applications (coming next)

MotivationPre. Exp.Proposed ModelLarge Exp.EvaluationConclusion

29

Page 30: Mm presentation bkk

Q&A

30

Page 31: Mm presentation bkk

MODEL SPECIFICS

SHIFT()

• Dominant quality

• Expected quality when fluctuation diminishes

• ΔT approaches the track lengthdominant

quality

ΔT (second) 31

Page 32: Mm presentation bkk

MODEL SPECIFICS

SHIFT()

• Dominant quality: D()

hr = 14.1 kbps

D() = MOSh

32

Page 33: Mm presentation bkk

MODEL SPECIFICS

SHIFT()

• hr > 14.1 kbps

• normalized y-axis

33

Page 34: Mm presentation bkk

MODEL SPECIFICS

SHIFT()

• Linear to the MOS difference (hr > 14.1 kbps)

34