End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with...

35
End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia Institute of Technology

Transcript of End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with...

Page 1: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

End-to-End Estimation of Available Bandwidth

Variation Range

Constantine DovrolisJoint work with Manish Jain & Ravi Prasad

College of ComputingGeorgia Institute of Technology

Page 2: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Probing the Internet

Several network parameters are important for applications and transport protocols: Delay, loss rate, capacity, congestion, load, etc

Internet routers do not provide direct feedback to end-hosts Due to scalability, simplicity & administrative issues Except SNMP, ICMP

Alternatively: Infer network state through end-to-end measurements

Page 3: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

End-to-end bandwidth estimation

“Bandwidth” in data networks refers to throughput (bits/sec) Capacity: maximum possible throughput w/o cross traffic Available bandwidth (or residual capacity): capacity – cross traffic

Bandwidth estimation: measurement techniques & statistical analysis to infer bandwidth-related metrics of individual links and end-to-end network paths

Objectives: Accuracy: application-specific but typically within 10-20% Estimation latency: within a few seconds Non-intrusiveness: cross traffic should not be affected Scalability: important when monitoring many paths (not covered

in this talk)

Page 4: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Why to measure bandwidth? Large TCP transfers and congestion control

Bandwidth-delay product estimation TCP socket buffer sizing

Streaming multimedia Adjust encoding rate based on avail-bw

Intelligent routing systems Overlay networks and p2p networks Intelligent routing control & multihoming

Content Distribution Networks (CDNs) Choose server based on least-loaded path

SLA verification & interdomain problem diagnosis Monitor path load and allocated capacity

End-to-end admission control Network spectroscopy Several more..

Page 5: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Definitions and problem statement

Page 6: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Capacity Maximum possible end-to-end throughput at IP layer

In the absence of any cross traffic For maximum-sized packets

If Ci is capacity of link i, end-to-end capacity C defined as:

Capacity determined by narrow link

iHi

CC,...,1

min

Page 7: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Average available bandwidth Per-hop average avail-bw:

Ai = Ci (1-ui)

ui: average utilization A.k.a. residual capacity

End-to-end avg avail-bw A:

Determined by tight link ISPs measure average per-

hop avail-bw passively 5-min averaging intervals

iHi

AA,...,1

min

Page 8: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Avail-bw variability Avail-bw has significant

variability Variability depends on

averaging timescale Larger timescale, lower

variance Variation range:

Range between, say, 10th to 90th percentiles

Example: Path-1: variation range

[10Mbps, 90Mbps] Path-2: variation range

[20Mbps, 20Mbps] Which path would you

prefer?

Page 9: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

The avail-bw as a random process

Instantaneous utilization ui(t): either 0 or 1 Link utilization in (t, t+)

Averaging timescale: Available bandwidth in (t, t+)

End-to-end available bandwidth in (t, t+)

t

t

ii dttuttu )(1

),(

iii CttuttA )],(1[),(

)},({min),(,...,1

ttAttA iHi

Page 10: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Problem statement

Avail-bw random process, measured in timescale A(t)

Assuming stationarity, marginal distribution of A:

F(R) = Prob [A ≤ R]

Ap :pth percentile of A, such that p = F(Ap)

Objective: Estimate variation range [AL, AH] for given averaging timescale ALand AH are pL and pH percentiles of A

Typically, pL =0.10 and pH =0.90

Page 11: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Probing methodology

Page 12: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Probing a network path

Sender transmits periodic packet stream of rate R K packets, packet size L, interarrival T = L/R

Receiver measures One-Way Delay (OWD) for each packet D(k) = tarv(k) - tsnd(k) OWD variations: Δ(k) = D(k+1) – D(k)

Independent of clock offset between sender/receiver

With stationary & fluid-modeled cross traffic: If R > A, then Δ(k) > 0 for all k Else, Δ(k) = 0 for all k

Page 13: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Self-loading periodic streams

Increasing OWDs means R>A Non-increasing OWDs means R<A

Page 14: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Example of OWD variations 12-hop path from U-Delaware to U-Oregon

K=100 packets, A=74Mbps, T=100μsec Rleft = 97Mbps, Rright=34Mbps

Page 15: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Percentile sampling&

estimation algorithms

Page 16: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Percentile sampling

Given R and estimate F(R) F(R) is also referred to as the rank of rate R

Assume that F(R) is inversible

Sender transmits a periodic packet stream of rate R Length of stream: measurement timescale

Receiver classifies the stream, based on measured one-way delay trends, as: Type-G if A ≤ R:

I(R)= 1 with probability F(R) Type-L if A > R:

I(R)= 0 with probability 1-F(R)

Page 17: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Percentile sampling (cont’) Send N packet streams, and classify each packet

stream as Type-G if A ≤ R:

I(R)= 1 with probability F(R) Type-L if A > R:

I(R)= 0 with probability 1-F(R) Number of type-G streams:

Unbiased estimator for the rank of rate R:

N

NRIRF

RFNRIENNRIE

RINRI

i

N

ii

),()(

)()(),(

)(),(1

Page 18: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

How many streams do we need?

Larger N longer estimation duration

Smaller N larger variance in estimator I(R,N)/N

Choose N so that: I(R,N)/N within F(R) ± maximum percentile

error

P[N(p-) < I(R,N) < N(p+)] > 1- where p= F(R) and small I(R,N) ~ Binomial (N, p)

assuming independent sampling

With N=40-50 streams, the maximum percentile error for 10th-90th variation range is about 0.05

Page 19: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Non-parametric estimation

It does not assume specific avail-bw distribution Iterative algorithm

Stationarity requirement across iterations N-th iteration: probing rate Rn

Use percentile sampling to estimate percentile rank of Rn

To estimate the upper percentile AH with pH = F(AH): fn = I(Rn,N)/N If fn is between pH±report AH = Rn

Otherwise, If fn > pH + , set Rn+1 < Rn

If fn < pH - , set Rn+1 > Rn

Similarly, estimate the lower percentile AL

Page 20: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Non-parametric algorithm

•Parameter b • Upper bound on rate variation in successive iterations• Tradeoff between accuracy and responsiveness• Larger b:

• Faster convergence• Larger oscillations

Page 21: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Validation example (non-parametric)

Testbed experiments using real Internet traffic traces

b=0.05 b=0.15

Non-parametric estimator tracks variation range within 10-20% Optimal selection of b depends on traffic

Traffic spikes/dips may not be detected if b is too small But larger b causes larger MSRE

Page 22: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Parametric estimation Assume Gaussian avail-bw distribution

Justified assumption for large degree of traffic multiplexing And/or for long averaging timescale (>200msec)

Gaussian distribution completely specified by Mean and standard deviation

pth percentile of Gaussian distribution Ap = + -1(p)

Sender transmits N probing streams of rates R1 and R2 Receiver determines percentiles ranks corresponding to R1 and R2 and can be then estimated by solving

R1 = + -1(p1) R2 = + -1(p2)

Variation range is then calculated from: AH = + -1(pH) AL = + -1(pL)

Page 23: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Parametric algorithm

• Variation range estimate

• Non-iterative algorithm• More appropriate under non-stationary conditions

• Probing rates do not need to follow variation range

• Less intrusive probing

Page 24: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Validation example (parametric)

Gaussian traffic non-Gaussian traffic

Parametric algorithm is more accurate than non-parametric algorithm, when

traffic is good match to Gaussian model in non-stationary conditions

Page 25: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Comparison of the two algorithms

Non-parametric algorithm Stationarity assumption is more critical (iterative algorithm) Can be used with any cross traffic distribution

Parametric algorithm Provides variation range estimate at end of each round Accurate when underlying traffic close to Gaussian

Non-parametric: = 40msec Parametric: = 250msec

Page 26: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Avail-bw variability factors

Page 27: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

A sample measurement from the Internet

Path from Georgia Tech to University of Ioannina, Greece Average avail-bw increases over this 2-hour period Variation range decreases as average avail-bw increases

Page 28: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Objectives and methodology Examine effect of following factors on avail-bw

variability:1. Load at tight link2. Degree of multiplexing at tight link3. Averaging time scale

Single-hop simulation topology with TCP traffic Monitore load at tight link

Examine variation range width V

V = AH - A

L

Compare V with Coefficient of Variation (CoV) CoV : standard deviation (at time scale

over average avail-bw V : Absolute variability metric CoV : Relative variability metric

Page 29: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Tight Link Utilization

Variation range width V shows non-monotonic behavior V increases in low/medium load, due to increasing variance in input traffic (tight link rarely saturated) V decreases in heavy load due to “clamping” by tight link capacity

CoV increases monotonically with load

Page 30: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Statistical Multiplexing

Conventional wisdom:

Keeping the load constant, higher degree of multiplexing makes the traffic smoother

Two models for increasing degree of multiplexing1. Capacity Scaling

1. Increase capacity of tight link and proportionally increase number of flows

2. Average flow rate remains constant2. Flow Scaling

1. Increase number of flows and proportionally decrease average flow rate

2. Capacity of tight link remains constant

Page 31: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Capacity Scaling

Variation range width V increases with capacity scaling CoV decreases with capacity scaling Conventional wisdom true for relative variability (CoV) but not for absolute variation range (V)

Page 32: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Flow Scaling

Variation range decreases in both absolute and relative terms

Page 33: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Measurement Timescale

Avail-bw variability decreases with averaging time scale

Rate of decrease depends on correlation structure of avail-bw process Observed decrease rate consistent with scaling process in the 50-500ms (Hurst parameter=0.7)

Page 34: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Summary and future work

Page 35: End-to-End Estimation of Available Bandwidth Variation Range Constantine Dovrolis Joint work with Manish Jain & Ravi Prasad College of Computing Georgia.

Future work

Applications of bandwidth estimation: Overlay routing and multihoming: path selection

algorithms, avoidance of oscillations, provisioning Interdomain performance problem diagnosis TCP throughput prediction (see ACM Sigcomm’05)

Internet traffic analysis: Use of bw-estimation to explain traffic burstiness in

short time scales (see ACM Sigmetrics’05) Examine validity of single-bottleneck assumption Examine congestion responsiveness of Internet traffic

New estimation problems: Detect maximum possible shared available bandwidth

among set of network paths