Ph.D. Dissertation presented by Xiuduan Fang Department of Computer Science University of Virginia...
-
Upload
victoria-evans -
Category
Documents
-
view
215 -
download
1
Transcript of Ph.D. Dissertation presented by Xiuduan Fang Department of Computer Science University of Virginia...
Ph.D. Dissertation presented by
Xiuduan FangDepartment of Computer Science
University of VirginiaSeptember 19, 2008
On Using Circuit-switched Networks for File Transfers
Outline
• Overview Hypothesis Contributions & Publications
• Motivation• Theoretical component:
Design and evaluate algorithms to support file transfers on circuit-switched networks
• Experimental component: Implement and demonstrate architecture for
internetworking circuit-switched networks with the Internet
• Conclusions & Future work2
Hypothesis
Circuit-switched networks, with dynamic call-by-call bandwidth sharing and support for heterogeneous-rate circuits, can be used efficiently to support file transfers, and can be evolved gradually into the existing Internet.
3
Yes
end-to-end circuits?
No
Theoretical component
Dissertation organization
Internetworking architectureCall-admission control (CAC):
rate allocation
minimum file size
Experimental component
Key Contributions
• File transfers on a hybrid architecture Constructed analytical models Provided insights on how to design admission control Proposed a novel heterogeneous rate-allocation
scheme to lower file-transfer delay
• Internetworking architecture Designed and implemented a gateway to interconnect
circuit networks with the Internet Characterized the gateway performance
4
Publications
• Ph.D. dissertation: X. Fang and M. Veeraraghavan, On using circuit-switched
networks for file transfers,” accepted to IEEE Globecom, New Orleans, LA, Nov. 2008.
X. Fang, M. Veeraraghavan, M. E. McGinley, and R. W. Gisiger, “An overlay approach for enabling access to dynamically shared backbone GMPLS networks,” in Proc. of IEEE ICCCN2007, Honolulu, Hawaii, Aug. 2007.
X. Fang and M. Veeraraghavan, “On using a hybrid architecture for file transfers,” Submitted to IEEE Transactions on Parallel and Distributed Systems, 2008.
• MS thesis: M. Veeraraghavan, X. Fang, and X. Zheng, “On the suitability
of applications for GMPLS networks,” in Proc. of IEEE Globecom, San Francisco, CA, Nov. 2006.
X. Fang, X. Zheng, and M. Veeraraghavan, “Improving web performance through new networking technologies,” IEEE ICIW'06, Guadeloupe, French Caribbean, February 23-25, 2006.
5
Outline
• Overview Hypothesis Contributions & Publications
Motivation• Theoretical component:
Design and evaluate algorithms to support file transfers on circuit-switched networks
• Experimental component: Implement and demonstrate architecture for
internetworking circuit-switched networks with the Internet
• Conclusions & Future work6
Motivation
• Why File Transfers on Circuit Networks? Packet switching is considered better than circuit
switching for file transfers Pros: high throughput under light loads Cons:
– Unpredictable delays– Proportional fairness but no temporal fairness
eScience community is using high-speed circuit-switched networks for very large file transfers
Predictable service time (admission control) Temporal fairness: give deference to job seniority
7
Experimental component:Interconnect circuit networks
with the Internet
Call blocking circuit network
Homogeneous Heterogeneous
rate allocation
Designed a gateway
Implemented software
Characterized performance
Analytical model
Analytical model
Simulation model
Analytical model
Simulation model
Fairness issue
Submitted to TPDS
Accepted by Globecom2008
Published in ICCCN2007
Yes
end-to-end circuits?
No
Dissertation Organization
Call blocking for circuit network?
Yes
homogeneous rate allocation
Theoretical component:File transfers on
a hybrid architecture
No
Call queueing circuit network
Blocked calls rerouted to the Internet path
For large files, waiting for high-speed circuit s is a better option than being immediately rerouted to Internet path
Hybrid Architecture - Example
9
Yellow nodes: Ciena CD-CI SONET switches
Blue nodes: Juniper T640 IP routers
Courtesy: Rick Summerhill (2006)
Internet2's new Dynamic Circuit (DC) network
Experimental component:Interconnect circuit networks
with the Internet
Call blocking circuit network
Homogeneous Heterogeneous
rate allocation
Designed a gateway
Implemented software
Characterized performance
Analytical model
Analytical model
Simulation model
Analytical model
Simulation model
Fairness issue
Submitted to TPDS
Accepted by Globecom2008
Published in ICCCN2007
Yes
end-to-end circuits?
No
Dissertation Organization
Call blocking for circuit network?
Yes
homogeneous rate allocation
Theoretical component:File transfers on
a hybrid architecture
No
Call queueing circuit network
Blocked calls rerouted to the Internet path
For large files, waiting for high-speed circuits is a better option than being immediately rerouted to Internet path
Call-blocking Circuit Network• Goal: design efficient connection-admission control
(CAC) algorithms Metrics: file-transfer delay and utilization
• Block call if circuit is unavailable; reroute to Internet
• Our focus: What is an appropriate minimum file size?
Serve files sized x > minimum file size, Â, via the circuit network
What is an appropriate circuit rate, r, for a file transfer?
11
Analytical Model
Assumptions:• Single class
homogeneous rate allocation m circuits; per-circuit rate, r=C/m
• Call arrival process: Poisson with rate, ¸0 [Paxson95]
• Call holding times: Pareto distribution [Crovella97]
routing decisionrouting decision
x > ¸0
¸0 … Link L
capacity C
NN
YY
nn
11
Circuit network
Internet
[Paxson95] V. Paxson and S. Floyd, "Wide area traffic: the failure of Poisson modeling," Networking, IEEE/ACM Transactions on , vol.3, no.3, pp.226-244, Jun 1995[Crovella97] M. E. Crovella and A. Bestavros, Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes, IEEE/ACM Transactions on Networking, 5(6):835--846.
Key Insights Combine M/G/m/m loss model & TCP delay model
Erlang-B formula: input the number of channels, m, & traffic load; output: call blocking probability and utilization
TCP model: bottleneck link rate, round-trip time, packet loss rate [Padhye98]
Two criteria to select  Delay-based (user-perspective): compare delay estimates across
two paths Utilization (service provider-perspective): make circuit-setup
overhead a small fraction (e.g., 10%) of circuit file-transfer delay Define a metric to quantify mean delay reduction
R = s-1 (E[Ttcp(x)]-E[Tcircuit(x)])¢fX(x)dx
Compute mopt (ropt = C/ mopt) & Âopt that maximize R
1
[Padhye98] J. Padhye, V. Firoiu, D. Towsley, and J. Kurose, “Modeling TCP throughput: A simple modeland its empirical validation,” in Proceedings of the ACM SIGCOMM, Aug. 1998, pp. 303–314.
Key Results
• To maximize R, ropt should be much higher than effective throughput on the Internet path e.g., Internet path: bottleneck link rate = 100 Mb/s,
RTT = 50 ms, packet loss rate = 1%) effective throughput = 1.9 Mb/s Circuit path: link capacity = 10 Gb/s, call-setup delay = 1 sec ) ropt = 63 Mb/s & Âopt = 75 MB
If r = 2 Mb/s ) Â = 4.5 MB) Files of size (4.5 MB, 75MB) will get lower delay on circuits
But, mean delay will increase; hence directed to Internet
• Load sensitive: under low loads, Larger per-call circuit rate, ropt
Larger ropt ) Larger minimum file size, Âopt
Relax utilization criterion to decrease Âopt
• RTT sensitive: Larger ropt & Âopt for short-RTT path 14
Experimental component:Interconnect circuit networks
with the Internet
Call blocking circuit network
Homogeneous Heterogeneous
rate allocation
Designed a gateway
Implemented software
Characterized performance
Analytical model
Analytical model
Simulation model
Analytical model
Simulation model
Fairness issue
Submitted to TPDS
Accepted by Globecom2008
Published in ICCCN2007
Yes
end-to-end circuits?
No
Dissertation Organization
Call blocking for circuit networks?
Yes
homogeneous rate allocation
Theoretical component:File transfers on
a hybrid architecture
No
Call queueing circuit network
Blocked calls sent to the Internet path
For large files, waiting for high-speed circuits is a better option than being immediately rerouted to Internet path
Homogeneous Rate Allocation
16
• Key question: how much bandwidth should be allocated for each file transfer so that the system performance is optimized in terms of mean response time at a given effective utilization?
• Metrics: mean response time
• File size: bounded-Pareto distribution
• Call arrival: Poisson
M/G/m queueing model
• Goal: compute per-call circuit rate, ropt (i.e., C /mopt)
• Input: A set of m = {1, 10, 100, 1000} Link capacity C = 10 Gb/s
) r = {10Gb/s, 1Gb/s, 100Mb/s, 10Mb/s} Call setup delay = 1 sec Bounded-Pareto parameters
) the first two moments of service time Traffic load 2 (0, 1)
• Output: Effective utilization: call-setup delay overhead Mean waiting time
17
19
• Heterogeneous scheme: divide calls into classes based on file size & allocate each class a different-rate circuit
A complete-partitioning system
Heterogeneous Rate Allocation
Analytical model• Multiple separate M/G/m subsystems
Basis for classifying calls: cutoff points, Â1,…, Ân-1?
Bandwidth allocation per subsystem, C1, …, Cn?
Ideal per-call circuit rate for each class, r1, …, rn?
• To compute optimal operating point that minimizes mean response time: Mathematica optimization package e.g., for a 2-class system
Start with an initial value for Â1
Determine C1, C2 & r1, r2
Vary Â1 to study its impact
• Fairness: Fairness ratio: ratio of mean slowdown of 2 classes Slowdown: ratio of waiting time to service requirement
20
21
File-size distribution parameters: Smallest file size: l = 1 MB Largest file size: u = 1 TB Cutoff point: Â = 1000 MB
Homogeneous system is virtually divided into 2 subsystems by Â
Fairness Ratio (small-file to large-file)
• A complete-partitioning heterogeneous scheme treats small files more fairly when compared with a complete-sharing homogeneous scheme
22
Heterogeneous system
Homogeneous system (at all utilization levels)
Simulation Study
• Single-link: simulation results are consistent with analytical results
• Multi-link: fairness study Short-path vs. long-path calls
Work-conserving scheme: unfair to long-path callsProposed conditional-priority scheme: give priority
to long-path calls based on queue occupancy
Small-file vs. large-file callsComplete-partitioning heterogeneous scheme
23
Key Results
• Complete-partitioning heterogeneous rate allocation Large files allocated high-rate circuits Lowers mean response time Treats small files more fairly when compared with
complete-sharing Requires a network management system to monitor
traffic load & dynamically update partitions
• Conditional priority scheme improves the fair treatment between long-path and short-path calls
24
Experimental component:Interconnect circuit networks
with the Internet
Call blocking circuit network
Homogeneous Heterogeneous
rate allocation
Designed a gateway
Implemented software
Characterized performance
Analytical model
Analytical model
Simulation model
Analytical model
Simulation model
Fairness issue
Submitted to TPDS
Accepted by Globecom2008
Published in ICCCN2007
Yes
end-to-end circuits?
No
Dissertation Organization
Call blocking for circuit networks?
Yes
homogeneous rate allocation
Theoretical component:File transfers on
a hybrid architecture
No
Call queueing circuit network
Blocked calls rerouted to the Internet path
For large files, waiting for high-speed circuit s is a better option than being immediately rerouted to Internet path
Experimental Component• Motivation:
It is expensive to deploy a new networking technology on an end-to-end basis
As link speeds increase, high-capacity circuit switches are cheaper than packet switches
Circuit-switched (CS) networks operated in shared mode ) admission control (AC) phase
Connectionless (CL) networks have no admission control phase
So internetworking CL + shared CS is a challenge
• Our solution: gateway that implements all sub-layers of the network layer with data-plane and control-plane (AC)
• Metrics: reliable file transfer, circuit utilization, forwarding rate26
Related Work
• State-of-the-art: IP routers Original purpose: interconnect connectionless networks [Cerf74,
RFC791, Clark88] Connection-oriented networks when used in the Internet are
used only in leased-line mode
• Proposed but not deployed: IP-over-ATM internetworking: Ipsilon's IP switching
Routers have to "guess" which flows are long-lived TCP switching: IP switching with protocol classifier
27[Cerf74] V. G. Cerf and R. E. Kahn, “A protocol for packet network intercommunication,” IEEE Transactions on Communications, vol. 22, no. 5, pp. 637–648, May 1974.[Clark88] D. D. Clark, “The design philosophy of the DARPA Internet protocols,” in SIGCOMM. Stanford, CA: ACM, Aug. 1988, pp. 106–114.
Internetworking Architecture
Circuitsubnetwork
IP-router core subnetwork
R
RRR
CAG CAG
regional network
R
regional network
R
R
R
CAG
Web client Web server
R IP router circuit/VC switchEthernet switch
enterprisenetwork
S
enterprise network
S
S
S R
R
S
CAG Circuit-aware application gateway
RR
Connectionless Connectionless
28
Internetworking Architecture
29
Circuit/subnetwork
IP-router core subnetwork
R
RRR
CAG CAG
regional network
R
regional network
R
R
R
CAG
Web client Web server
R IP router circuit/VC switchEthernet switch
enterprisenetwork
S
enterprise network
S
S
S R
R
S
CAG Circuit-aware application gateway
HTTP/TCP/connectionless IP HTTP/CTCP/circuit
RR
Gateway Design
• Start with an open-source Web proxy software package called Squid
• Data-plane: Base functionality provided by Squid Integrated Circut-TCP (removes Slow Start, receive-side
autotuning)
• Control-plane: Integrated RSVP-TE signaling client module into Squid to initiate circuit setup/release
30
Gateway Design contd.
• Unpredictable rate across connectionless (CL) segments
• But fixed-rate across circuit-switched (CS) segments
• What if these are mismatched?
• Need buffering within gateways
• Buffers are finite: so possibility of losses?
• Squid implementation: back-pressure mechanism; Data not read from incoming TCP buffer if Squid buffer
(controlled by read_ahead_gap) is full Latter is full if outgoing TCP buffer is full
• Leads to circuit utilization problems
• Answer: main memory or disk buffering in gateways + multiplexing on circuits
31
Experimental Hypothesis
A modified version of Squid software can be used as a gateway to interconnect circuit-switched networks and connectionless packet-switched networks for reliable file transfers, and can support an effective throughput of 460 Mb/s when executed on a Linux 2.6.20 host with a 2.8GHz Xeon processor and 1 GB memory.
32
Experimental setup to test if there is buffer overflow
• NIC speeds: CHEETAH NIC (NIC2) = GbE, Internet NIC (NIC1) ¸ 100 Mb/s
• Circuit (zelda1 $ zelda4) rate=155Mb/s, link (zelda4 $ zelda5) rate=1Gb/s
• Control link rate on zelda1 ! zelda2 path to mismatch sending and receiving rates
• The parameter read_ahead_gap controls CAG’s application buffer for each flow, read_ahead_gap = 16 KB (default value)
33
34
CAG zelda1’s forwarding rate CAG zelda1’s CPU and memory usage
CAG zelda1’s receive window size for zelda1 $ zelda4 CTCP connection
• Key results: No packet loss in buffers within
CAGs due to a back-pressure mechanism
Drawback: low circuit utilizatione.g., only 1/155 < 1% for 1 Mb/s bottleneck link rate
Improving Circuit Utilization
• Configured read_ahead_gap: e.g., when read_ahead_gap (for CAG zelda1) = 1 GB, circuit
utilization = 90% for a 1-GB file transfer Problem: unscalable because Squid only uses main memory for
buffering in-transit data
• Disk buffering: used two instances of Squid on a CAG
35
Other Experiments & Analysis
• Measured maximum forwarding rate Stress test by using long flows: 460 ± 4.75 Mb/s
• Measured user-perceived throughput Throughput improvement when circuits replace
congested Internet paths.
• Related the internetworking architecture with the TCP/IP & OSI reference models Fits into the OSI model
36
Conclusions• File transfers on circuit-switched (CS) networks
Advantage relative to packet switching: predictable service time
• Packet switching (PS) better for small file transfers Call setup delay >> Transfer time (link rates ↑, transfer time ↓) Predictability not a concern when absolute delays are low
• Hence hybrid architecture: PS for small; CS for large
Call admission control algorithms designed to be fair across small-file, large-file & across short-path, long-path
37
Considered in routing decision
Not considered
Call blocking √ X
Call queueing X √
Internet path metricsCircuit
network operation
Conclusions contd.
• Designed a gateway called CAG to interconnect connectionless networks with circuit networks CAG implements all sub-layers of the network layer with
data-plane and control-plane (admission control) CAG supports reliable file transfers File transfers need high-speed links on whole path Gradually evolving circuit-switched networks for access
(current bottleneck) will lead to improved performance
38
Future Work
• More sophisticated bandwidth-sharing schemes Currently studied a complete partitioning scheme To avoid sensitivity to network management system
performance as is the case with partitioning
• Hardware-based implementation of CAG with the support of disk buffering for in-transit data Current software implementation could slow down
effective transmission rates
39
Questions from Form G111 -
Defining the problem
• Has the student stated the problem clearly, provided its motivation, and the requirements for a solution?
• In the context of new optical circuit-switched technologies and new application requirements, how do we support file transfers efficiently on a dynamically shared circuit-switched network and how can we interconnect a circuit network with a connectionless network?
42
Questions from Form G111 -
Analysis of previous and related work• Theoretical component: file transfers on circuit networks
Packet switching is considered better But circuit switching provides rate guarantees Very large file transfers on optical connection-oriented testbeds
e.g.: ESnet4, NSF DRAGON, CA*net4, UKLight, JGN2, etc. Focus: implementation & inter-domain usage Our work: how much bandwidth to allocate per file transfer
File transfers have not been considered on other circuit/virtual-circuit networks
e.g.: telephone networks, ATM
• Experimental component: interconnect circuit networks with connectionless networks State-of-the-art: IP routers
Original purpose: interconnect connectionless networks Used leased line modes to include circuit networks
Proposed but not deployed: IP switching & TCP switching Our work: gateway that handles service-type mismatch between
connectionless and circuit networks
43
Questions from Form G111 -
Success criteria
• Has the student adequately defined the measure(s) of success to be used to evaluate the work? Is there a well defined metric with a goal? Does the metric adequately represent the desired success criteria?
• Success criteria Theoretical work: use a hybrid architecture for file transfers
Call blocking circuit network: optimal design parameters to maximize mean delay reduction
Call queueing circuit network: optimal design parameters to minimize mean response time at a given effective utilization
Experimental work: designed an internetworking gateway called CAG CAG supports reliable file transfers Improved circuit utilization Measured maximum forwarding rate of CAG
• Metrics Theoretical work: file-transfer delay, utilization, mean delay reduction,
fairness ratio Experimental work: reliable file transfer, circuit utilization, forwarding rate,
user-perceived throughput44
Questions from Form G111 -
Solution
• Is the approach taken well executed? Does it appear to be correct? Is the work technically challenging? Does the student utilize appropriate professional standards?
• A combination of analytical, simulation, and experimental methods Call blocking circuit network for file transfers
Analytical model Call queueing circuit network for file transfers
Analytical model Simulation model
An internetworking gateway Software implementation Experimentation and measurements Architecture positioning
45
Questions from Form G111 -
Innovation and risk
• To what extent is the work innovative? Has the student taken a risk in applying the chosen approach?
• Bandwidth sharing problem on using circuit networks for file transfers has not been studied before
• The problem of internetworking connectionless networks and dynamically shared circuit networks has not been addressed widely (only one previous solution – from the 90s which proved unviable)
46
Questions from Form G111 -
Broader implications
• Has the student considered the broader implications of the work? Broader implication may include social, economic, political, technical, ethical, business, etc.
• Enable the deployment of high-speed circuit networks at low costs (sharing) to provide predictable-delay services New applications can be created with this type of service
• Integrated with Internet Avoids need for desert-start deployment
47
Background – High-Speed Circuit-Switching
• Data-plane technologies Switching: Time Division Multiplexing (TDM) &
Wavelength Division Multiplexing (WDM) Mapping: to carry Ethernet frames via SONET signals or
WDM lightpaths
• Control plane: Generalized MultiProtocol Label Switched (GMPLS) Three components: signaling, routing, & management Bandwidth sharing mode: immediate-request (IR)
• Equipment examples: SONET switches: Sycamore SN16000 WDM switches: Adva/Movaz RayExpress OADM
48
Layers in OSI reference modelAL: Application LayerTL: Transport Layer
Sublayers of network layer (NL)• SNICF: Subnetwork Independent Convergence Function• SNDCF: Subnetwork Dependent Convergence Function• SNACF: Subnetwork Access Function• SNSF: Subnetwork Switching Function
DLL or L: Data-Link LayerPHY or P: Physical Layer
[ITU X.200] http://www.itu.int/rec/T-REC-X.200-199407-I/en[Callon83] R. E. Callon, "Internetwork protocol,“ Proc of the IEEE, vol. 71, no. 12, pp. 1388-1393, Dec. 1983
Layers in the Internetworking Architecture
50
This internetworking architecture fits into OSI reference model
• File-size distribution: bounded-pareto, BP(®, l, u),
®: shape, l: minimum file size, u: largest file size
• Service time:
• Per-server traffic intensity:
• Effective utilization:
• Mean waiting time:
• Mean response time:
Analytical Model: Homogeneous Rate Allocation
• Each subsystem:
File-size distribution: bounded-pareto, BP(®, Âi-1, Âi),
®: shape, Âi-1 : minimum file size, Âi : largest file size
E[Y j ] = pi ¢ E[Yij]
Call arrival rate:
Capacity:
• Whole system:
Effective utilization:
Mean response time:
Analytical Model: Heterogeneous Rate Allocation
Model Validation & Verification
• Model validation Our models are for an initial design to support file transfers on circuit networks
No real-world measurements Model validation technique – peer/expert reviews
Real system measurements “available” for input parameters
E.g., real-system measurements for file transfer – Poisson call arrival process– Pareto distribution
• Model verification Compare analytical model results with simulation model results
54
[Jain91] R. Jain, The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling, New York, Wiley-Interscience, 1991.
“Three validation techniques Expert intuition Real system measurements Theoretical results” [Jain91]
“Three aspects of model validation Assumptions Input parameter values and distributions Output values and conclusions” [Jain91]
Related Work – File Transfers on Other Testbeds
• Other testbeds: Large file transfers high per-circuit rate, long holding time Coarse-Grained Sharing (CGS)
• Our interest: Fine-Grained Sharing (FGS) for all files
55
Leased Line CGS FGS TCP/IP
Different bandwidth sharing modes
coarse fine
Circuit-switched networks: Signaling for call setup
Connection setup actions at each switch on the path:1. Parse message to extract parameter values2. Lookup routing table for next hop to reach destination3. Read and update CAC (Connection Admission
Control) table4. Select timeslots on output port5. Configure switch fabric: write entry into timeslot
mapping table6. Construct setup message to send to next hop
56
Host I-A Host
III-B
I
IV V
III
II
ab
c
a
b
c
d
d c
a
b
Connection setup (Dest: III-B; BW: OC1; Timeslot: a, 1)
Dest. Next hop
III-* IV
Routingtable
Connection-Admission Control (CAC)
57The procedure of CAC
Receive a file-transfer request with size x
x > ÂY
reject
N
allocate a circuit with rate rx
rejectaccept
Simple sum:rx < available bandwidth
NYSwitch model
Â: crossover file size rx: allocated bandwidth
CAC
12
N-1
N
¸ Simple sum
 & rx
Network Management System
configures Â
Design and Implementation
• How does a CAG select the “best” parent? Static configuration based on geographic location of
Web servers using ARIN WHOIS database
• How does a user configure the Web client to use the Web proxy server? Configuring for every request is not user-friendly Instead, use Proxy Auto-Configuration (PAC)
58
File Transfers on Other Circuit/VC Networks
• Has ATM implemented file transfers with a guaranteed service? No.
59
Service classes on the ATM layer Hard QoS for multimedia applications
CBR: voice & VBR: audio No QoS at the ATM level for all other data traffic
ABR & UBR File transfers are served by UBR
No guaranteed bandwidth allocation Loss recovered with ARQ in TCP No effort to maintain flow rate to match VC rate
Analytical Model (cont.)
6060
M/G/m/m loss model Erlang-B: compute call blocking probability, Pb, and utilization, Ub,
given the number of channels, m, and traffic load, ½
The computation of ½:
¸: aggregate call arrival rate offered to the switch 1/¹: mean call holding time
…1
NLink L, capacity CRD
0
RD: routing decisionRD: routing decision
A switch model for file transfers
The Derivation of Offered Traffic Load, ½
6161
Serve files with size x > Â
Original call arrival rate, N¢¸0
Offered traffic load
File size: Pareto distribution
®: shape, k: scale
Mean file sizeCircuit file-transfer delay
Mean call holding time Aggregate offered call arrival rate
Tprop: propagation delay
C/m: per-circuit rate
The Selection of Crossover File Size, Â
6262
Utilization-based, Âu: the mean call-setup delay, E[Tsetup], is
a small fraction of circuit file-transfer delay, Ttransfer(x)
Choose
Delay-based, Âd: compare two delay values
where
e.g., ¯=10
Pb: call blocking probability
Numerical Results - Input Parameters
6363
File size distribution: shape, ®=1.0009, scale, k=1000bytes
Circuit: Link capacity C=10Gbps Original call arrival rate, N¢¸0=1100calls/second
Mean call setup delay, E[Tsetup]=1second Round trip time, RTT=50ms Utilization factor, ¯=10
Internet path Bottleneck link rate, r=100Mbps Packet loss rate, Ploss=1% Round trip time, RTT=50ms
6464
Numerical Results: Impact of Per-circuit Rate on Â
Low per-circuit rateHigh per-circuit rate
Link capacity expressed in channels
Del
ayed
-bas
ed c
ross
over
file
siz
e
Util
izat
ion-
base
d cr
osso
ver
file
size
6565
For m=10, 100, and 1000 (i.e, per-circuit rate is 1Gb/s, 100Mb/s, and 10Mb/s)Âu is the limiting factor Simplifies the computation of Â
Del
ayed
-bas
ed c
ross
over
file
siz
e
Util
izat
ion-
base
d cr
osso
ver
file
size
Numerical Results: Impact of Per-circuit Rate on Â
m=1000m=1000
m=100m=100
6666
Numerical Results: Impact of Per-circuit Rate on File-transfer Delay
File
tra
nsfe
r de
lay
over
circ
uits
or
the
Inte
rnet
m=10, 100, and 1000 $per-circuit rate is 1Gb/s 100Mb/s, & 10Mb/s
(1.2GB)(11.9MB) (118.9MB)
68
Design and Implementation
• Gateway: Started with an open-
source Web proxy software package called Squid
Integrated RSVP-TE signaling client module into Squid to initiate circuit set up/release
Integrated Circut-TCP (removes Slow Start, receive-side autotuning)
Added monitoring module to watch circuit usage. Initiate circuit release if idle for time >T
Receive a Web request
Cache miss
Circuit toparent
Fork a process to attempt a circuit setup;
Meanwhile, serve the request via the IP-
router core subnetwork
Y
Serve the request
N
N
Serve the request via the circuit
Y
Experiments to Measure User-perceived Response Time
• Two sets Controllable experiments by loading specific-
size files on a Web server Operational Web sites
• For each set, two tests Direct without proxy CHEETAH proxy: via CAGs but without
caching
69
Experimental Results – 1st SetInternet
CHEETAHSN16000SN16000
NIC1
NIC2wuneng
NIC1
NIC2
zelda1
Web server: zelda2
CAGCAG
Raleigh, NC
Web client: unc-planetlab1
Chapel Hill, NC
Atlanta, GAcircuit
707046.40± 1.70
15.08± 0.55
1 MB 100 MB10 MB100 KB
58.59± 2.3064.64± 3.2113.61± 0.76CHEETAH proxy
16.71± 0.6617.25± 0.5210.70± 0.65Direct
File sizeTest(Mb/s)
direct CHEETAH proxy
unc-planetlab1 zelda2
unc-planetlab1 wuneng
wuneng zelda1 (circuit)
zelda1zelda2
10.77 ms 1.27 ms 8.87 ms 0.23 ms
Experimental Results – 2nd Set
Web server parameters RTT (ms) via the Internet
Bottleneck local RTT
(ms)
file size (MB)
Latencies (s)
direct CHEETAH Proxy
namelocation
kernel.org Carrollton, TX
86 61.6 48 70 33
sourceforge.net Atlanta, GA 32 14.6 113 520 140
Web client
CHEETAH wuneng zelda1
Web serverBallstein.cs.virginia.edu
7171
Background –High-Speed Circuit-Switched Networks
CHEETAH: Circuit-switched High-speed End-to-End Transport ArcHitecture
SN16000: circuit switch 7272
US: DOE’s UltraScience net, CHEETAH, Internet2 Dynamic Circuit network Europe: UKLight (UK), SURFnet (Netherland), VIOLA (Germany), MUPBED Canada: CA*net 4 Japan: JGN
Atlanta, GA
zelda1
zelda2
zelda3
Raleigh, NC
OC192card
ControlCard
GbE/10GbE
card
SN16000
H wukong
Oak Ridge, TN
To Cray X1
zelda4
zelda5
HH
OC192card
ControlCard
GbE/10GbE
card
SN16000
H
HH
OC192card
ControlCard
GbE/10GbE
card
SN16000OC-192 OC-192