Fail-Aware Publish/Subscribe

Fail-Aware Publish/Subscribe, NCA 2007
Zbigniew Jerzak, Christof Fetzer, Robert Fach ({Zbigniew.Jerzak, Christof.Fetzer, Robert.Fach}@inf.tu-dresden.de)
Dresden University of Technology, October 16, 2008
27 slides: Motivation, Overview, The Upper Bound, Evaluation, Summary

description

In this paper we present a wide-area distributed system using a content-based publish/subscribe communication middleware which can deterministically detect and report failures with respect to timely message delivery and message omission. Our approach does not require external clock synchronization, nor does it impose any constraints on the publish/subscribe middleware. We show that our system performs better and is safer than one that uses NTP for external clock synchronization. We provide a proof-of-concept implementation and present results of experiments carried out in the PlanetLab environment.

Transcript of Fail-Aware Publish/Subscribe

Page 1: Fail-Aware Publish/Subscribe

Fail-Aware Publish/Subscribe

NCA 2007

Zbigniew Jerzak, Christof Fetzer, Robert Fach

Dresden University of Technology

October 16, 2008

{Zbigniew.Jerzak, Christof.Fetzer, Robert.Fach}@inf.tu-dresden.de

Page 2: Fail-Aware Publish/Subscribe

State-of-the-art

- Problems:
  - we have conversation
  - we need dissemination
- Dissemination:
  - you are interested in data (not source)
- Hence loose-coupling:
  - anonymous
  - asynchronous

Page 5: Fail-Aware Publish/Subscribe

Content-Based Systems

- Peer-to-Peer, Publish/Subscribe, CBN...
- Content-/Name-driven
- Loosely-coupled and distributed
- Failures are imminent [MPHD06]

Page 7: Fail-Aware Publish/Subscribe

Fail-Awareness

...allows an application to [...] detect when it cannot guarantee all its safety properties and in particular, to detect when to switch to a fail-safe mode [FC03]

- Did we receive all messages?
- Was the received message timely?
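
The two questions can be made concrete with a small subscriber-side check. This is a minimal sketch under stated assumptions, not the authors' implementation: the names `FailAwareSubscriber` and `Delivery`, the per-stream sequence numbers, and the `deadline_us` parameter are all hypothetical and do not appear in the slides.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Delivery:
    """Outcome of checking one received publication (hypothetical type)."""
    omission_detected: bool  # a sequence-number gap preceded this message
    timely: bool             # the delay upper bound is within the deadline


@dataclass
class FailAwareSubscriber:
    """Assumed subscriber-side checker, for illustration only."""
    deadline_us: float
    # Opaque stream identifier -> highest sequence number seen so far.
    last_seq: Dict[str, int] = field(default_factory=dict)

    def on_publication(self, stream: str, seq: int, upper_bound_us: float) -> Delivery:
        previous = self.last_seq.get(stream)
        # "Did we receive all messages?" A gap in sequence numbers signals an omission.
        omission = previous is not None and seq > previous + 1
        self.last_seq[stream] = seq if previous is None else max(seq, previous)
        # "Was the received message timely?" Compare the upper bound on the
        # transmission time, not the (unknown) true delay, with the deadline.
        timely = upper_bound_us <= self.deadline_us
        return Delivery(omission_detected=omission, timely=timely)


subscriber = FailAwareSubscriber(deadline_us=50_000)
first = subscriber.on_publication("stream-a", 1, upper_bound_us=20_000)
second = subscriber.on_publication("stream-a", 3, upper_bound_us=80_000)
```

In this sketch the second delivery is flagged both as following an omission (sequence number 2 never arrived) and as not timely, which is the point at which an application could switch to a fail-safe mode.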

Page 9: Fail-Aware Publish/Subscribe

Publish/Subscribe Systems [CRW01]

Page 10: Fail-Aware Publish/Subscribe

Time in Publish/Subscribe

- no clock synchronisation
- no violation of loose-coupling
- many-to-many communication

How long did it take for a message to be delivered from p to q?

Page 12: Fail-Aware Publish/Subscribe

Time in Publish/Subscribe

Page 13: Fail-Aware Publish/Subscribe

Idea

- do not calculate the transmission time
- calculate upper bound on transmission time

Message transmission time is no longer than the upper bound
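
The transcript does not preserve how the bound is obtained. A minimal sketch, assuming a round-trip argument between two processes p and q whose clocks are unsynchronized but have a bounded drift rate: q timestamps a helper message n to p, p later sends m, and q bounds the delay of m by the round trip measured on its own clock, minus the time p held the exchange, minus a minimum delay for n. The function name, the helper message, and the parameters `max_drift` and `min_delay` are assumptions for illustration, not the authors' algorithm.

```python
def transmission_upper_bound(a, b, c, d, max_drift=1e-4, min_delay=0.0):
    """Upper bound on the real-time delay of message m sent from p to q.

    a: q's local clock when q sent an earlier helper message n to p
    b: p's local clock when p received n
    c: p's local clock when p sent m
    d: q's local clock when q received m
    max_drift: assumed bound on the clock drift rate (1e-4 means 100 ppm)
    min_delay: assumed minimum one-way network delay, same unit as the clocks

    Only differences taken on the same clock are used (d - a on q, c - b on p),
    so the offset between the two clocks never enters the calculation.
    """
    if d < a or c < b:
        raise ValueError("timestamps taken on the same clock must be ordered")
    # Real time elapsed at q is at most the measured time stretched by the drift.
    round_trip_at_q = (d - a) * (1 + max_drift)
    # Real time p held the exchange is at least the measured time shrunk by the drift.
    hold_time_at_p = (c - b) * (1 - max_drift)
    # The helper message n took at least min_delay, so m took at most the rest.
    return round_trip_at_q - hold_time_at_p - min_delay


# Clocks of p and q are far apart, yet the bound is still well defined.
bound_us = transmission_upper_bound(a=1_000, b=5_000_000, c=5_000_200, d=1_900,
                                    max_drift=0.0)
```

With zero drift the example yields 900 - 200 = 700 microseconds; a non-zero drift bound makes the result slightly larger, which keeps it a safe over-estimate.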

Page 15: Fail-Aware Publish/Subscribe

Upper Bound – Point to Point

Page 22: Fail-Aware Publish/Subscribe

Upper Bound – Publish/Subscribe

Page 27: Fail-Aware Publish/Subscribe

Upper Bound – Optimizations

Page 35: Fail-Aware Publish/Subscribe

Upper Bound vs NTP

NTP

- no guarantees whatsoever
- 2*max error gives estimated upper bound

Upper Bound

- precise
- more accurate
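
The NTP bullet gives no formula, so the following is one possible reading rather than the authors' definition: with NTP-synchronized clocks the delay is estimated as the difference between the receive and send timestamps, and because each of the two clocks may be off by up to the maximum error, twice that error is added. The function and parameter names are hypothetical.

```python
def ntp_estimated_upper_bound(send_ts_us, recv_ts_us, max_error_us):
    """Estimated delay bound from NTP-synchronized timestamps (assumed reading).

    send_ts_us: publisher's synchronized clock at send time
    recv_ts_us: subscriber's synchronized clock at receive time
    max_error_us: assumed maximum clock error of each host
    """
    # Each clock may deviate by max_error_us, so the timestamp difference
    # may under-estimate the true delay by up to twice that amount.
    return (recv_ts_us - send_ts_us) + 2 * max_error_us


estimate_us = ntp_estimated_upper_bound(send_ts_us=1_000, recv_ts_us=31_000,
                                        max_error_us=5_000)
```

The result (40 000 us in this example) is only an estimate: it is a true bound only if the assumed maximum error actually holds, which is the "no guarantees whatsoever" caveat on the slide.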

Page 36: Fail-Aware Publish/Subscribe

Experiment Setup

Page 37: Fail-Aware Publish/Subscribe

TSC as Time Source

[Figure: drift rate [ppm] over time (days:hours) for planetlab-1a.ics.uci.edu, planetlab01.mpi-sws.mpg.de, planetlab-2.cse.ohio-state.edu, and planetlab2.exp-math.uni-essen.de]

Page 38: Fail-Aware Publish/Subscribe

TSC as Time Source – Single Host

[Figure: drift rate [ppm] over time (days:hours); series: planetlab-2.cse.ohio-state.edu, error]

Page 39: Fail-Aware Publish/Subscribe

Publication Transmission Time (national)

[Figure: time [us] and omission error (no/yes) versus received publication #; series: F-A P/S upper bound, realtime, error, omission error]

Page 40: Fail-Aware Publish/Subscribe

Publication Transmission Time (global)

[Figure: time [us] and omission error (no/yes) versus received publication #; series: F-A P/S upper bound, realtime, error, omission error]

Page 41: Fail-Aware Publish/Subscribe

Publication Transmission Time – Bootup (national)

[Figure: time [us] and omission error (no/yes) versus received publication #; series: F-A P/S upper bound, realtime, error, omission error]

Page 42: Fail-Aware Publish/Subscribe

Publication Transmission Time – Error (national)

[Figure: time [us] and omission error (no/yes) versus received publication #; series: F-A P/S upper bound, realtime, error, omission error]

Page 43: Fail-Aware Publish/Subscribe

Processing Time vs Transmission Time (national)

[Figure: time [us] versus message; series: network transmission delay, processing time]

Page 44: Fail-Aware Publish/Subscribe

Upper Bound vs NTP (national)

Page 45: Fail-Aware Publish/Subscribe

Upper Bound vs NTP (global)

Page 46: Fail-Aware Publish/Subscribe

Upper Bound vs Real Time (national)

[Figure: number of messages versus error [us]; series: upper bound - realtime]

Page 47: Fail-Aware Publish/Subscribe

Upper Bound vs Real Time (global)

[Figure: number of messages versus error [us]; series: upper bound - realtime]

Page 48: Fail-Aware Publish/Subscribe

Summary & Outlook

- Fail-Awareness in Publish/Subscribe
- Better or comparable with NTP
- No constraints on the system

Outlook

- Take a closer look at omissions

Page 50: Fail-Aware Publish/Subscribe

Thank You!

http://wwwse.inf.tu-dresden.de/zib/

Page 51: Fail-Aware Publish/Subscribe

References

[CRW01] Antonio Carzaniga, David S. Rosenblum, and Alexander L. Wolf. Design and evaluation of a wide-area event notification service. ACM Trans. Comput. Syst., 19(3):332–383, 2001.

[FC03] Christof Fetzer and Flaviu Cristian. Fail-awareness: An approach to construct fail-safe applications. Journal of Real-Time Systems, pages 203–238, March 2003.

[MPHD06] Alan Mislove, Ansley Post, Andreas Haeberlen, and Peter Druschel. Experiences in building and operating ePOST, a reliable peer-to-peer application. In EuroSys, 2006.
