Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department...

33
Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ

Transcript of Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department...

Page 1: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Brownian Bridge and nonparametric rank tests

Olena Kravchuk School of Physical SciencesDepartment of MathematicsUQ

Page 2: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

2

Lecture outline

Definition and important characteristics of the Brownian bridge (BB)

Interesting measurable events on the BB

Asymptotic behaviour of rank statistics

Cramer-von Mises statistic

Small and large sample properties of rank statistics

Some applications of rank procedures

Useful references

Page 3: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

3

Definition of Brownian bridge

Page 4: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

4

Construction of the BB

Page 5: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

5

Varying the coefficients of the bridge

Page 6: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

6

Two useful properties

Page 7: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

7

Ranks and anti-ranks

)()(

'#)(

YdDYrR

ird

ysyyr

iiii

dr

ii

ii

First sample Second sample

Index 1 2 3 4 5 6

Data 5 7 0 3 1 4

Rank 5 6 1 3 2 4

Anti-rank 3 5 4 6 1 2

)()(

)()(

1

11

i

jj

i

N

iii

N

ii

DciT

dcarac

Page 8: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

8

Simple linear rank statistic

Any simple linear rank statistic is a linear combination of the scores, a’s, and the constants, c’s.

When the constants are standardised, the first moment is zero and the second moment is expressed in terms of the scores.

The limiting distribution is normal because of a CLT.

)1,0()var(

.1

1)var(,0)(

,1

,1,0

).()(

1

2

11

2

1

11

NS

Saa

NSSE

aaN

cc

DcaRacS

N

ii

N

ii

N

ii

N

ii

N

iii

N

iii

Page 9: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

9

Constrained random walk on pooled data

Combine all the observations from two samples into the pooled sample, N=m+n.

Permute the vector of the constants according to the anti-ranks of the observations and walk on the permuted constants, linearly interpolating the walk Z between the steps.

Pin down the walk by normalizing the constants. This random bridge Z converges in distribution to the Brownian

Bridge as the smaller sample increases.

.1,)1()(

,),(

.1,0)(;

.,1

,,1

1

2

1

tNitNTtNiTtNi

tNiTtiZ

ccDcT

miN

mn

n

miN

mn

mc

ii

i

i iii

i

jjii

Page 10: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

10

From real data to the random bridge

First sample Second sample

Index, i 1 2 3 4 5 6

Data, X 5 7 0 3 1 4

Constant, c 0.41 0.41 0.41 -0.41 -0.41 -0.41

Rank, R 5 6 1 3 2 4

Anti-rank, D 3 5 4 6 1 2

Bridge, Z 0.41 0 -0.41 -0.82 -0.41 0

41.06/1,3 cnm

Page 11: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

11

Symmetric distributions and the BB

2exp

2

1)(,

24

1)(),(

1)(

2yyf

ysechyfysechyf N

2LHSD

Page 12: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

12

Random walk model: no difference in distributions

Page 13: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

13

Location and scale alternatives

Page 14: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

14

Random walk: location and scale alternatives

Shift = 2 Scale = 2

Page 15: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

15

Simple linear rank statistic again

The simple linear rank statistic is expressed in terms of the random bridge.

Although the small sample properties are investigated in the usual manner, the large sample properties are governed by the properties of the Brownian Bridge.

It is easy to visualise a linear rank statistic in such a way that the shape of the bridge suggests a particular type of statistic.

1

0

111

)()(

).(

dttBtbS

RacN

bcZ

N

bS

N

iii

N

i

N

Rj

ji

N

ii

i

i

Page 16: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

16

Trigonometric scores rank statistics

The Cramer-von Mises statistic

)1,0(~.)(,1

.,)(

122

21

0

22

1

22

1

222

NXi

XdttBWT

NW

m

m

n

nnmmnW

i

iN

ii

iii

N

ii

The first and second Fourier coefficients:

dttBtSTN

i

NS

dttBtSTN

i

NS

N

ii

N

ii

)()2sin(22,2

sin2

2

2

)()sin(2,sin2

21

0

21

2

1

0

11

1

Page 17: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

17

Combined trigonometric scores rank statistics

The first and second coefficients are uncorrelated

Fast convergence to the asymptotic distribution

22

2121 SSS

2221 ~ S

The Lepage test is a common test of the combined alternative (SW is the Wilcoxon statistic and SA-B is the Ansari-Bradley, adopted Wilcoxon, statistic)

Page 18: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

18

Percentage points for the first component (one-sample)

Durbin and Knott – Components of Cramer-von Mises Statistics

Page 19: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

19

Percentage points for the first component (two-sample)

Kravchuk – Rank test of location optimal for HSD

Page 20: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

20

Some tests of location

Page 21: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

21

Trigonometric scores rank estimators

Location estimator of the HSD (Vaughan)

Scale estimator of the Cauchy distribution (Rublik)

Trigonometric scores rank estimator (Kravchuk)

Page 22: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

22

Optimal linear rank test

An optimal test of location may be found in the class of simple linear rank tests by an appropriate choice of the score function, a.

Assume that the score function is differentiable. An optimal test statistic may be constructed by selecting the

coefficients, b’s.

.),(])[1(lim

]1,0[,))((

))(('),( 1

1

du

dbuuNa

uuFf

uFffu

N

Page 23: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

23

Functionals on the bridge

When the score function is defined and differentiable, it is easy to derive the corresponding functional.

)4/1,0(~)5.0()5.0()(

)()(),,0(~)()(,)()(

1

0

1

0

1

0

21

0

222

0

NBdtttB

dttdttNdttBtbdxxbtt

Page 24: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

24

Result 4: trigonometric scores estimators

Efficient location estimator for the HSD

Efficient scale estimator for the Cauchy distribution

Easy to establish exact confidence level

Easy to encode into automatic procedures

Page 25: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

25

Numerical examples: test of location

Normal 2Normal 1

750

700

650

600

550

500

450

400

350

300

Boxplots of Normal 1 and Normal 2(means are indicated by solid circles)

t-test Wilcoxon S1

p-value 0.150 0.162 0.154

CI95% (-172.4,28.6) (-185.0,25.0) (-183.0,25.0)

1. Normal, N(500,1002)2. Normal, N(580,1002)

Page 26: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

26

Normals2Normals1

600

500

400

300

200

100

Nor

mal

s1

Numerical examples: test of scale

F-test Siegel-Tukey S2

p-value 0.123 0.064 0.054

1. Normal, N(300,2002)2. Normal, N(300,1002)

Page 27: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

27

Normalc2Normalc1

1000

900

800

700

600

500

400

300

200

Nor

mal

c1

Numerical examples: combined test

F-test t-test S12+S2

2 Lepage CM

p-value 0.021 0.174 0.018 0.035 0.010

1. Normal, N(580,2002)2. Normal, N(500,1002)

Page 28: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

28

When two colour histograms are compared, nonparametric tests are required as a priori knowledge about the colour probability distribution isgenerally not available.

The difficulty arises when statistical tests are applied to colour images: whether one should treat colour distributions as continuous, discrete or categorical.

Application: palette-based images

Page 29: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

29

Application: grey-scale images

Page 30: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

30

Application: grey-scale images, histograms

Page 31: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

31

Application: colour images

Page 32: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

32

Useful books

1. H. Cramer. Mathematical Methods of Statistics. Princeton University Press, Princeton, 19th edition, 1999.

2. G. Grimmett and D. Stirzaker. Probability and Random Processes. Oxford University Press, N.Y., 1982.

3. J. Hajek, Z. Sidak and P.K. Sen. Theory of Rank Tests. Academic Press, San Diego, California, 1999.

4. F. Knight. Essentials of Brownian Motion and Diffusion. AMS, Providence, R.I., 1981.

5. K. Knight. Mathematical Statistics. Chapman & Hall, Boca Raton, 2000.

6. J. Maritz. Distribution-free Statistical Methods. Monographs on Applied Probability and Statistics. Chapman & Hall, London, 1981.

Page 33: Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.

Olena Kravchuk Brownian bridge and nonparametric rank tests

33

Interesting papers

1. J. Durbin and M. Knott. Components of Cramer – von Mises statistics. Part 1. Journal of the Royal Statistical Society, Series B., 1972.

2. K.M. Hanson and D.R. Wolf. Estimators for the Cauchy distribution. In G.R. Heidbreder, editor, Maximum entropy and Bayesian methods, Kluwer Academic Publisher, Netherlands, 1996.

3. N. Henze and Ya.Yu. Nikitin. Two-sample tests based on the integrated empirical processes. Communications in Statistics – Theory and Methods, 2003.

4. A. Janseen. Testing nonparametric statistical functionals with application to rank tests. Journal of Statistical Planning and Inference, 1999.

5. F.Rublik. A quantile goodness-of-fit test for the Cauchy distribution, based on extreme order statistics. Applications of Mathematics, 2001.

6. D.C. Vaughan. The generalized secant hyperbolic distribution and its properties. Communications in Statistics – Theory and Methods, 2002.