Sufficient statistics. The Poisson and the exponential can be summarized by (n, ).

Sufficient statistics.

The Poisson and the exponential can be summarized by (n, ).

So too can the normal with known variance

Consider a statistic S(Y)

Suppose that the conditional distribution of Y given S does not depend on , then S is a sufficient statistic for based on Y

Occurs iff the density of Y factors into a function of s(y) and and a function of y that doesn't depend on

y

More Chapter 4

Example. Exponential

IExp() ~ Y

E(Y) = Var(Y) = 2

Data y1,...,yn

L() = -1 exp(-yj /)

l() = -nlog() - yj /

yj /n is sufficient

2222

2

322

2

2

2)ˆ(

2)(

m.l.e. ˆ

1

0)ˆ( .

yn

yn

ynl

nnl

y

ynl

UequationLikelihood

maximum

0})(

2{)}({

)}({)(

.

2)(

.

2000

0

2

32

ynn

EUE

n

JEI

nInformatioFisher

ynn

J

nInformatioObserved

=

))(

,(~ˆ

distin )()(

)()(

probin 1)2

()()()(

)/()}({

)/()()}({

200

0

20

002/10

3

0

2

0

1

2

0

010

200

0

2000

0

nN

Zyn

nUI

ynnnn

JI

nJE

nIUVar

Approximate 100(1-2 )% CI for 0

2

2/1

/)ˆ()ˆ(

)ˆI(insert Could

)ˆ(ˆ

ynJIHere

Jz

Example. spring data

8.34)(168.26,16 )0188(.96.130.168

000353.3.168/10/)ˆ()ˆ(

3.168ˆ

22

ynJI

cyclesky

Weibull.

)/log()/(/log/

)/(/),(

log)1(loglog),(

)(exp),;()(

1 if lExponentia

0,, ,)(exp),;(

1

1

1

jjj

j

jj

j

jn

n

yyyn

ynU

yynnl

yyyfL

yyy

yf

Note.

),ˆ(),(max

problemmax D-1

)(ˆ /11

lll

yn

profile

j

Expected information

large Want

/)(log)(

/)(2)(2)'(1 (2)/-(2)/- )/(

),(22

2

I

dzzdz

nI

Gamma.

sufficient ),log(

log)1()(loglog)(

)exp()(

)(

0,, ),exp()(

),;(

1

1

jj

jj

jjn

n

yyS

yynnl

yyL

yyy

yf

Example. Bernoulli

Pr{Y = 1} = 1 - Pr{Y = 0} = 0 1

L() = ^yi (1 - )^(1-yi)

= r(1 - )n-r

l() = rlog() + (n-r)log(1-)

r = yj

R = Yj is sufficient for , as is R/n

L() factors into a function of r and a constant

Score vector

[ yj / - (n-yj )/(1-)]

Observed information

[yj /2 + (n-yj )/(1-)2 ]

ny j /ˆ

M.l.e.

Cauchy.

ICau()

f(y;) = 1/(1+(y-)2 )

E|Y| = Var(Y) =

L() = 1/((1+(yj -)2 )

Many local maxima

l() = -log(1+(yj -)2 )

J() = 2((1-(yj -)2 )/(1+(yj -)2 )2 I() = n/2

sufficient is ,....y

N(0,1) closer to is

)ˆ()ˆ( Z)ˆ()ˆ(

)((1)

0

2/1

J0

2/1

n

J

I

y

Z

JIZ

Uniform.

f(u;) = 1/ 0 < u <

= 0 otherwise

L() = 1/n 0 < y1 ,..., yn <

= 0 otherwise

0ˆ//)ˆ(

0/)ˆ(

,...y y

)max(ˆ

222

1(n)

nddl

ddl

y

y

n

j

l() becomes increasingly spikey

E u() = -1 i() = -

ondistributiin lExponentia)ˆ(

1

0 )/(

0 }ˆPr{

n

a

aa

aan

Logistic regression. Challenger data Ibinomials Rj , mj , j

)21()ˆ)(ˆ,ˆ()ˆ(

region Confidence

),(

statistic Sufficient

))exp(1(

)exp(

)!(!

!

),;Pr(),(

})exp{1/(}exp{

2001000

1

110

110

1010

110110

cJ

xRRS

x

xrr

rmr

m

rRL

xx

T

jjj

m

j

jjj

jjj

j

jj

jjj

j

Likelihood ratio.

Model includes dim() = p

true (unknown) value 0

Likelihood ratio statistic

)( ason distributiin )(

)}()ˆ({2)(

020

00

IW

llW

p

Justification.

Multinormal result

If Y ~ N (,) then (Y- )T -1(Y- ) ~ p2

)ˆ)(ˆ()ˆ(

)ˆ)(ˆ()ˆ(

)ˆ()ˆ(

)ˆ(21

)ˆ(

)ˆ()ˆ()ˆ({2

)}()ˆ({2)(

00

00

02

00

00

I

J

llll

llW

T

T

T

TT

Uses.

Pr[W(0) cp(1-2 )] 1-2

)}21(21

)ˆ()(:{

)21( )(

p

p

cll

cW

Approx 100(1-2 )% confidence region

Example. exponential

84.3}/log1/{2:{

84.3)95.0( 1

}/log1/{2)}()ˆ({2

log)ˆ(

/log)(

1

000

yyn

cp

yynll

nynl

ynnl

Spring data: 96 < <335

vs. asymp normal approx 64 < <273 kcycles

Prob-value/P-value. See (7.28)

Choose T whose large values cast doubt on H0

Pr0(T tobs)

Example. Spring data

Exponential E(Y) =

H0: = 100?

.071.0368*2

)802.1|Pr(|)248.3(Pr

248.3)100(

}/log1/{2)(

/)ˆ()ˆ(

10n 3.168ˆ

2

10

2

ZvalueP

W

yynW

ynJI

y

Nesting

: p by 1 parameter of interest

: q by 1 nuisance parameter

Model with params (0, ) nested within (, )

Second model reduces to first when = 0

)ˆ,()ˆ,ˆ(

Note.

0

0

ll

Example. Weibull

params (,)

exponential when = 1

How to examine H0 : = 1?

1p on,distributiin

)]ˆ,()ˆ,ˆ([2)(

2

p

000

llWp

Spring failure times. Weibull

07-5.73E

07)-2(2.867E

)00.5|(|

02.25)]1,168()6,181([2

26.61)1,168( 2749.2)1,168(

)1,168()1,( 75.48)6,181(

227.6)6,181( )6,181()ˆ,ˆ(

1

ZPvalueP

ll

lEL

l

EL

Challenger data. Logistic regression

temperature x1 pressure x2

(0 , 1 , 2 ) = exp{}/(1+exp{})

= 0 + 1 x1 + 2 x2 linear predictor

loglike l(0 , 1 , 2 ) =

0 rj + 1 rj x1j + 2 rj x2j - m log(1+exp{j })

Does pressure matter?

214.)107(.2)24.1|Pr(|

)54.1(Pr :

54.177.*2

05.15),,(max

82.15)0,,(max

2

10

210,,

10,

210

10

Z

valueP

l

l

Model fit.

Are labor times Weibull?

Nest its model in a more general one

Generalized gamma.

0,,, ),exp()(

),,;(1

yyy

yf

Gamma for =1

Weibull for =1

Exponential for ==1

Likelihood results.

max log likelihood:

generalized gamma -250.65

gamma -251.12

Weibull -251.17

gamma vs. generalized gamma

- 2 log like diff:

2(-250.65+251.12) = .94

P-value Pr0 (12 > .94)

= Pr(|Z|>.969)

= 2(.166) = .332

Chi-squared statistics. Pearson's chi-squared

categories 1,...,k

count of cases in category i: Yi

Pr(case in i) = i 0 < i < 1 1k i =1

E(Yi ) = ni

var(Yi ) = i (1 - i )n

cov(Yi ,Yj ) = -i j n i j

E.g. k=2 case cov(Y,n-Y) = -var(Y) = -n1 2

= { (1 ,...,k ): 1k i = 1, 0<1 ,...,k <1}

dimension k-1

Reduced dimension possible?

model i () dim() = p

log like general model:

1k-1 yi log i + yk log[1-1 -...-k-1], 1

k yi = n

nYii /ˆ

log like restricted model:

l() = 1k-1 yi log i() + yk log[1-1()-...-k-1()]

likelihood ratio statistic:

k

pkiiiy1

2

1~)ˆ(/ˆlog2

if restricted model true

The statistic is sometimes written

W = 2 Oi log(Oi /Ei )

(Oi - Ei )2/Ei

)ˆ(E where i iii nyO

Pearson's chi-squared.

5)ˆ(ntion recommenda

~

)ˆ(/)]ˆ([2

p-1-k

12

i

kiii nnyP

Example. Birth data. Poisson?

12.9ˆ 92n arrivalsDaily

Split into k=13 categories

[0,7.5), [7.5,8.5),...[18.5,24] hours

O(bserved) 6 3 3 8 ...

E(xpected) 5.23 4.37 6.26 8.08 ...

P = 4.39

P-value Pr(112 > 4.39) = .96

Two way contingency table.

r rows and c columns

n individuals

Blood groups A, B, AB, O

A, B antigens - substance causing body to produce antibodies

2

2

2

)1)(1(202

26

2)1(35

2)1(179

O

BA

OBB

OAA

O

AB

B

A

group count model I model II

O = 1 - A - B

Question. Rows and columns independent?

W = 2 yij log nyij / yi.y.j

with yi. = j yij

~ k-1-p2 = (r-1)c-1)

2

with k=rc p=(r-1)+(c-1)

P = (yij - yi. y.j /n)2 / (yi. y.j /n)

~ (r-1)(c-1)2

Model 1

W = 17.66

Pr(12> 17.66) = Pr(|Z| > 4.202) = 2.646E-05

P = 15.73

Pr(12> 15.73) = Pr(|Z| > 3.966) = 7.309E-05

k-1-p = 4-1-2 = 1

Model 2

W = 3.17

Pr(|Z| > 1.780) = .075

P = 2.82

Pr(|Z|>1.679) = .093

Incorrect model.

True model g(y), fit f(y;)

valuebad"least " :

0D 0);(

yprobabilitin )()f(y; log /)ˆ(

ydiscrepancLiebler -Kullback

)(})f(y;

g(y)log{ );( minimizes

);( log )( maximizes ˆ

g

g

g

j

ffD

dyygnl

dyyggfD

yfl

Example 1. Quadratic, fit linear

Example 2. True lognormal, but fit exponential

dyyg

nYY

y

ZY

gg

g

)(})f(y;

g(y)log{ Minimizing

/)1(var }2/exp{YE ˆ

}2/exp{

/log :likelog

}exp{ Lognormal

222

2

Large sample distribution.

);()( ifresult mle

))()()(;( ~ ˆ 11

yfyg

IKIN ggggp

Model selection.

Various models:

non-nested

Ockham's razor.

Prefer the simplest model

Formal criteria.

}log)ˆ({2

})ˆ({2

nplBIC

plAIC

Look for minimum

Example. Spring failure

Model p AIC BIC

M1 12 744.8* 769.9*

M2 7 771.8 786.5

M3 2 827.8 831.2

M4 2 925.1 929.3

6 stress levels

M1: Weibull - unconnected , at each stress level

Sufficient statistics. The Poisson and the exponential can be summarized by (n, ).

Documents

Transcript of Sufficient statistics. The Poisson and the exponential can be summarized by (n, ).