scholar.harvard.eduPower Analyses I Power Analysis: If a given alternative hypothesis were true, how...

Lecture 15:Power and

Confidence Intervals/Interval EstimationAPI-201Z

Maya Sen

Harvard Kennedy Schoolhttp://scholar.harvard.edu/msen

Announcements

I Second midterm just over two weeks, November 15

I Will be in-class, closed book and closed note (will provideformula sheet, probability tables)

I Have posted old exams and problem sets

I Review Session Tuesday 11/13, 4-5:15pm, Rubenstein 304

Announcements

Roadmap

I Be comfortable with four kinds of hypothesis tests:I Single meanI Difference in meansI Single proportionI Difference in proportions

I Proper interpretation of Hypothesis TestsI Today:

I Power for hypothesis testingI Interval estimationI Confidence IntervalsI Proper Interpretation

Roadmap

I Be comfortable with four kinds of hypothesis tests:

I Single meanI Difference in meansI Single proportionI Difference in proportions

Roadmap

I Be comfortable with four kinds of hypothesis tests:I Single mean

I Difference in meansI Single proportionI Difference in proportions

Roadmap

I Be comfortable with four kinds of hypothesis tests:I Single meanI Difference in means

I Single proportionI Difference in proportions

Roadmap

I Be comfortable with four kinds of hypothesis tests:I Single meanI Difference in meansI Single proportion

I Difference in proportions

Roadmap

I Proper interpretation of Hypothesis Tests

I Today:I Power for hypothesis testingI Interval estimationI Confidence IntervalsI Proper Interpretation

Roadmap

I Power for hypothesis testing

I Interval estimationI Confidence IntervalsI Proper Interpretation

Roadmap

I Power for hypothesis testingI Interval estimation

I Confidence IntervalsI Proper Interpretation

Roadmap

I Power for hypothesis testingI Interval estimationI Confidence Intervals

I Proper Interpretation

Roadmap

Type I versus Type II Errors

H0 is true H0 is not true

Reject H0

Do not reject H0

Reject H0

Do not reject H0

Reject H0 GoodDo not reject H0 Good

Reject H0 Bad! GoodDo not reject H0 Good Bad!

Reject H0 Type 1 GoodDo not reject H0 Good Type 2

I Both types of errors can occur w/ a hypothesis test

I Good practice: Before study carried out, set limits on howmuch error we would tolerate

Type I Error

I Type I: Null hypothesis rejected when in fact it is trueI Akin to false positive

I Mammogram analogy: Positive test, despite not having thedisease

I P(Type I error) = P(Rejecting H0|H0 true) = α

I α: Also referred to as level of significance (which gives us acritical value)

I Rejection region: Values of X to left/right of values of thecritical value for which the hypothesis test would reject thenull

I Very common to set α = 0.05, α = 0.10, or α = 0.01

I Question: Why would we want to avoid Type I Error?

I Question: How do you interpret a level of significance of 0.05?

Type I Error

I Type I: Null hypothesis rejected when in fact it is true

I Akin to false positiveI Mammogram analogy: Positive test, despite not having the

disease

Type I Error

Type II Error

I Type II: Null hypothesis is not rejected when in fact it is falseI Akin to false negative

I Mammogram analogy: Negative test, despite having thedisease

I P(Type II error) = P(Not rejecting H0|H0 false)

I Often set at P(Type II error) = 0.20 = β

Type II Error

I Type II: Null hypothesis is not rejected when in fact it is false

I Akin to false negativeI Mammogram analogy: Negative test, despite having the

disease

Type II Error

Type II Error and Power

I 1 − β = P(Rejecting H0|H0 false)

I Probability of correctly rejecting the null

I Known as the power of a test

I Sometimes considered less important from substantiveperspective

I However: consider your specific problem → in some instances,either error may be more costly

I Note: Exact power calculation will depend on your alternativeHa

Power Analyses

I Power Analysis: If a given alternative hypothesis were true,how good would our test be at (correctly) rejecting the null?

I Somewhat similar to hypothesis test set up, but for purposesof calculating power, assume alternative true:P(Rejecting H0|Ha true)

I Usually approached in one of two ways:I 1) You are asked to calculate the sample size you will need to

detect an effect (difference between null and alternative) of acertain size

I Need to state a precise alternative null

I 2) You are asked to calculate what is the smallest effect(difference) you could detect given a sample size

→ Both mean that power analyses usually done before data arecollected (and $ spent!)

Power Analyses

I Usually approached in one of two ways:

I 1) You are asked to calculate the sample size you will need todetect an effect (difference between null and alternative) of acertain size

Power Analyses

Power Analysis Example

I You are an urban policy expert studying the effects of recentconstruction on commuting times

I Suppose known average time spent commuting is 110minutes/week, with standard deviation of 30 minutes

I Take sample of 81 commutersI Find power of a hypothesis test when alternative hypothesis is

that commuting hours equals 120 minutesI So alternative is that the effect of construction is 20 minutes

I So H0 = 110, HA = 120, σ = 30, n = 81

I Take sample of 81 commuters

I Find power of a hypothesis test when alternative hypothesis isthat commuting hours equals 120 minutes

I So alternative is that the effect of construction is 20 minutes

I So H0 = 110, HA = 120, σ = 30, n = 81

that commuting hours equals 120 minutes

I So alternative is that the effect of construction is 20 minutes

I So H0 = 110, HA = 120, σ = 30, n = 81

Steps:

I Because you’ll eventually do a hypothesis test assuming thenull, first calculate the rejection region under standard set-up(given your α values)

I Calculate critical values to define the rejection region

I Assume alternative hypothesis is true, µA = 120

I Calculate probability of not being in rejection region when thisalternative is true (this is β)

I And then finally calculate 1 − β to get Power

I Note: Can repeat for different values of µA (e.g.,µA = 120, 130, 140, etc) to plot a power curve or powerfunction

Steps:

Power Analysis Intuition

100 110 120 130

Normal distribution under H0 and H1

Distribution under H0

100 110 120 130

Critical value

100 110 120 130

Critical value

100 110 120 130

Critical value

100 110 120 130

Critical value

100 110 120 130

Critical value

I Step 1: Calculate rejection region under the null hypothesisbeing true

I Remember that test statistic for single mean is

z =X − µ

σ/√n

I Assuming one-tailed test, we reject H0 if z > 1.645 (α = 0.05)

I Step 2: Calculate critical value

1.645 <X − 110

30/√

115.48 < X

I That is, we would reject null for all X > 115.48 (our rejectionregion)

z =X − µ

σ/√n

1.645 <X − 110

30/√

115.48 < X

z =X − µ

σ/√n

1.645 <X − 110

30/√

115.48 < X

z =X − µ

σ/√n

1.645 <X − 110

30/√

115.48 < X

z =X − µ

σ/√n

1.645 <X − 110

30/√

115.48 < X

z =X − µ

σ/√n

1.645 <X − 110

30/√

115.48 < X

I Step 3: Assume alternative true, µA = 120

I Given alternative being true, how often would we (correctly)reject null?

I That is, what is P(X > 115.48) if µ = 120?

z =X − µ

σ/√n

=115.48 − 120

30/√

= −1.36

I Step 4: Finally β = P(Z < −1.36), which is 0.087