Introduction to Statistical Data Analysis Lecture 7: The ...
STATISTICAL ANALYSIS - LECTURE 10
Transcript of STATISTICAL ANALYSIS - LECTURE 10
STATISTICAL ANALYSIS -LECTURE 10Dr. Mahmoud Mounir
STATISTICAL INFERENCE
Recap
Statistics
Descriptive Inferential
Estimation
Point Estimation
ยต = เดฅ๐ and p = เท๐
Interval
(Lower Limit, Upper Limit)
Hypothesis
Statistical Inference
โข The statistical inference is the process of making
judgment about a population based on the
properties of a random sample from the
population.
Estimators
Estimator (Sample Statistic)
Population Parameter
เดฅ๐ ESTIMATES ยต
S2 ESTIMATES ฯ2
S ESTIMATES ฯ
เท๐ ESTIMATES p
เดฅ๐๐- เดฅ๐๐ ESTIMATES ยต1 - ยต2
เท๐๐ โ เท๐๐ ESTIMATES p1 โ p2
Confidence Interval (CI)
Population Parameter = Estimator ยฑ Margin of Error (d)
Where:
Margin of Error (d) = Critical Value of Z * Standard Error
Population Parameter โ
[Estimator โ Margin of Error (d) , Estimator + Margin of Error (d)]
Confidence Interval (CI) for Single Mean
๐ = เดฅ๐ ยฑ ๐๐
๐
๐ โ [เดฅ๐ โ ๐๐
๐, เดฅ๐ + ๐
๐
๐]
Confidence Interval (CI) for Single Proportion
๐ = เท๐ ยฑ ๐เท๐ ๐ โ เท๐
๐
๐ โ [ เท๐ โ ๐เท๐ ๐โเท๐
๐, เท๐ + ๐
เท๐ ๐โเท๐
๐]
Standard Error for Different Sample Statistics
Estimator (Sample Statistic)
Standard Error
เดฅ๐
เท๐
เดฅ๐๐- เดฅ๐๐
เท๐๐ โ เท๐๐
Confidence Interval (CI)
(1-ฮฑ) Confidence Level
ฮฑ Confidence Coefficient
โ๐(๐โ ฮค๐ถ ๐) ๐(๐โ ฮค๐ถ ๐)
Critical Value for Z:
Confidence Interval (CI) for the Difference Between
Two Population Means
๐๐โ ๐๐ = เดฅ๐๐โ เดฅ๐๐ ยฑ ๐ฯ๐๐
๐๐
+ฯ๐๐
๐๐
๐๐โ ๐๐ โ
[ เดฅ๐๐โ เดฅ๐๐ โ ๐ฯ๐๐
๐๐
+ฯ๐๐
๐๐
, เดฅ๐๐โ เดฅ๐๐ + ๐ฯ๐๐
๐๐
+ฯ๐๐
๐๐
]
Confidence Interval (CI) for Difference Between
Two Population Proportions
(๐๐ โ ๐๐) = (เท๐๐ โ เท๐๐) ยฑ ๐เท๐๐๐โเท๐
๐
๐๐
+เท๐๐๐โเท๐
๐
๐๐
(๐๐ โ ๐๐) โ
[ เท๐๐โ เท๐๐ โ ๐เท๐๐๐โเท๐
๐
๐๐
+เท๐๐๐โเท๐
๐
๐๐
, เท๐๐โ เท๐๐ + ๐เท๐๐๐โเท๐
๐
๐๐
+เท๐๐๐โเท๐
๐
๐๐
]
Hypothesis Tests
Statistics
Descriptive Inferential
Estimation
Point Estimation
ยต = เดฅ๐ and p = เท๐
Interval
(Lower Limit, Upper Limit)
Hypothesis
Hypothesis
โข It is a claim or statement about a population
parameter.
โข For example, the average age of college students
equals 23 years.
Null Hypothesis (Ho)
โข Usually, the null hypothesis represents a statement
of โno effect,โ โno difference,โ or, put another way,
โthings havenโt changed.โ
โข For example, Ho: ฮผ = 23
Alternate Hypothesis (Ha) or (H1)
โข This test is a statistical test is designed to assess the
strength of the evidence (data) against the null
hypothesis.
โข For example, H1: ฮผ โ 23
Test Statistics
The Critical Region
Hypothesis Tests
Two-Tailed
Ho: ฮผ = 23
H1: ฮผ โ 23
Left-Tailed
Ho: ฮผ โฅ 23
H1: ฮผ < 23
Right-Tailed
Ho: ฮผ โค 23
H1: ฮผ > 23
The Critical Region
Fail to reject HoFail to reject Ho
Fail to reject Ho
ฮฑ: Significance Level
Steps for Hypothesis Test
โข Step 1: State the null and alternative hypothesis (Ho and
H1).
โข Step 2: Specify the suitable test statistic.
โข Step 3: Compute the critical value for the statistic.
โข Step 4: Make a decision.
Reject Ho or Fail to reject Ho
โข Step 5: Make a conclusion
Accept H1 or we donโt have enough evidence to support H1
In recent years, the mean age of all college students
in a city has been 23. A random sample of 42
students revealed a mean age of 23.8 suppose their
ages are normally distributed with a population
standard deviation of ฯ=2.4 can we infer at ฮฑ=0.05
that the population mean has changed?
Example (1)
Solution๐ = 23 ๐ = 42 ๐ = 2.4 าง๐ฅ = 23.8
Step 1: State the null and alternative hypothesis (Ho and H1)
Null hypothesis (Ho) ๐ = 23
alternative hypothesis (H1) ๐ โ 23
Step 2: Specify the suitable test statistic
Since ๐ โฅ 30 and ๐ is known, we will use Z-statistic
๐ = ๐๐ ๐ = ๐. ๐ ๐ง = ๐๐ เดค๐ฑ = ๐๐. ๐
๐เดค๐ฑ = ๐ = ๐๐
๐เดค๐ฑ =๐
๐ง
=๐. ๐
๐๐= ๐. ๐๐
๐ณเดค๐ฑ =เดค๐ฑ โ ๐
๐เดค๐ฑ
=๐๐. ๐ โ ๐๐
๐. ๐๐= ๐. ๐๐
SolutionStep 3: Compute the critical value for the statistic
๐ โ ๐ = ๐. ๐๐ ๐ = ๐. ๐๐๐
๐=. ๐๐๐
๐ โ๐
๐= ๐. ๐๐๐ ยฑ๐ณ๐โ๐/๐ = ยฑ๐. ๐๐
Solution
Step 5: Make a conclusion
We accept ๐ป1, there is enough evidence that
mean age has changed at ๐ผ = 0.05
Step 4: Make a decision
๐ง าง๐ฅ = 2.16 which is greater than 1.96
=> ๐ง าง๐ฅ = 2.16 > 1.96, so we reject ๐ป0
Fail to reject Ho
In the previous example, if the confidence level (ฮฑ)
= 0.02, what can we infer about the population
mean ?
Example (2)
Solution๐ = 23 ๐ = 42 ๐ = 2.4 าง๐ฅ = 23.8
Step 1: State the null and alternative hypothesis (Ho and H1)
Null hypothesis (Ho) ๐ = 23
alternative hypothesis (H1) ๐ โ 23
Step 2: Specify the suitable test statistic
Since ๐ โฅ 30 and ๐ is known, we will use Z-statistic
๐ = ๐๐ ๐ = ๐. ๐ ๐ง = ๐๐ เดค๐ฑ = ๐๐. ๐
๐เดค๐ฑ = ๐ = ๐๐
๐เดค๐ฑ =๐
๐ง
=๐. ๐
๐๐= ๐. ๐๐
๐ณเดค๐ฑ =เดค๐ฑ โ ๐
๐เดค๐ฑ
=๐๐. ๐ โ ๐๐
๐. ๐๐= ๐. ๐๐
SolutionStep 3: Compute the critical value for the statistic
๐ โ ๐ = ๐. ๐๐ ๐ = ๐. ๐๐๐
๐=. ๐๐
๐ โ๐
๐= ๐. ๐๐ ยฑ๐ณ๐โ๐/๐ = ยฑ๐. ๐๐
Solution
Step 5: Make a conclusion
We donโt have enough evidence to accept ๐ป1, or
we donโt have enough evidence that mean age
has changed at ๐ผ = 0.02
Step 4: Make a decision
๐ง าง๐ฅ = 2.16 which is greater than 1.96
=> ๐ง าง๐ฅ = 2.16 < 1.96, so we fail to reject ๐ป0
Fail to reject Ho
A random sample of 27 observations from a large
population has a mean of 22 and a standard
deviation of 4.8. can we conclude at ฮฑ = 0.01 that a
population mean is significantly below 24?
Example (3)
Solution๐ = 2๐ ๐ = ๐๐ ๐ฌ = ๐. ๐ าง๐ฅ = ๐๐
Step 1: State the null and alternative hypothesis (Ho and H1)
Null hypothesis (Ho) ๐ โฅ 23
alternative hypothesis (H1) ๐ < 23
Step 2: Specify the suitable test statistic
Since ๐ < 30 and ๐ is unknown, we will use t-statistic
๐ = ๐๐ ๐ = ๐๐ ๐ฌ = ๐. ๐ เดฅ๐ = ๐๐
๐เดค๐ฑ = ๐ = ๐๐
๐เดค๐ฑ =๐
๐ง
=๐. ๐
๐๐= ๐. ๐๐๐
๐ =เดค๐ฑ โ ๐
๐เดค๐ฑ
=๐๐ โ ๐๐
๐. ๐๐๐= โ๐. ๐๐๐
SolutionStep 3: Compute the critical value for the statistic
๐ โ ๐ = ๐. ๐๐ ๐ = ๐. ๐๐
โ๐๐ ๐๐ โ ๐ = โ๐๐.๐๐ ๐๐ = โ๐. ๐๐๐
Solution
Step 5: Make a conclusion
We donโt have enough evidence to accept ๐ป1, or
we fail to support ๐ป1
Step 4: Make a decision
๐ก = 2.165 which is greater than -2.479 ,
So we fail to reject ๐ป0
Fail to reject Ho-tฮฑ