wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would...

33
Chapter 06 - Multiple Linear Regression Analysis CHAPTER 6 Answers to End of Chapter Problems 6.1 a. The correlations we are most interested in are those against college GPA because that is the dependent variable. Hours studied has the highest correlation, then SAT score, and lastly HS GPA. All of the independent variables are positively and relatively strongly related to college GPA. We can also see that high-school GPA and hours studied have a positive and somewhat strong relationship with each other and high-school GPA and SAT score and SAT score and hours studied have a positive but not as strong relationship. b. The correlation matrix sheds light on linear relationships but does offer information on the marginal effects such as the effect that hours studied has on College GPA holding SAT score and high-school GPA constant. 6.2 a. The two conditions are if ^ δ 1 =0 (this would mean that IQ and education are not linearly related) and if ^ β 2 =0 (this would mean that log(wage) and IQ are not linearly related). b. ^ β 2 =0 (this would mean that log(wage) and IQ are not linearly related) 6-1 Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.

Transcript of wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would...

Page 1: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

CHAPTER 6

Answers to End of Chapter Problems

6.1 a. The correlations we are most interested in are those against college GPA because that is the dependent variable. Hours studied has the highest correlation, then SAT score, and lastly HS GPA. All of the independent variables are positively and relatively strongly related to college GPA. We can also see that high-school GPA and hours studied have a positive and somewhat strong relationship with each other and high-school GPA and SAT score and SAT score and hours studied have a positive but not as strong relationship.

b. The correlation matrix sheds light on linear relationships but does offer information on the marginal effects such as the effect that hours studied has on College GPA holding SAT score and high-school GPA constant.

6.2 a. The two conditions are if δ 1=0 (this would mean that IQ and education are not linearly related) and if β2=0 (this would mean that log(wage) and IQ are not linearly related).

b. β2=0 (this would mean that log(wage) and IQ are not linearly related)

6-1Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 2: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

6-2Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 3: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

δ 1=0 (this would mean that IQ and education are not linearly related)

c. It is genearlly the case the IQ and education are related to each other and IQ is related to log(wage)

6-3Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 4: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

6.3 a. I would expect β1=~β1

b. Again, I would expect β1=~β1

6.4 I would advise my friend that it would be easiest to just regress y on x and z and perform a t-test on the coefficient on z. The method that the friend suggested is not quite right. The friend should regress z on x and save the residuals and then regress y on those residuals. This will give your friend the same results as the first method but the standard errors of the second method will be wrong.

6-4Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 5: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

6.5 a.

b. R-squared is 1,237,904/2,347,829 = .5273. This means that 52.73% of the variation of Sales is related to advertising, bonus, relative price, and competition.

c. 100%

d. Hypothesis:H 0 : β1=β2¿ β3=β4=0H 1: at least one β i isnot equal¿0

Test Statistic F-statistic = 12.5471 and the p-value of the F-test is 0.001.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 0.001< 0.01 we reject H0 and conclude that at least one of advertising, bonus, relative price, and competition is related to sales at the 1% level.

e. BonusHypothesis:

H 0 : β2=0H 1: β2≠0

Test Statistic t-statistic = 11.6/3.02= 3.84 and the p-value of the t-test is 0.003.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 0.003 < 0.01 we reject H0 and conclude that bonus is statistically significant at the 1% level.

f. ADV: On average, holding bonus, relative price, and competition constant, if the amount spend on advertising increases by $1000, then sales go up by $10,800.

Bonus: On average, holding advertising, relative price, and competition constant, if the amount spend on bonuses increases by $1000, then sales go up by $11,600.

RELPRICE: On average, holding advertising, bonus, relative price, and competition constant, if the Medicorp’s relative price increases by $1, then sales goes down by 122,000.

6-5Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 6: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

g. The 95% confidence interval is

β2± t α2 , n−k−1

se β 2

11.6 ± (2.02) (3.02)(5.5, 17.7)

This interval does not include 0, which means that Bonus is statistically significant at the 5% level.

h. Salesi= β0+ β1 Advi+ β2Bonu s i+ β3 Relpric e i+ β4Compe tiSalesi=889.1+10.8 Adv i+11.6Bonu si−122Relpric ei−0.18Compe ti

Salesi=889.1+10.8(60)+11.6(45)−122(0.7)−0.18(100)Salesi=$ 2,073,520

6.6 a.

b. R-squared is 1,108,892/2,807,27 9.334= .395. This means that 39.5% of the variation of price is related to speed and charge.

c. Hypothesis:H 0 : β1=β2=0

H 1: at least one β i isnot equal¿0Test Statistic F-statistic = 6.203 and the p-value of the F-test is 0.0048.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 0.0048< 0.01 we reject H0 and conclude that at least one of speed or charge is related to price at the 1% level.

d. SpeedHypothesis:

H 0 : β1=0H 1: β1≠0

Test Statistic t-statistic = 10.1506/14.5019= 0.7 and the p-value of the t-test is 0.003.Rejection RuleReject H0 if the |t-stat| > 2.1.Rejection RuleBecause 0.7 < 2.1 we fail to reject H0 and conclude that speed is statistically unrelated to price at the 5% level.

6-6Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 7: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

6-7Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 8: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

e. Speed: On average, holding charge constant, if the speed increases by 1megahertz then price go up by $10.15.

Charge: On average, holding speed constant, if the charge increases by one minute then price go up by $1.63.

f. The 95% confidence interval is

β1± t α2 , n−k−1

se β 1

10.15 ± (2.1) (14.5) (-22.19, 42.49)

This interval includes 0, which means that Speed is statistically insignificant at the 5% level. This interval brackets β1 .

g. sy∨ x=√89,389 = 298.98.

h. Pricei= β0+ β1Speed i+ β2Charge i

Pricei=1500.6+10.15 Speed i+1.63Chargei

i. Pricei=1500.6+10.15(33)+1.63(305)Pricei=$2,332.7

6.7 a. On average, holding population within one mile and population beyond one mile constant, if the number of competitors within one mile goes up by 1 then sales goes down by $4,200.

b. On average, holding competition within one mile and population beyond one mile constant, if the population within one mile goes up by 1000 people then sales goes down by $6,800.

c. On average, holding competition within one mile and population within one mile constant, if the population beyond one mile goes up by 1000 people then sales goes down by $2,300.

d. Salesi=10.1−4.2 (2 )+6.8 (8 )+2.3(13) Salesi = $86,000

6.8 a. R-squared is 3660.74/4190.95= .8735. This means that 87.35% of the variation of risk is related to age, pressure, and family.

b. sy∨ x=√33.14 = 5.7567. This is the variation left in risk when the effects of age, pressure and family have been removed.

c. Age: On average, holding pressure and family constant, if the age of a person increases by one year then risk go up by 1.08 or 1%.

6-8Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 9: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

6-9Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 10: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

Pressure: On average, holding age and family constant, if the blood pressure of a person increases by one then risk go up by .25 or .25%.

Family: On average, holding age and pressure constant, if the number of strokes the person’s parents have had goes up by one then risk goes up by 8.74 or 8.74%.

d. Hypothesis:H 0 : β1=β2=β3=0

H 1: at least one β i isnot equal¿0Test Statistic F-statistic = 36.82 and the p-value of the F-test is 2.06E-07.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 2.06E-07< 0.01 we reject H0 and conclude that at least one of age, pressure or family is related to price at the 1% level.

e. PressureHypothesis:

H 0 : β2=0H 1: β2≠0

Test Statistic t-statistic = 0.25/0.05= 5.Rejection RuleReject H0 if the |t-stat| > 1.75.Rejection RuleBecause 5 > 1.75 we reject H0 and conclude that pressure is statistically significant at the 10% level.

f. The 95% confidence interval is

β1± t α2 , n−k−1

se β 1

1.08 ± (2.12) (0.17)(0.72, 1.44)This interval does not include 0, which means that age is statistically significant at the 5% level. This interval bracketsβ1 .

g. Riski= β0+ β1 Age i+ β2 Pressurei+ β3 Family i

Riski=−91.76+1.08 Agei+0.25 Pressurei+8.74 Family i

Riski=−91.76+1.08(70)+0.25 (165)+8.74(1) Riski=33.83 or their risk of a stroke is 33.83%

h. The prediction could be above 100% or below 0%.

6-10Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 11: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

6.9 a. On average, holding square footage and niceness constant, if number of bathrooms increases by one then the price of the house increases by $10,043.

6-11Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 12: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

b. Increasing a niceness rating by 2 will increase the value of the house by (2)(10,042)=$20,084. If it costs $32,500 to increase the niceness rating from 5 to 7 then when selling a house 20,084/32,500 = .618 or 61.8%.

c. Increasing a niceness rating by 2 will increase the value of the house by (2)(10,042)=$20,084. If it costs $24,000 to increase the niceness rating from 2 to 4 then when selling a house 20,084/24,000 = .8368 or 83.68%.

d. From this question we see that it is more expensive to increase the niceness rating from 5 to 7 than it is to increase the niceness rating from 2 to 4 but the way the regression model is specified these increases change the value of the house by the same amount. It would be better to specify the model such that increasing the niceness rating has a non-constant marginal effect on price.

e. y i= β0+ β1 x1 ,i+ β2 x2 ,i+ β3 x3 , i

y i=24.976+0.0526 x1 , i+10.043 x2 ,i+10.042 x3 ,i

y i=24.976+0.0526(3200)+10.043(3)+10.042(3)y i=$253,551

6.10 a. To test this hypothesis we need to change the regression model to obtain the standard error.

H 0 : β1=β2 or β1−β2=0H 1: β1≠β2 or β1−β2≠0y i=β0+β1 x1 ,i+β2 x2 ,i+ε set β1−β2=θ and then solve for β1

β1=θ+β2 and then substitute this equation into the model for β1

y i=β0+(θ+β2) x1, i+ β2 x2 ,i+εy i=β0+θ x1 ,i+β2 x1 ,i+β2 x2 ,i+εy i=β0+θ x1 ,i+β2(x¿¿1 , i+ x2 , i)+ε ¿

This equation says that we need to create a new variable, (x¿¿1 ,i+x2 ,i)¿, and the regress y i on x1 ,i and (x¿¿1 ,i+x2 ,i) .¿ The coefficient estimate on x1 ,i is θ and then the standard error onx1 ,i is the standard error of θ. The t-stat next to x1 ,i is the t-stat for the hypothesis and the p-value on x1 ,i is the p-value for this hypothesis. The rejection rule is to reject H0 if the p-value < 0.05.b. To generate a confidence interval around a prediction we need to change the data. We need to create two new columns in Excel. The first column is generated by taking each x1 and subtract off 6 (the value of the prediction for x1) for observations 1 all the way to n and name it (x1−6). The second column is generated by taking each x2 and subtract off 12 (the value of the prediction for x2) for observations 1 all the way to n and name it (x2−12). Once these two new columns are created then you need to regress y on (x1−6) and (x2−12). The predicted value is the coefficient estimate next to the intercept, the standard error is in the next column, and

6-12Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 13: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

6-13Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 14: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

the 95% prediction interval is the confidence interval in the intercept row of the coefficient estimates.

6.11 a. The sample regression function is y i= β0+ β1 x1+ β2x2+ β3 x3+ β4 x 4

y i=14,122.2409+63.1533 x1+10.0958 x2+31.5062 x3+10.4609 x 4

b. On average, holding teams win/loss percentage, opponent win/loss percentage, and games played constant, when temperature at game time goes up by one degree then the game attendance goes up by 10.46 people.

c. y i=14,122.2409+63.1533 (60 )+10.0958 (40 )+31.5062 (50 )+10.4609(70)y i=20,622.8439 people.

d. To increase attendance the athletic director should win more games, play teams with a better record, play more games, and play games when the temperature is hotter.

Answers to End of Chapter Exercises

E6.1 Regression of Experience on Education

Regression of wage on the residuals from the regression above

6-14Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 15: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

Regression of wage on experience and education

6-15Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

These match!

Page 16: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

E6.2 a

Number of Friends: On average, holding age and member in months constant, if the number of friends goes up by 1 then the numbers of times tagged in a picture goes up by .16 times.

Age: On average, holding number of friends and member in months constant, if the age of a person increases by 1 year then the numbers of times tagged in a picture goes down by .1.93 times.

Member in Months: On average, holding number of friends and member in months constant, if the how long the person has been a member of facebook goes up by 1 month then the numbers of times tagged in a picture goes up by .01 times.

b. Hypothesis:H 0 : β1=β2¿ β3=0H 1: at least one β i isnot equal¿0

Test Statistic F-statistic = 128.32 and the p-value of the F-test is 1.14 E – 29.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 1.14E-29< 0.01 we reject H0 and conclude that at least one of number of friends, age, or member in months is related to number of times tagged at the 1% level.

6-16Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 17: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

c. Number of FriendsHypothesis:

H 0 : β1=0H 1: β1≠0

Test Statistic t-statistic = 10.15 and the p-value of the t-test is 8.63E-16.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 8.63E-16 < 0.01 we reject H0 and conclude that number of friends is statistically significant at the 1% level.

AgeHypothesis:

H 0 : β2=0H1: β2≠0

Test Statistic t-statistic = -5.75 and the p-value of the t-test is 1.76E-16.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 1.76E-16 < 0.01 we reject H0 and conclude that the age of the person is statistically significant at the 1% level.

Number of MonthsHypothesis:

H 0 : β3=0H 1: β3≠0

Test Statistic t-statistic = 0.079 and the p-value of the t-test is 0.94.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 0.94 > 0.10 we reject H0 and conclude that number of months a person is a member is not statistically significant at the 10% level.

d. The R2 is 0.8351. This number tells us that 83.51 percent of the variation in the number of times a person is tagged is explained by number of friends, age, and member in months.

e. ¿Taggedi= β0+ β1 Friendsi+ β2age i+ β3membe r i¿Taggedi=27.0287+0.1615 Friendsi−1.929agei+0.0112member i¿Taggedi=27.0287+0.1615(250)−1.929(25)+0.0112(18)¿Taggedi=19.3842

6-17Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 18: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

The predicted number of times tagged is 19.38.

6-18Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 19: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

f. To obtain this prediction interval in excel take each observation for each independent variable and subtract off the value you would like to predict for and regress times tagged on these new variables. For example, take each number of friends observation and subtract off 250, take each age observation and subtract off 25, and take each member observation and subtract off 18.

The intercept contains the predicted value and the 95% confidence interval output for the intercept contains the 95% confidence interval for the mean.

This 95% confidence prediction interval for the mean is (-5.8, 44.57).

g. To obtain the standard error for the individual prediction interval, take the standard error of the prediction (12.645) and add the standard error of the regression (27.1027). To obtain the 95% prediction interval multiply this standard error by the critical value (2) and add and subtract this from the predicted value.

95% confidence prediction interval for an individual is (-60.11, 98.88). Notice how much bigger this is than the interval that was obtained in part f.

6-19Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 20: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

E6.3.

a. Age: On average, holding hours working out, hours at work and number of times sick constant, if the age of a person increases by one year then their BMI goes up by 0.145.

Work Out: On average, holding age, hours at work and number of times sick constant, if the number of hours a person spends working out in a week increases by one year then their BMI goes down by 0.195.

Work: On average, holding age, hours working out, and number of times sick constant, if the number of hours spent at work in a week increases by one year then their BMI goes up by 0.151.

Sick: On average, holding age, hours working out, and hours at work constant, if the number of times a person is sick in a year increases by one then their BMI goes up by 0.4.

b. Hypothesis:H 0 : β1=β2¿ β3¿ β4=0H 1: at least one β i isnot equal¿0

Test Statistic F-statistic = 55.65 and the p-value of the F-test is 1.95 E – 17.

6-20Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 21: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 1.95 E – 17 < 0.01 we reject H0 and conclude that at least one of age, workout, work, or sick is related to BMI at the 1% level. This model is statistically significant.

c. AgeHypothesis:

H 0 : β1=0H 1: β1≠0

Test Statistic t-statistic = 3.03 and the p-value of the t-test is 0.0039.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 0.0039 < 0.01 we reject H0 and conclude that age is statistically significant at the 1% level.

Work OutHypothesis:

H 0 : β2=0H1: β2≠0

Test Statistic t-statistic = -2.69 and the p-value of the t-test is 0.0097.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 0.0097 < 0.01 we reject H0 and conclude that hours working out in a week is statistically significant at the 1% level.

WorkHypothesis:

H 0 : β3=0H 1: β3≠0

Test Statistic t-statistic = 4.26 and the p-value of the t-test is 9.39E-05.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 9.39E-05 < 0.01 we reject H0 and conclude that hours working in a week is statistically significant at the 1% level.

6-21Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 22: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

6-22Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 23: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

SickHypothesis:

H 0 : β4=0H 1: β4≠0

Test Statistic t-statistic = 5.99 and the p-value of the t-test is 2.61E-05.Rejection RuleReject H0 if the p-value < 0.01.Rejection RuleBecause 2.61E-05 < 0.01 we reject H0 and conclude that number of times a person is sick in a year is statistically significant at the 1% level.

d. The R2 is 0.8226. This number tells us that 82.26 percent of the variation in BMI is explained by age, work out, work, and sick.

e. BMIi= β0+ β1 Age i+ β2WorkOu ti+ β3Work i+ β4 Sic ki

BMIi=11.74+0.1451 Agei−0.1955Work Out i+0.1512Wor k i+0.4006 Sic k i

BMIi=11.74+0.1451 (35 )−0.1955(3)+0.1512(45)+0.4006(2)The predicted number of times tagged is 23.8381.

f. To obtain this prediction interval in excel take each observation for each independent variable and subtract off the value you would like to predict for and regress BMI on these new variables.

The intercept contains the predicted value and the 95% confidence interval output for the intercept contains the 95% confidence interval for the mean.

This 95% confidence prediction interval for the mean is (22.54, 25.14).

6-23Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.

Page 24: wordpress.viu.cawordpress.viu.ca/danielsimons/files/2016/09/SMChap006.docx · Web view6.4I would advise my friend that it would be easiest to just regress y on x and z and perform

Chapter 06 - Multiple Linear Regression Analysis

g. To obtain the standard error for the individual prediction interval, take the standard error of the prediction (0.6466) and add the standard error of the regression (1.9427). To obtain the 95% prediction interval multiply this standard error by the critical value (2.02) and add and subtract this from the predicted value.

95% confidence prediction interval for an individual is (18.61, 29.07). Notice how much bigger this is than the interval that was obtained in part f.

6-24Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill

Education.