As part of a study on transportation safety, the U.S. Departmentof Transportation collected data on the number of fatal accidentsper 1000 licenses and the percentage of licensed drivers under theage of 21 in a sample of 42 cities. Data collected over a one-yearperiod follow. These data are contained in the file named“Safety.csvâ€.
1- Find the sample mean and standard deviation for eachvariable. Round your answers to the nearest thousandth.
2- Use the function lm() in R to run a simple linear regressionmodel on the data provided. Use the function summary() in R togenerate the regression output. Use the function aov() in R togenerate the corresponding ANOVA table. You ought to be able todetermine which is the dependent variable and which is theindependent variable in this SLR model.
Please copy your R code andthe result and paste them here.
3- Write down the estimated regression function below andprovide a practical interpretation of the coefficient of theindependent variable.
4- Please find a 95% confidence interval for the coefficient ofthe independent variable and provide a practical interpretation ofthis interval.
5- At the 5% level of significance, is there a significantrelationship between the two variables? Why or why not?
6- What is the value of the coefficient of determination forthis simple linear regression model? Provide a brief interpretationof this value.
7- Use the information from the ANOVA table to compute thestandard error of estimate, a.k,a, residual standard error. Thisvalue must match the residual standard error in the regressionsummary.
8- What is the point estimate of the expectednumber of fatal accidents per 1000 licenses if there are 10%drivers under age in a city?
9- Suppose we want to develop a 95% confidence interval for theaverage number of fatal accidents per 1000 licenses for all thecities with 10% of drivers under age 21. What is the estimate ofthe standard deviation for this confidence interval?
10-Suppose we want to develop a 95% confidence interval for theaverage number of fatal accidents per 1000 licenses for all thecities with 10% of drivers under age 21. Compute the t value andthe margin of error needed for this confidence interval.
Please copy your R code andthe result and paste them here.
11-Provide a 95% confidence interval for the average number offatal accidents per 1000 licenses for all the cities with 10% ofdrivers under age 21 and a practical interpretation to thisconfidence interval.
12- Suppose we want to develop a 95% prediction interval for thenumber of fatal accidents per 1000 licenses for a city with 10% ofdrivers under age 21. What is the estimate of the standarddeviation for this prediction interval?
13- Suppose we want to develop a 95% prediction interval for thenumber of fatal accidents per 1000 licenses for a city with 10% ofdrivers under age 21. Compute the margin of error needed for thisprediction interval.
14- Provide a 95% prediction interval for the number of fatalaccidents per 1000 licenses for a city with 10% of drivers underage 21 and a practical interpretation to this predictioninterval.
Percent Under 21 | Fatal Accidents per 1000 |
13 | 2.962 |
12 | 0.708 |
8 | 0.885 |
12 | 1.652 |
11 | 2.091 |
17 | 2.627 |
18 | 3.83 |
8 | 0.368 |
13 | 1.142 |
8 | 0.645 |
9 | 1.028 |
16 | 2.801 |
12 | 1.405 |
9 | 1.433 |
10 | 0.039 |
9 | 0.338 |
11 | 1.849 |
12 | 2.246 |
14 | 2.855 |
14 | 2.352 |
11 | 1.294 |
17 | 4.1 |
8 | 2.19 |
16 | 3.623 |
15 | 2.623 |
9 | 0.835 |
8 | 0.82 |
14 | 2.89 |
8 | 1.267 |
15 | 3.224 |
10 | 1.014 |
10 | 0.493 |
14 | 1.443 |
18 | 3.614 |
10 | 1.926 |
14 | 1.643 |
16 | 2.943 |
12 | 1.913 |
15 | 2.814 |
13 | 2.634 |
9 | 0.926 |
17 | 3.256 |
Ps: I do appreciate your help But please do not simply copy andpaste irrelevant answer, Thanks