Data Analysis Project You will be assessed on: · getting appropriate Excel output and plots (note: you must use Excel for this project!) · validity of your statistical conclusions and interpretations...

Please help me with my project. I have added the files below.


Data Analysis Project You will be assessed on: · getting appropriate Excel output and plots (note: you must use Excel for this project!) · validity of your statistical conclusions and interpretations · commentary and critical thinking · writing style, including spelling and grammar · creativity, professionalism, apparent effort, and overall presentation Formatting Details: Your final report can be a maximum of 4 pages long. It can be shorter. Reports that are significantly over the page limit will be penalized. Your analysis should be written in a word processing software such as Microsoft Word, with double-spaced text and size 12 font. Any figures or tables created in Excel should have proper axis labels and titles, and then be copied into your Word document. NO SCREEN SHOTS! Figures and tables should be scaled to an appropriate size so as to be legible, but still allow you to complete your commentary within the page allotment. You can include text above, below, or around your figures as you see fit. • Summary statistics and interpretations should include units, where appropriate. • Raw data, as summarized in your Excel spreadsheet, should NOT be included in your report. Your report should only include summary tables and graphs/charts. The purpose of the assignment is to: · develop statistical communication skills through a formal written report of the statistical analysis conducted. The data for this project can be found on in the file Rock and Roll Marathon 2015.xlsx. In brief, the data set contains a random sample of 100 runners (male and female) from the 2015 Rock and Roll Marathon held in Las Vegas, NV. The population is all runners (male or female) in the 2015 marathon. The variables in the data set include: • Runner: Runner ID - this variable is only used to identify individuals, not for analysis. • Gender: M = male, F = female, as identified on each Runner’s entry form. • Seconds: The time, in seconds, taken to complete the marathon. • MPH: Miles per hour, which measures the typical speed of each runner through the marathon. For this analysis you will need to: • Select ONE gender to analyze (you do not need to analyze both). Filter your data to remove the gender you are not going to consider. Clearly identify in your report which gender you selected. • Select ONE race variable (Seconds or MPH, you do not need to analyze both). You can ignore the column for the variable you are not going to consider. Clearly identify in your report which variable you selected. Once you have identified the gender and variable you will consider, you can conduct the following analysis: • Create a histogram and boxplot of the race variable you selected. Comment on the shape (i.e. distribution) of the sample data. Based on the distribution of the sample data, what can you conclude about the distribution of the variable you selected in the population? Include your histogram (with appropriate titles and axis labels) along with your commentary in your report. • The population from which we are sampling had the following means and standard deviations for the different genders / race variables: Gender Race Variable Mean (µ) Standard Deviation (σ) Male Seconds 15305.98 2518.36 Male MPH 6.34 1.06 Female Seconds 16477.64 2247.55 Female MPH 5.84 0.83 Based on the information above, along with the information in the histogram / boxplot you created, determine the sampling distribution of the sample mean, X¯. You should determine and clearly identify µX¯, σX¯ , and the shape of the sampling distribution of X¯ in your report. (NOTE: the sample size n will depend on the gender you have selected). • Using the sampling distribution of the sample mean, determine the following quantities (based on samples of the same size, n, where n is the sample size of the gender you selected). Include all calculated values in your report: – What is the probability a randomly selected group of n runners will have a mean completion time of less than 4.5 hours? OR What is the probability a randomly selected group of n runners will have a mean speed of less than 5 MPH? You only need to answer ONE of these questions, based on the variable you selected. – What is the probability a randomly selected group of n runners will have a mean completion time of over 5 hours? OR What is the probability that a randomly selected group of n runners will have a mean speed greater than 6 MPH? You only need to answer ONE of these questions, based on the variable you selected. – What is the 90th percentile for the mean value of n runners, for the variable you selected? – For the calculations you have completed above, explain why it was necessary for us to hold the sample size n constant? · Even though we know the true value of µ, calculate a 95% confidence interval for µ as if we did not know it. For now, you can use the fact that we know σ from the population. You do not need to show every step of your calculation of the confidence interval, however you must show your point estimate, the confidence coefficient you used, and the standard deviation of the sampling distribution for X¯. – Include your confidence interval, as well as an appropriate interpretation of the interval, in your report. Was your confidence interval successful in capturing the true population mean? · Calculate a 90% for µ, however this time assume that we do not know σ. Explain how we can work around this problem. As before, you must include your point estimate, the confidence coefficient, and the standard error of X¯ in your report. – Include your final confidence interval (you do not need to show all steps of the calculations) and an appropriate interpretation of the interval in your report. Was this second confidence interval successful in capturing the true population mean?
Mar 09, 2021
SOLUTION.PDF

Get Answer To This Question

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here