Use the data set below to produce the following outputs. For this task, use "median_income_by_state_2010_ XXXXXXXXXXcsv" Provide the R codes in the R-script file under the sub-heading Module 3, and...

1 answer below »

Use the data set below to produce the following outputs. For this task, use "median_income_by_state_2010_20210610.csv" Provide the R codes in the R-script file under the sub-heading Module 3, and the statistical outputs under the sub-heading "Module 3" in the Word file, using the appropriate file naming convention. In general, your statistical outputs (in this case: hypothesis test) should be titled and labeled so that they can stand on their own (and can be understood by anyone who looks at them for the first time).



  1. Conduct a one-sample t-tests for income in southern states. Use the t.test( ) in R. Provide the null and the alternative hypothesis for each test. Should this be a one or a two-sided test? Explain/justify your choice. Provide the test results and an interpretation of your results, in light of your choice for a one or two-sided test.


Submit a report that explores the dataset, provides an interpretation of your hypothesis test, and your R script.


For a definition of "southern states", Wikipedia (US Southern States(Links to an external site.)) discerns South Atlantic States (DE, FL, GA, MD, NC, SC, VA, WV), East South Central States (AL, KY, MS, TN), and West South Central States (AR, LA, OK, TX). For the purpose of this exercise, consider all three sets "southern states." For state abbreviations see:State Abbreviations




State,2010 United States,"55,520" Alabama,"46,120" Alaska,"65,178" Arizona,"52,839" Arkansas,"43,477" California,"61,162" Colorado,"67,866" Connecticut,"74,361" Delaware,"62,211" D.C.,"64,142" Florida,"49,650" Georgia,"49,707" Hawaii,"67,084" Idaho,"53,012" Illinois,"57,156" Indiana,"51,986" Iowa,"55,227" Kansas,"51,890" Kentucky,"46,313" Louisiana,"44,280" Maine,"54,005" Maryland,"72,336" Massachusetts,"68,655" Michigan,"52,140" Minnesota,"58,951" Mississippi,"42,996" Missouri,"51,623" Montana,"46,511" Nebraska,"59,157" Nevada,"57,688" New Hampshire,"75,077" New Jersey,"70,947" New Mexico,"50,853" New York,"56,089" North Carolina,"49,384" North Dakota,"57,469" Ohio,"51,701" Oklahoma,"48,565" Oregon,"57,014" Pennsylvania,"54,436" Rhode Island,"58,164" South Carolina,"46,982" South Dakota,"51,099" Tennessee,"43,481" Texas,"53,255" Utah,"63,886" Vermont,"63,015" Virginia,"68,017" Washington,"63,280" West Virginia,"48,198" Wisconsin,"56,731" Wyoming,"58,816"
Answered 2 days AfterJun 14, 2021

Answer To: Use the data set below to produce the following outputs. For this task, use...

Atreye answered on Jun 16 2021
143 Votes
Description:
The data consists of 52 southern states and corresponding median income.
Loading of the dataset
in R:
Code
> data<-read.csv(file.choose(),header=TRUE)
> head(data,2)
Output:
State MedianIncome
1 United States 55520
2 Alabama 46120
Making of dataframe
Code
> df<-data.frame(data)
Summary Statistics
Code
> summary(df$MedianIncome)
Output
Min. 1st Qu. Median Mean 3rd Qu. Max.
42996 50567 55374 56340 62412 75077
Interpretation:
The minimum value of Median Income is 42996.
The first quartile of median income is 50567.
The median of median income is 55374.
The mean of median income is 56340.
The third quartile of median income is 62412.
The maximum value of median income is 75077.
Boxplot of median income
Code
> boxplot(df$MedianIncome,ylab = "Median Income")
Output
Interpretation
From the boxplot, it is observed that the median income is slightly positive symmetric.
Performing Shapiro Wilk test
The Shapiro-Wilk test is performed to check for normality which is basic assumption for conducting one sample...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here