Only question number 4 please

APPLIED PROJECT PART B – TERM 1 2022

STUDENT NAME / STUDENT ID / SIGNATURE: YUVRAJ KHETAN 20352484 YUVRAJ ASHUTOSH SHARMA ANKITA THAPA USHMITA SUBEDI
UNIT NAME: Statistics for Business
UNIT NUMBER: 700007
NUMBER OF QUESTIONS: 4
VALUE OF QUESTIONS: Group Project Part B questions are worth 30 marks in total.
ANSWERING QUESTIONS: This is a Microsoft Excel based project. All handwritten answers are to be written on the space provided in this assignment.
LECTURER/UNIT COORDINATOR:
DUE DATE: 27th May 2022, Friday Week 11 at 11.59pm
TOTAL PAGES: 7

INSTRUCTIONS TO STUDENTS
· The file containing the data sets for the following questions can be downloaded from the e-learning site vUWS.
· You should use Microsoft Excel to carry out all calculations and statistical analysis which must be shown in the file to be submitted on vUWS.
· All answers in this booklet must match your calculations done and submitted in the Excel file.
· To complete this project, you must provide the Excel outputs for each of the questions.
· GROUP Projects submitted after the due date will attract a late penalty in accordance with the late penalty policy stated on the Western Sydney University – The International College Website.

To upload your Group Project file, follow these instructions:
a. Login to “Moodle”.
b. Select the Statistics for Business folder.
c. Select “Assessments” (from the left hand side of the screen) and navigate to “Applied Project Part B”. Here you will find a submission link.
d. Use “Add attachment” to select the file to be submitted.
e. Double-check to make sure you have the correct file.
f. “Submit” your file.
g. Answers of your group project containing the solutions to each question and Excel file must be handed in by the due date of assessment.

Description of the Data files
The data used in this Project are stored in an Excel worksheet named Assignment Data.xlsx. It can be downloaded on Moodle.
These data comprise the following variables:

Question 1 worksheet
    Internet usage: Amount of data (Gigabytes) used during the previous month.
Question 2 worksheet
    1st column: Annual salary ($) of male athletes in 2019
    2nd column: Annual salary ($) of female athletes in 2019
Question 3 worksheet
    1st column: Male ID
    2nd column: Weight (kg) before weight loss program
    3rd column: Weight (kg) after weight loss program
Question 4 worksheet
    1st column: Average amount of study (minutes) per day
    2nd column: Average % marks for the year.

NOTE: The data was created using a Random Number generator for the sole purpose of this project.

Question 1 (7 marks)
All work for this question must be done in the worksheet named Question 1. This worksheet can be found in the data file Assignment Data.

a) Using Excel, obtain a Descriptive Statistics output of the internet usage from a random sample of 50 people during the previous month. (Mean, Mode, Range and Standard Deviation) * [1]

    Mean                  73.9
    Mode                  55
    Standard Deviation    16.17663975
    Range                 48

b) Using Excel, construct a histogram with appropriate bin widths (with 7 classes). * [2]

    Bin width = (maximum - minimum) / number of bins
    Bin width = (99 - 51) / 7 = 6.857143

    CLASS     FREQUENCY
    51-58     12
    59-66     8
    67-74     8
    75-81     5
    82-88     1
    89-96     13
    97-104    3
    Total     50

c) Using the information from the descriptive statistics output and the histogram, write one paragraph describing the data set. You must include information about the mean, mode, range, standard deviation and the shape of the distribution, of the data in your answer. [2]

From the descriptive statistics, the average internet usage for the random sample of 50 people is 73.9 GB, with a sample standard deviation of 16.18 GB. This standard deviation is large relative to the range, which indicates that the data are widely spread out. The modal internet usage is 55 GB and the range is 48 GB (from 51 GB to 99 GB). The histogram shows that the data are only slightly skewed to the right (skewness 0.147) and have two peaks, one in the 51-58 class and one in the 89-96 class.
d) Using an appropriate Excel output, construct a 90% confidence interval for the population mean of the internet usage during the previous month. [2]

    Column1
    Mean                       73.9
    Standard Error             2.287722332
    Median                     71.5
    Mode                       55
    Standard Deviation         16.17663975
    Sample Variance            261.6836735
    Kurtosis                   -1.499163249
    Skewness                   0.146809114
    Range                      48
    Minimum                    51
    Maximum                    99
    Sum                        3695
    Count                      50
    Confidence Level(90.0%)    3.835482918
    Upper CI (90%)             77.73548292
    Lower CI (90%)             70.06451708

* Evidence of work in Excel is required

Question 2 (8 marks)
All work for this question must be done in the worksheet named Question 2. This worksheet can be found in the data file Assignment Data.

You are about to test the following hypothesis: In 2019, the annual salary (US dollars) of sixty-five randomly selected professional sporting athletes around the world was collected. Is there a significant difference between the average annual salary of male and female athletes?

a) Using Excel, obtain an output for a hypothesis test at 5% level of significance. Assume that the population variances of the two groups of sporting athletes are equal. Also assume that the annual salaries of male and female athletes are normally distributed. * [3]
b) Using the information in the output, conduct a hypothesis test in the space below. Make sure that you include the null and the alternative hypothesis, the decision rule and the value of the test statistics. Explain your decision and write a conclusion. [5]

* Evidence of work in Excel is required

Question 3 (8 marks)
All work for this question must be done in the worksheet named Question 3. This worksheet can be found in the data file Assignment Data.

You are about to test the following hypothesis: A researcher is looking to determine if a twelve-week weight loss program is working effectively for middle-aged men aged between 35 and 45. Thirty males participated in this program and their weights (kg) were recorded before and after the twelve-week weight loss program.
Test the hypothesis at a 5% level of significance to determine if the twelve-week weight loss program is effective.

a) Using Excel, obtain an output for a hypothesis test at 5% level of significance. Assume that the paired differences are normally distributed. * [3]
b) Using the information in the output, conduct a hypothesis test in the space below. Make sure that you include the null and the alternative hypothesis, the decision rule and the value of the test statistics. Explain your decision and write a conclusion. [5]

* Evidence of work in Excel is required

Question 4 (7 marks)
All work for this question must be done in the worksheet named Question 4. This worksheet can be found in the data file Assignment Data.

Western Sydney University The College is investigating whether there is any relationship between the students' average study minutes per day and their average % mark for the year. Sample data concerning the average study minutes per day and the average % mark of the students for the year was collected.

a) Using Excel, obtain a regression output. * [2]
b) Determine the regression equation that may be used to predict the average % mark for the year from the average study minutes per day. [2]
c) Interpret the slope of the regression line. [1]
d) Obtain and interpret the coefficient of determination.
[2]

* Evidence of work in Excel is required

END OF PROJECT

Question 1 worksheet
Internet usage (Gigabytes) last month: raw and sorted data, together with the descriptive statistics, frequency table and 90% confidence interval already reported under Question 1.

Question 2 worksheet
Annual salary of males athletes ($US in millions):
    36.1, 28.5, 19.9, 39.0, 21.1, 2.9, 42.7, 3.6, 11.4, 3.6, 17.9, 22.2, 20.8,
    37.9, 34.8, 33.4, 28.6, 11.7, 44.7, 5.5, 7.6, 22.3, 28.4, 1.1, 51.6, 19.4,
    45.9, 56.6, 26.4, 34.1, 40.8, 43.4, 24.1, 17.9, 26.5, 57.6, 44.2, 22.3
Annual salary of female athletes ($US in millions):
    25.2, 17.1, 13.6, 6.1, 4.0, 4.4, 26.7, 23.3, 30.4, 22.2, 13.6, 17.1, 23.2,
    4.8, 20.7, 20.7, 2.8, 23.4, 7.6, 4.9, 11.1, 4.0, 30.8, 30.5, 5.2, 10.8, 2.6

t-Test: Two-Sample Assuming Equal Variances
                                    Male athletes     Female athletes
    Mean                            27.2766184757     15.072444721
    Variance                        234.7806546177    93.4880732666
    Observations                    38                27
    Pooled Variance                 176.469430568
    Hypothesized Mean Difference    0
    df                              63
    t Stat                          3.6499813225
    P(T<=t) one-tail                0.0002672374
    t Critical one-tail             1.6694022217
    P(T<=t) two-tail                0.0005344747
    t Critical two-tail             1.9983405425

Question 3 worksheet
    ID    Weight (kg) before    Weight (kg) after
    1     91                    108
    2     97                    92
    3     106                   99
    4     111                   81
    5     100                   98
    6     108                   100
    7     105                   109
    8     116                   86
    9     96                    110
    10    115                   92
    11    115                   101
    12    95                    81
    13    115                   107
    14    103                   102
    15    115                   93
    16    118                   84
    17    119                   86
    18    109                   103
    19    114                   93
    20    115                   85
    21    118                   96
    22    120                   99
    23    117                   105
    24    109                   91
    25    116                   85
    26    94                    97
    27    101                   106
    28    95                    108
    29    92                    91
    30    103                   94

t-Test: Paired Two Sample for Means
                                    Weight (kg) before    Weight (kg) after
    Mean                            107.6                 96.0666666667
    Variance                        85.5586206897         77.3747126437
    Observations                    30                    30
    Pearson Correlation             -0.3090418182
    Hypothesized Mean Difference    0
    df                              29
    t Stat                          4.3261189686
    P(T<=t) one-tail                0.0000820005
    t Critical one-tail             1.6991270265
    P(T<=t) two-tail                0.0001640011
    t Critical two-tail             2.0452296421

Question 4 worksheet
Pairs of (average number of study minutes per day, average % marks for the year). The page text is cut off before the end of the 140 observations, so only the pairs that survive are listed:
    (35, 56) (36, 52.2) (39, 54.8) (39, 56.8) (39, 35.8) (39, 42.8) (41, 44.2)
    (45, 33) (48, 58.6) (50, 41) (54, 36.8) (54, 54.8) (55, 32) (56, 59.2)
    (56, 51.2) (56, 47.2) (57, 47.4) (58, 61.6) (59, 43.8) (64, 34.8) (64, 60.8)
    (64, 53.8) (65, 38) (66, 53.2) (67, 49.4) (67, 40.4) (72, 44.4) (72, 42.4)
    (75, 42) (76, 55.2) (76, 58.2) (77, 36.4) (77, 43.4) (77, 63.4) (77, 43.4)
    (77, 47.4) (79, 46.8) (79, 65.8) (81, 46.2) (83, 55.6) (86, 52.2) (87, 43.4)
    (88, 63.6) (90, 54) (90, 64) (90, 46) (94, 50.8) (95, 40) (95, 52)
    (96, 60.2) (97, 69.4) (97, 55.4) (98, 46.6) (101, 56.2) (104, 58.8) (105, 43)
    (105, 54) (111, 72.2) (113, 51.6) (113, 46.6) (116, 56.2) (118, 70.6) (123, 57.6)
    (124, 55.8) (124, 65.8) (124, 49.8) (125, 74) (125, 46) (126, 63.2) (129, 48.8)
    (130, 63) (130, 64) (134, 70.8) (135, 60) (135, 71) (138, 56.6) (144, 75.8)
    (144, 60.8) (147, 74.4) (151, 71.2) (151, 59.2) (151, 72.2) (155, 60) (158, 61.6)
    (159, 73.8) (160, 75) (165, 76) (165, 60) (165, 82) (165, 71) (167, 78.4)

SUMMARY OUTPUT

Regression Statistics
    Multiple R           0.8351153546
    R Square             0.6974176555
    Adjusted R Square    0.6952250299
    Standard Error       8.1822631145
    Observations         140

ANOVA
                  df     SS                  MS                  F                 Significance F
    Regression    1      21294.8861334477    21294.8861334477    318.0741977472    1.21994085426028E-37
    Residual      138    9239.0212951237     66.9494296748
    Total         139    30533.9074285714

                                                Coefficients     Standard Error    t Stat           P-value                 Lower 95%        Upper 95%
    Intercept                                   35.6266772696    1.6806581269      21.1980513461    3.060629067655E-45      32.3035060023    38.9498485369
    Average number of study minutes per day     0.204287212      0.0114545217      17.8346347803    1.21994085426039E-37    0.1816381455     0.2269362784

Correlation coefficient    0.835115355
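For Question 4 parts b) to d), the regression equation and the coefficient of determination can be read off the Excel regression output in the worksheet text above. The following is a minimal Python sketch of that reading, not the submitted Excel work: the constants are copied from the output, while the helper names `predict_mark` and `r_squared` are hypothetical and introduced only for illustration.

```python
# Values copied from the Excel SUMMARY OUTPUT for Question 4.
INTERCEPT = 35.6266772696        # predicted % mark at 0 study minutes
SLOPE = 0.204287212              # change in % mark per extra study minute per day
SS_REGRESSION = 21294.8861334477  # explained variation (ANOVA, Regression row)
SS_TOTAL = 30533.9074285714       # total variation (ANOVA, Total row)


def predict_mark(study_minutes: float) -> float:
    """Predicted average % mark from the fitted line (hypothetical helper)."""
    return INTERCEPT + SLOPE * study_minutes


def r_squared(ss_regression: float, ss_total: float) -> float:
    """Coefficient of determination as explained / total variation."""
    return ss_regression / ss_total


if __name__ == "__main__":
    print(f"mark = {INTERCEPT:.4f} + {SLOPE:.4f} * minutes")
    print(f"Predicted mark at 100 min/day: {predict_mark(100):.2f}")
    print(f"R^2 = {r_squared(SS_REGRESSION, SS_TOTAL):.4f}")
```

Read this way, each additional minute of average daily study is associated with an increase of about 0.20 percentage points in the average yearly mark, and about 69.7% of the variation in marks is explained by study minutes. These interpretations describe an association in the sample and do not by themselves show that studying longer causes higher marks.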
Answered Same Day, May 21, 2022

Answer To: Only question number 4 please

Manoj answered on May 22 2022
Question 1
    Internet usage (Gigabytes) last month                     Question 1
    4        51            (a)
    89        51            Internet usage (Gigabytes) last month
    91        52            Mean    73.9        Mean    73.9
    51        52            Standard Error    2.2877223322        Mode    55
    74        54            Median    71.5        Standard Deviation    16.1766397459
    90        55            Mode    55        Range    48
    71        55            Standard Deviation    16.1766397459
    55        55            Sample Variance    261.6836734694
    79        56            Kurtosis    -1.4991632485
    55        56            Skewness    0.1468091137
    52        58            Range    48
    70        58            Minimum    51
    95        59            Maximum    99
    79        60            Sum    3695
    59        60            Count    50
    99        60
    96        62            mean    73.9
    56        63            mode    55
    56        63            range     48
    58        64            standard deviation    16.1766397459
    52        68
    63        69
    71        70            b)
    96        71            Bin Width=max-min/no of bins    (99-51)/7    6.8571428571
    98        71            Internet usage (Gigabytes) last month
    55        72            CLASS    FREQUENCY
    63        73            51-58    12
    58        74            59-66    8
    95        76            67-74    8
    69        79            75-81    5
    80        79            82-88    1
    73        80            89-96    13
    76        80            97-104    3
    68        88            Total    50
    94        89
    62        89
    98        90            D)
    60        91            Column1
    88        91            Mean    73.9
    60        93            Standard Error    2.2877223322
    60        93            Median    71.5
    93        94            Mode    55
    80        94            Standard Deviation    16.1766397459
    72        95            Sample Variance    261.6836734694
    89        95            Kurtosis    -1.4991632485
    64        96            Skewness    0.1468091137
    54        96            Range    48
    93        98            Minimum    51
    94        98            Maximum    99
    51        99            Sum    3695
                        Count    50
                        Confidence Level(90.0%)    3.8354829182
                        Upper CI (90%)    77.7354829182
                        Lower CI (90%)    70.0645170818
[Histogram: Internet usage (Gigabytes) last month; x-axis CLASS (51-58, 59-66, 67-74, 75-81, 82-88, 89-96, 97-104), y-axis FREQUENCY (12, 8, 8, 5, 1, 13, 3)]
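The 90% confidence interval in the Question 1 output can be cross-checked by hand as mean ± t × standard error. The sketch below does this in Python under one stated assumption: the critical value 1.6766 is taken from a t-table for 49 degrees of freedom (two-tailed, 90%) and does not appear in the worksheet. The function names are hypothetical.

```python
import math

# Values copied from the Question 1 descriptive statistics output.
MEAN = 73.9
STD_DEV = 16.1766397459
N = 50

# Assumption: t critical value for df = 49, 90% two-tailed, from a t-table.
T_CRIT_90 = 1.6766


def standard_error(std_dev: float, n: int) -> float:
    """Standard error of the mean: s / sqrt(n)."""
    return std_dev / math.sqrt(n)


def confidence_interval(mean: float, std_dev: float, n: int, t_crit: float):
    """Return (lower, upper) limits of mean +/- t_crit * standard error."""
    margin = t_crit * standard_error(std_dev, n)
    return mean - margin, mean + margin


if __name__ == "__main__":
    lower, upper = confidence_interval(MEAN, STD_DEV, N, T_CRIT_90)
    print(f"SE = {standard_error(STD_DEV, N):.4f}")
    print(f"90% CI: ({lower:.2f}, {upper:.2f})")
```

The result agrees with the limits reported in the worksheet (about 70.06 to 77.74 GB) to two decimal places; small differences further out come from rounding the critical value.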
Question 2
    Annual salary of males athletes ($US in millions)    Annual salary of female athletes ($US in...
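The t statistic in the Question 2 "t-Test: Two-Sample Assuming Equal Variances" output can be reproduced from the reported means, variances and sample sizes. The following Python sketch shows the pooled-variance calculation; the summary values are copied from the Excel output in the question's worksheet text, and the function name `pooled_t` is a hypothetical helper, not part of the original answer.

```python
import math

# Summary statistics copied from the Question 2 Excel output ($US in millions).
MALE = {"mean": 27.2766184757, "var": 234.7806546177, "n": 38}
FEMALE = {"mean": 15.072444721, "var": 93.4880732666, "n": 27}


def pooled_t(mean1, var1, n1, mean2, var2, n2):
    """Two-sample t statistic assuming equal population variances.

    Returns (t statistic, pooled variance, degrees of freedom).
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / df
    std_err = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / std_err, pooled_var, df


if __name__ == "__main__":
    t_stat, pooled_var, df = pooled_t(
        MALE["mean"], MALE["var"], MALE["n"],
        FEMALE["mean"], FEMALE["var"], FEMALE["n"],
    )
    print(f"pooled variance = {pooled_var:.4f}, df = {df}, t = {t_stat:.4f}")
```

Compared with the two-tail critical value of 1.9983 reported in the same output, a statistic of about 3.65 falls in the rejection region at the 5% level of significance.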