· The assignment must be in MS Word format, no spacing, 12-pt Arial font and 2 cm margins on all four sides of your page with appropriate section headings and page numbers. · Reference sources must be...

1 answer below »
Please see attached


· The assignment must be in MS Word format, no spacing, 12-pt Arial font and 2 cm margins on all four sides of your page with appropriate section headings and page numbers. · Reference sources must be cited in the text of the report, and listed appropriately at the end in a reference list using Harvard referencing style. · When answering questions, students are expected to show all the workings. Wherever required, you should copy/cut and paste the Excel output (e.g., plots, regression output etc.) to show your working/output. Question 1 The higher education department of Holmes Institute recorded data on the number of students enrolled in the different study majors for the years 2018 and 2019. The data are stored in file STUDYMAJOR.xls. a) Use an appropriate graphical technique or chart to compare the number of enrolment in 2018 and 2019 of the different study major. Display the chart. b) Use an appropriate graphical technique or chart to display the percentage value of the number of enrolment of the different study major in 2018 and 2019. Display the chart. Note: Questions 2 to 6 are related. Question 2 Sociologists argued that women on average earn less than men as women often choose to work less hours. They further suggest that the choice of hours worked may be driven by various factors such as age, childcare needs, occupation choice and flexibility. To investigate the relation between hours worked and income earned by Australian men and women, a researcher plans to survey a sample of individuals across the country. Briefly explain (using no more than 250 words in total for this question) a) What type of survey method the researcher could use and why? b) What sampling method could the researcher use to select his/her sample and why? c) What are the two main variables the researcher should consider collecting data for the purpose of the above analysis and why? Identify the data type(s) for the variables. d) What kind of issues the researcher may face in this data collection? Suppose a researcher has collected data from a sample of 65 individuals using the sampling method you have proposed in (b). For each individual, the hours worked per week and yearly income (measured in ‘000’s dollars) were recorded. The data are stored in file HOURSWORKED.xls. Question 3 First, the researcher categorised the data into six location groups and six occupation groups, and calculated the frequencies given below. Using Excel and the data in the frequency tables above, answer the following questions. a) Which graphical technique or chart should be used if the researcher is interested in comparing the number of individuals in each location group? Explain the reason for the selection of this graphical chart. Construct and display the chart, also briefly describe what you can observe about the number of individuals belonging to each location category. b) Which graphical technique or chart should be used if the researcher is interested in comparing the proportion of the number of individuals in each occupation group? Explain the reason for the selection of this graphical chart. Construct and display the chart, also briefly describe what you can observe about the proportion of the number of individuals belonging to each occupation category. Question 4 Second, the researcher wishes to use graphical descriptive methods to present summaries of the data on each of the two variables: hours worked per week and yearly income, as stored in file HOURSWORKED.xls. a) The number of observations (n) is 65 individuals. The researcher suggests using 7 class intervals to construct a histogram for each variable. Explain how the researcher would have decided on the number of class intervals (K) as 7. b) The researcher suggests using class intervals as 10 < x="" ≤="" 15,="" 15="">< x="" ≤="" 20,="" …,="" 40="">< x="" ≤="" 45="" for="" the="" hours="" per="" week="" variable="" and="" class="" intervals="" 40="">< x="" ≤="" 45,="" 45="">< x="" ≤="" 50,="" ...,="" 70="">< x ≤ 75 for the yearly income variable. explain how the researcher would have decided the width of the above class intervals (or class width). c) draw and display a histogram for each of the two variables using appropriate bin values from part (b) and comment on the shape of the two distributions. question 5 third, the researcher wishes to use numerical descriptive measures to summarize the data on each of the two variables: hours worked per week and yearly income. a) prepare and display a numerical summary report for each of the two variables including summary measures such as mean, median, range, variance, standard deviation, smallest and largest values and the three quartiles. notes: use quartile.exc command to generate the three quartiles. (3 marks) b) compute the correlation coefficient using the relevant excel function to measure the direction and strength of the linear relationship between the two variables. display and interpret the correlation value. question 6 finally, the researcher considers using regression analysis to establish a linear relationship between the two variables – hours worked per week and yearly income. a) what is the dependent variable and independent variable for this analysis? why? b) use an appropriate plot to investigate the relationship between the two variables. display the plot. on the same plot, fit a linear trend line including the equation and the coefficient of determination r2. c) estimate a simple linear regression model and present the estimated linear equation. display the regression summary table and interpret the intercept and slope coefficient estimates of the linear model. d) display and interpret the value of the coefficient of determination, r-squared (r2). x="" ≤="" 75="" for="" the="" yearly="" income="" variable.="" explain="" how="" the="" researcher="" would="" have="" decided="" the="" width="" of="" the="" above="" class="" intervals="" (or="" class="" width).="" c)="" draw="" and="" display="" a="" histogram="" for="" each="" of="" the="" two="" variables="" using="" appropriate="" bin="" values="" from="" part="" (b)="" and="" comment="" on="" the="" shape="" of="" the="" two="" distributions.="" question="" 5="" third,="" the="" researcher="" wishes="" to="" use="" numerical="" descriptive="" measures="" to="" summarize="" the="" data="" on="" each="" of="" the="" two="" variables:="" hours="" worked="" per="" week="" and="" yearly="" income.="" a)="" prepare="" and="" display="" a="" numerical="" summary="" report="" for="" each="" of="" the="" two="" variables="" including="" summary="" measures="" such="" as="" mean,="" median,="" range,="" variance,="" standard="" deviation,="" smallest="" and="" largest="" values="" and="" the="" three="" quartiles.="" notes:="" use="" quartile.exc="" command="" to="" generate="" the="" three="" quartiles.="" (3="" marks)="" b)="" compute="" the="" correlation="" coefficient="" using="" the="" relevant="" excel="" function="" to="" measure="" the="" direction="" and="" strength="" of="" the="" linear="" relationship="" between="" the="" two="" variables.="" display="" and="" interpret="" the="" correlation="" value.="" question="" 6="" finally,="" the="" researcher="" considers="" using="" regression="" analysis="" to="" establish="" a="" linear="" relationship="" between="" the="" two="" variables="" –="" hours="" worked="" per="" week="" and="" yearly="" income.="" a)="" what="" is="" the="" dependent="" variable="" and="" independent="" variable="" for="" this="" analysis?="" why?="" b)="" use="" an="" appropriate="" plot="" to="" investigate="" the="" relationship="" between="" the="" two="" variables.="" display="" the="" plot.="" on="" the="" same="" plot,="" fit="" a="" linear="" trend="" line="" including="" the="" equation="" and="" the="" coefficient="" of="" determination="" r2.="" c)="" estimate="" a="" simple="" linear="" regression="" model="" and="" present="" the="" estimated="" linear="" equation.="" display="" the="" regression="" summary="" table="" and="" interpret="" the="" intercept="" and="" slope="" coefficient="" estimates="" of="" the="" linear="" model.="" d)="" display="" and="" interpret="" the="" value="" of="" the="" coefficient="" of="" determination,="" r-squared="">
Answered Same DayJun 03, 2021

Answer To: · The assignment must be in MS Word format, no spacing, 12-pt Arial font and 2 cm margins on all...

Komalavalli answered on Jun 05 2021
141 Votes
1
Question1:
From above chart we can say that more number of students have enrolled in statistics, business law, Accounting, Economics, Finance and Marketing Management. For the course auditing both years has same number of students en
rolled
By comparing percentage of students enrolled in different majors between 2 years 2018 and 2019, from we can interpret that percentage of student enrolment in statistics has increased from 23% in 2018 to 25% in 2019 and for economics it increased from 10% to 11%. percentage of student enrolment has came down for Accounting from 18% to 16%, Marketing management 15% to 13% and for Auditing 5% to 4% from 2018 to 2019.The % of student enrolment contribution to major studies remains unchanged for Business law and Finance.
Question 2:
a) Researcher could use questionnaire method for collecting primary samples. This technique helps us to gather larger number of information within a short span of time.
b) Researcher could use multistage random sampling method to choose his/her samples. Researcher could divide the nations into different locations after obtaining samples based on location. Location sample can be further divided into occupational category of individual and obtain samples based this category .Therefore multistage random sampling technique helps the researcher to obtain samples based on location and occupation of the individuals.
c) Annual income and number of workings hours of the individual are the two main variables that the researcher should consider collecting data for analysing. These two variables will help the researcher to study the relationship between working hours and income of an individual.
d) The individual may lie while answering the questions; this may end up in sampling bias. Some individuals may not respond towards the questionnaire and end up in missing data while collecting samples.
Question 3:
a)
Bar chart is used to show the data category in a frequency. Plotting frequency against Locations, researcher can easily able to identify number of individuals were sampled in each location. From above bar chart it is revealed that the researcher have collected samples from 25 individuals from location group D and from location group A he/she has collected data from 5 individuals .Location group A has high weightage in the data set while Location group D has low...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here