Using the “ Independent Project Data ” set file supplied, perform an analysis in StatCrunch for the following using the variable(s) of your choice: 1. Frequency distribution of a variable and bar...

1 answer below »

Using the “Independent Project Data” set file supplied, perform an analysis inStatCrunchfor the following using the variable(s) of your choice:



1.
Frequency distribution of a variable and bar graph of the same variable



2. Descriptives of a continuous variable: mean, median, mode, skewness, kurtosis, standard deviation and graph of that variable



3. Cross tabulation of two variables with the appropriate statistical test



4.
Comparison of two groups (single variable) on a single continuous variable with the appropriate statistical test



5. Comparison of the effect of three or more groups (single variable) on a single continuous variable with the appropriate statistical test



6. Scatterplot and correlation between the two continuous variables with the appropriate statistical test


Think carefully about what kind of variables to choose for the given tasks. A short descriptive statement should accompany each of the above including a description of the variables used and any meaning that may be attached to the results. Write up the project in a WORD document for submission.


Grading on this project is as follows:



3 pointsfor each task 1-6: 1 point each for variable choice, appropriate display/test, description of result.



2 pointsfor overall format/readability/construct (the writing and graphs should be formal and of publishable quality as you would see in a journal article).






SIDE NOTES FROM STUDENT:


1. I will attach the "Independent Project Data" that is to be used on StatCrunch. It contains tabs on the bottom of the excel spread sheet with more information.


2. I will attach a "sample project write up" as a reference


3. Here is my login to My StatCrunch:



https://nu.okta.com/



Username: [email protected]


Password: S4arper5word


Click on "Blackboard Learn" app


Click on BST322 in "My courses"


Then on the LEFT hand side click "StatCrunch Resources"


Click "Open the StatCrunch"


Then click "View the data sets for your course"


Then you'll be able to load the data from Excel to start



Answered Same DayFeb 15, 2021

Answer To: Using the “ Independent Project Data ” set file supplied, perform an analysis in StatCrunch for the...

Pritam answered on Feb 25 2021
141 Votes
Independent project
1. Frequency distribution and bar graph visualization:
On this particular problem, we need to find the bar graph of a categorical variable, measured on a nominal scale, education. The frequency distribution table contains the variable, absolute frequency, relative frequency, and total percentage.
Frequency table results for education:
Count = 950
    education
    Frequen
cy
    Relative Frequency
    Percent of Total
    BA degree
    32
    0.033684211
    3.3684211
    Diploma or GED
    473
    0.49789474
    49.789474
    No high school diploma
    445
    0.46842105
    46.842105
The frequency distribution table and the bar graph above show that there are a greater number of Diploma or GED (473) and no high school Diploma (445) than BA degree (32), which is very low compared to the above two categories. Hence In terms of education level, BA degree holders form the minority group in the sample.
2. Descriptives of a continuous variable:
A continuous variable can take any real values in a particular range. We choose BMI as a continuous variable. The following table contains the basic descriptive statistics of the variable under consideration.
Summary statistics
    Column
    Mean
    Std. dev.
    Median
    Kurtosis
    Skewness
    Mode
    bmi
    29.167131
    7.3633338
    27.97
    1.1320945
    0.9961317
    24.05

Mode:
The mode is the value that occurs most often in a data. In other words, it is the value that is most likely to be sampled. In our example 24.05 occurs 11 times which is the highest and the next closest value of 27.46 occurring 10 times in the sample. The histogram can also show the same in the next section. Hence 24.05 is the mode.
Skewness:
The skewness is the measure of the asymmetry of the probability distribution. The skewness can be positive or negative based on the shape of the tail of the distribution. If the left tail of the distribution is longer then that is termed as negative skewness and for a longer right tail, it is termed as positive skewness.
The histogram concludes that the distribution of the BMI has a longer right tail. Thus the data is positively skewed (0.996).
Kurtosis:
Kurtosis measures how the distribution at the tails differ from that of a normal distribution. The normal distribution has a kurtosis value 3. So the difference between the kurtosis of a distribution and 3 is called the excess kurtosis. There are three types of kurtosis and they are mesokurtic, leptokurtic and platykurtic. If the kurtosis value is zero or close to zero, then it is called mesokurtic. Positive excess kurtosis happens in the case of leptokurtic and platykurtic happens in case of negative excess kurtosis. In this particular example, the data is leptokurtic (1.132).
Standard deviation:
Standard deviation measures the variation of the data around the mean. The standard deviation of the sample is 7.36 which indicates that the data is distributed with a mean deviation of 7.36 from the mean above or below.
3. Cross-tabulation of two variables:
Cross-tabulation basically involves the dependencies among two nominal variables. In this particular problem, we discuss whether the depression level is anyway related to the drunk state of the person. The dependent variable felt down, denotes the depression level of the person and drunk state is the independent variable. Chi-square test measures the relationship between two categorical variables. Statcrunch provides the following result which contains the cross-table as well as the Chi-square test.
Contingency table results:
    Drunk
    
    
    Felt Down
    
    
    
    Little of the time
    Most of the time
    None of the time
    Some of the time
    Total
    3+ times
% within drunk
% within felt down
% of the total
Expected count
    18
(22.5%)
(7.2%)
(1.89%)
(21.05)
    26
(32.5%)
(16.88%)
(2.74%)
(12.97)
    16
(20%)
(5.56%)
(1.68%)
(24.25)
    20
(25%)
(7.75%)
(2.11%)
(21.73)
    80
(100%)
(8.42%)
(8.42%)
    Never
% within...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here