In this exercise, we will look at descriptive statistics and how to explore and summarize data sets. For this, we use the Heart Disease dataset from the UCI data repository. This dataset consists of 4 small datasets of people with heart disease admitted to 4 hospitals.



For now, we work only with the attached data. This data consists of 271 instances with 7 attributes, described below:




  1. Age: age in years

  2. sex: 1 = male; 0 = female

  3. cp: chest pain type

    1. Value 1: typical angina

    2. Value 2: atypical angina

    3. Value 3: non-anginal pain

    4. Value 4: asymptomatic

  4. Trestbps: resting blood pressure

  5. Chol: cholesterol level

  6. Thalach: maximum heart rate achieved

  7. heart_problem: 1 = has heart disease; 0 = no heart disease
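
The coded attributes above can be turned into readable labels before summarizing. The assignment itself asks for Excel, so the following is only a minimal Python sketch for cross-checking. The names `CP_LABELS`, `SEX_LABELS`, `HEART_LABELS`, and `decode` are hypothetical helpers, and the record is assumed to be a dictionary keyed by the lowercase column names used in the data file.

```python
# Code-to-label mappings taken from the attribute descriptions.
SEX_LABELS = {1: "male", 0: "female"}
CP_LABELS = {
    1: "typical angina",
    2: "atypical angina",
    3: "non-anginal pain",
    4: "asymptomatic",
}
HEART_LABELS = {1: "heart disease", 0: "no heart disease"}


def decode(record):
    """Return a copy of the record with coded fields replaced by labels.

    Assumes keys 'sex', 'cp', and 'heart_problem' hold integer codes.
    """
    decoded = dict(record)
    decoded["sex"] = SEX_LABELS[record["sex"]]
    decoded["cp"] = CP_LABELS[record["cp"]]
    decoded["heart_problem"] = HEART_LABELS[record["heart_problem"]]
    return decoded


# First row of the attached data: a 28-year-old male with atypical angina.
first = {"age": 28, "sex": 1, "cp": 2, "trestbps": 130.0,
         "chol": 132.0, "thalach": 185.0, "heart_problem": 0}
print(decode(first))
```

In Excel, a comparable effect could be achieved with a small lookup table per coded column.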






Instruction: Use Microsoft Excel to do your work. Submit your work as ONE MS Excel file with one tab for each question. Show your work as rigorously as possible. Name the file lastname_firstname_hw1.excel.



Using the attached data, answer the following questions:


1. How many patients have heart disease? (0.5)


2. What is the average Cholesterol level of people with heart disease and without heart disease? What is the standard deviation? (1)


3. What are the median and average age of people with:

a. cholesterol higher than 240.0? (0.5)

b. cholesterol higher than 240.0 with heart disease? (0.5)

c. cholesterol higher than 240.0 without heart disease? (0.5)


4. Create a histogram of resting blood pressure. (1)


5. Create boxplots based on the sex of the patients for the following attributes:



a. cholesterol level (1.5)



b. maximum heart rate achieved (1.5)


6. For each box plot, answer the following questions:


a. What is the H-Spread (Q3-Q1) of cholesterol level for males and females? (0.5)


b. What are the Lower Hinge and Upper Hinge values of maximum heart rate for males and females? (0.5)


7. Answer the following questions:


a. Create two scatter plots of age and resting blood pressure, one for people with heart disease and one for people without. Is there any visual correlation? (1+1)


b. Does heart disease exhibit correlation with gender? If yes, what kind? Explain and justify. Show your detailed work. (2)


c. Which exhibits higher correlation with heart disease: gender or age? Or do gender and age together exhibit higher correlation with heart disease? Show your detailed work and explain. (2)


d. Calculate the average resting blood pressure at each age (HINT: group by age) for people with heart disease, then do the same for people without heart disease. Now create two scatter plots. Do you observe a correlation now? Do people without heart disease have higher blood pressure as they age than people with heart disease? (2)


8. Compare the resting blood pressure of people with heart disease and without. (1)
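
The questions above ask for group-wise means, standard deviations, medians, H-spreads, and correlations. Although the submission must be done in Excel, a short Python sketch can serve as an independent check of the spreadsheet results. It uses only the standard library and the first eight rows of the attached data. The helper names (`summarize`, `summarize_by`, `pearson`) are hypothetical, and the quartiles are computed with the "inclusive" interpolation method, which is assumed here to correspond to Excel's `QUARTILE.INC`. Tukey's hinges can differ slightly from these quartiles.

```python
import math
import statistics
from collections import defaultdict

# Columns: age, sex, cp, trestbps, chol, thalach, heart_problem.
# These eight rows are copied from the start of the attached data.
SAMPLE = [
    (28, 1, 2, 130.0, 132.0, 185.0, 0),
    (29, 1, 2, 120.0, 243.0, 160.0, 0),
    (30, 0, 1, 170.0, 237.0, 170.0, 0),
    (31, 0, 2, 100.0, 219.0, 150.0, 0),
    (32, 0, 2, 105.0, 198.0, 165.0, 0),
    (32, 1, 2, 110.0, 225.0, 184.0, 0),
    (32, 1, 2, 125.0, 254.0, 155.0, 0),
    (33, 1, 3, 120.0, 298.0, 185.0, 0),
]
AGE, SEX, CP, TRESTBPS, CHOL, THALACH, HEART = range(7)


def summarize(values):
    """Mean, sample standard deviation, median, and H-spread (Q3 - Q1)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),  # sample SD, n - 1 denominator
        "median": statistics.median(values),
        "q1": q1,
        "q3": q3,
        "h_spread": q3 - q1,
    }


def summarize_by(rows, group_col, value_col):
    """Group rows by one column and summarize another column per group."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_col]].append(row[value_col])
    return {key: summarize(vals) for key, vals in groups.items()}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a column is constant")
    return sxy / math.sqrt(sxx * syy)


# Cholesterol summarized per sex (questions 5a and 6a on this sample).
chol_by_sex = summarize_by(SAMPLE, SEX, CHOL)
# Ages of people with cholesterol above 240.0 (question 3a on this sample).
high_chol_ages = [row[AGE] for row in SAMPLE if row[CHOL] > 240.0]
# Age versus resting blood pressure (question 7a on this sample).
r_age_bp = pearson([r[AGE] for r in SAMPLE], [r[TRESTBPS] for r in SAMPLE])
```

On these eight rows only, the H-spread of cholesterol comes out as 29.0 for males and 19.5 for females; the full 271-row file will give different values. Calling `summarize_by(rows, HEART, CHOL)` on the full data would address question 2, and `pearson` applied to two 0/1 columns such as sex and heart_problem yields the phi coefficient, one possible way to approach question 7b. All eight sample rows have heart_problem = 0, so that correlation cannot be computed on the sample itself.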









Please download the attached file and work with it.
Answered 4 days after Sep 25, 2022

Aditi answered on Sep 26 2022
cleveland-data
    ,age,sex,cp,trestbps,chol,thalach,heart_problem
    0,28,1,2,130.0,132.0,185.0,0
    1,29,1,2,120.0,243.0,160.0,0
    3,30,0,1,170.0,237.0,170.0,0
    4,31,0,2,100.0,219.0,150.0,0
    5,32,0,2,105.0,198.0,165.0,0
    6,32,1,2,110.0,225.0,184.0,0
    7,32,1,2,125.0,254.0,155.0,0
    8,33,1,3,120.0,298.0,185.0,0
    9,34,0,2,130.0,161.0,190.0,0
    10,34,1,2,150.0,214.0,168.0,0
    11,34,1,2,98.0,220.0,150.0,0
    12,35,0,1,120.0,160.0,185.0,0
    13,35,0,4,140.0,167.0,150.0,0
    14,35,1,2,120.0,308.0,180.0,0
    15,35,1,2,150.0,264.0,168.0,0
    16,36,1,2,120.0,166.0,180.0,0
    17,36,1,3,112.0,340.0,184.0,0
    18,36,1,3,130.0,209.0,178.0,0
    19,36,1,3,150.0,160.0,172.0,0
    20,37,0,2,120.0,260.0,130.0,0
    21,37,0,3,130.0,211.0,142.0,0
    22,37,0,4,130.0,173.0,184.0,0
    23,37,1,2,130.0,283.0,98.0,0
    24,37,1,3,130.0,194.0,150.0,0
    25,37,1,4,120.0,223.0,168.0,0
    26,37,1,4,130.0,315.0,158.0,0
    27,38,0,2,120.0,275.0,129.0,0
    28,38,1,2,140.0,297.0,150.0,0
    29,38,1,3,145.0,292.0,130.0,0
    30,39,0,3,110.0,182.0,180.0,0
    32,39,1,2,120.0,200.0,160.0,0
    33,39,1,2,120.0,204.0,145.0,0
    35,39,1,2,190.0,241.0,106.0,0
    36,39,1,3,120.0,339.0,170.0,0
    37,39,1,3,160.0,147.0,160.0,0
    38,39,1,4,110.0,273.0,132.0,0
    39,39,1,4,130.0,307.0,140.0,0
    40,40,1,2,130.0,275.0,150.0,0
    41,40,1,2,140.0,289.0,172.0,0
    42,40,1,3,130.0,215.0,138.0,0
    43,40,1,3,130.0,281.0,167.0,0
    45,41,0,2,110.0,250.0,142.0,0
    46,41,0,2,125.0,184.0,180.0,0
    47,41,0,2,130.0,245.0,150.0,0
    48,41,1,2,120.0,291.0,160.0,0
    49,41,1,2,120.0,295.0,170.0,0
    50,41,1,2,125.0,269.0,144.0,0
    51,41,1,4,112.0,250.0,142.0,0
    52,42,0,3,115.0,211.0,137.0,0
    53,42,1,2,120.0,196.0,150.0,0
    54,42,1,2,120.0,198.0,155.0,0
    55,42,1,2,150.0,268.0,136.0,0
    56,42,1,3,120.0,228.0,152.0,0
    57,42,1,3,160.0,147.0,146.0,0
    58,42,1,4,140.0,358.0,170.0,0
    59,43,0,1,100.0,223.0,142.0,0
    60,43,0,2,120.0,201.0,165.0,0
    61,43,0,2,120.0,215.0,175.0,0
    62,43,0,2,120.0,249.0,176.0,0
    63,43,0,2,120.0,266.0,118.0,0
    64,43,0,2,150.0,186.0,154.0,0
    66,43,1,2,142.0,207.0,138.0,0
    67,44,0,4,120.0,218.0,115.0,0
    68,44,1,2,120.0,184.0,142.0,0
    69,44,1,2,130.0,215.0,135.0,0
    70,44,1,4,150.0,412.0,170.0,0
    71,45,0,2,130.0,237.0,170.0,0
    73,45,0,4,132.0,297.0,144.0,0
    74,45,1,2,140.0,224.0,122.0,0
    76,45,1,4,120.0,225.0,140.0,0
    77,45,1,4,140.0,224.0,144.0,0
    78,46,0,4,130.0,238.0,90.0,0
    79,46,1,2,140.0,275.0,165.0,0
    80,46,1,3,120.0,230.0,150.0,0
    81,46,1,3,150.0,163.0,116.0,0
    82,46,1,4,110.0,238.0,140.0,0
    83,46,1,4,110.0,240.0,140.0,0
    84,46,1,4,180.0,280.0,120.0,0
    85,47,0,2,140.0,257.0,135.0,0
    87,47,1,1,110.0,249.0,150.0,0
    88,47,1,2,160.0,263.0,174.0,0
    89,47,1,4,140.0,276.0,125.0,0
    90,48,0,2,,308.0,,0
    92,48,0,2,120.0,284.0,120.0,0
    93,48,0,3,120.0,195.0,125.0,0
    94,48,0,4,108.0,163.0,175.0,0
    95,48,0,4,120.0,254.0,110.0,0
    96,48,0,4,150.0,227.0,130.0,0
    98,48,1,2,130.0,245.0,160.0,0
    99,48,1,2,140.0,238.0,118.0,0
    100,48,1,3,110.0,211.0,138.0,0
    103,49,0,2,124.0,201.0,164.0,0
    104,49,0,3,130.0,207.0,135.0,0
    105,49,1,2,100.0,253.0,174.0,0
    106,49,1,3,140.0,187.0,172.0,0
    107,49,1,4,120.0,297.0,132.0,0
    109,50,0,2,110.0,202.0,145.0,0
    110,50,0,4,120.0,328.0,110.0,0
    111,50,1,2,120.0,168.0,160.0,0
    112,50,1,2,140.0,216.0,170.0,0
    113,50,1,2,170.0,209.0,116.0,0
    114,50,1,4,140.0,129.0,135.0,0
    115,50,1,4,150.0,215.0,140.0,0
    116,51,0,2,160.0,194.0,170.0,0
    117,51,0,3,110.0,190.0,120.0,0
    118,51,0,3,130.0,220.0,160.0,0
    119,51,0,3,150.0,200.0,120.0,0
    120,51,1,2,125.0,188.0,145.0,0
    121,51,1,2,130.0,224.0,150.0,0
    122,51,1,4,130.0,179.0,100.0,0
    123,52,0,2,120.0,210.0,148.0,0
    125,52,0,3,125.0,272.0,139.0,0
    126,52,0,4,130.0,180.0,140.0,0
    127,52,1,2,120.0,284.0,118.0,0
    128,52,1,2,140.0,100.0,138.0,0
    129,52,1,2,160.0,196.0,165.0,0
    130,52,1,3,140.0,259.0,170.0,0
    131,53,0,2,113.0,468.0,127.0,0
    132,53,0,2,140.0,216.0,1...
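
The excerpt above has an unnamed first column of row ids (with gaps, so it is not a running count) and at least one row, id 90, with empty trestbps and thalach cells. As a hedged illustration of how such a file might be read and how missing cells can be skipped when averaging, here is a minimal Python sketch using the standard `csv` module. The names `parse`, `mean_ignoring_missing`, and `row_id` are hypothetical, and the three data rows are copied from the excerpt.

```python
import csv
import io
import statistics

# Header plus three rows copied from the excerpt; row 90 has empty cells.
EXCERPT = """,age,sex,cp,trestbps,chol,thalach,heart_problem
89,47,1,4,140.0,276.0,125.0,0
90,48,0,2,,308.0,,0
92,48,0,2,120.0,284.0,120.0,0
"""


def parse(text):
    """Parse CSV text into dictionaries, mapping empty cells to None."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        row = {}
        for key, value in raw.items():
            name = key or "row_id"  # the first header cell is empty
            row[name] = None if value in ("", None) else float(value)
        rows.append(row)
    return rows


def mean_ignoring_missing(rows, column):
    """Average a column over the rows where it is present."""
    present = [row[column] for row in rows if row[column] is not None]
    return statistics.mean(present)


rows = parse(EXCERPT)
print(mean_ignoring_missing(rows, "trestbps"))
```

Excel's `AVERAGE` likewise skips empty cells, so the two approaches should agree on columns with gaps, but it is worth confirming how blanks are handled in any histogram or box plot built from the same columns.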