Assignment 5, PS 3780 Data Literacy & Visualization, Summer 2022

Please answer the questions appropriately: provide code for the questions that ask for it and written responses for the rest. Label each answer with the corresponding question number.


Assignment 5
PS 3780 Data Literacy & Visualization, Summer 2022
Due Date: Thursday, June 23, 2022 at 11:59 p.m.

Please save your visualizations and answers to these questions as one .pdf file (use the "save as" function in most word processors). Be sure to include your name, your teammate's name if you have one, and the assignment number. Submit the file to Carmen by the due date. Remember that we are looking for professional visualizations, so please include a meaningful title as well as axis labels and a legend.

Plotting in R - MOMA Artwork

Use the Artworks dataset from Carmen and answer the questions below. The data has been downloaded and partially cleaned from GitHub and includes pieces in the Museum of Modern Art Collection. A random subset of around 6000 pieces was selected to make this a reasonably sized dataset. Import the dataset into R. Write down the command that you use to answer each question.

Summarize and Transform

1. (1 pt) Load in the data and check that it is read properly. What variables are in the dataset? Summarize Height and Width; what's the mean, median, max, and min?

2. (1 pt) Using log(), transform and save different versions of both the Height and Width variables so that outliers are less extreme. For these new variables, what's the mean, median, max, and min? (Note: log(data$variable) will calculate the log of each value in variable and return a vector of the same length.)

What's the Size?

1. (2 pt) Make a basic x-y plot comparing the logged width and logged height of pieces in the collection. Indicate the department that each piece is in by customizing some part of the graph. Write a paragraph describing the apparent relationship (or lack thereof) between these three variables.

Density Plots

1. (2 pt) Pick three departments that have at least 300 pieces in the dataset. Plot overlapping density curves for the logged height of objects in these departments. Make sure each density curve is visible. Write a paragraph describing the differences and similarities between the pieces in the three departments. (Notes: table() can help you decide which departments to focus on. Watch video lecture 10c for help with the plot.)
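The two plotting questions can be approached in several ways. The following is a minimal base R sketch, assuming simulated data in place of the Artworks file: the department names, group sizes, and distributions are made up for illustration, and the column names log_height and log_width are chosen here rather than prescribed by the assignment. The course lecture may use a different plotting approach.

```r
set.seed(1)  # reproducible fake data

# Simulated stand-in for the Artworks data (NOT MoMA data).
depts <- c("Dept A", "Dept B", "Dept C", "Dept D")
n     <- c(400, 350, 320, 50)
df <- data.frame(
  Department = rep(depts, times = n),
  Height     = rlnorm(sum(n), meanlog = rep(c(3.0, 3.4, 3.8, 3.2), times = n),
                      sdlog = 0.5),
  Width      = rlnorm(sum(n), meanlog = 3.3, sdlog = 0.6)
)
df$log_height <- log(df$Height)
df$log_width  <- log(df$Width)

# x-y plot of logged width against logged height, colored by department.
dept_factor <- factor(df$Department)
dept_cols   <- seq_along(levels(dept_factor))
plot(df$log_width, df$log_height,
     col = dept_cols[as.integer(dept_factor)], pch = 16, cex = 0.6,
     main = "Logged height vs. logged width by department",
     xlab = "log(Width)", ylab = "log(Height)")
legend("topleft", legend = levels(dept_factor), col = dept_cols,
       pch = 16, title = "Department")

# table() counts pieces per department; keep three with at least 300 pieces.
dept_counts <- table(df$Department)
big_depts   <- names(dept_counts[dept_counts >= 300])[1:3]

# One kernel density estimate of logged height per selected department.
dens <- lapply(big_depts, function(d) {
  density(df$log_height[df$Department == d])
})

# Axis limits wide enough to show every curve in full.
x_range <- range(sapply(dens, function(d) range(d$x)))
y_max   <- max(sapply(dens, function(d) max(d$y)))

# Draw the first curve, then overlay the others as lines.
plot(dens[[1]], xlim = x_range, ylim = c(0, y_max), col = 1, lwd = 2,
     main = "Density of logged height by department",
     xlab = "log(Height)")
for (i in 2:length(dens)) {
  lines(dens[[i]], col = i, lwd = 2)
}
legend("topright", legend = big_depts, col = seq_along(big_depts),
       lwd = 2, title = "Department")
```

Drawing the curves as lines instead of filled shapes is one simple way to keep every density visible where they overlap.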
Answered same day, Jun 22, 2022


Suraj answered on Jun 23 2022
1.
The data set is loaded into the RStudio environment. The variables in the data set are as follows:
> colnames(df)
[1] "Title" "Artist" "Nationality" "Classification" "Department" "DateAcquired" "Height" "Width"
The summary statistics for the Height and Width variables are as follows:
> summary(df$Height)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
   0.00   17.90   27.70   37.53   43.82 1574.80
> summary(df$Width)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
   0.00   17.46   25.30   38.19   43.16 9144.00
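The commands used to load the data are not shown in the answer. A minimal sketch of how such a check and summary might be produced follows. It uses a small made-up data frame in place of the Artworks file, so the values are illustrative and not MoMA data; the file name in the commented-out read.csv() line is hypothetical, since the assignment does not give one.

```r
# Hypothetical import step; the real file name is not given in the assignment.
# df <- read.csv("Artworks.csv", stringsAsFactors = FALSE)

# Small made-up stand-in so the example runs on its own (NOT MoMA data).
df <- data.frame(
  Title      = c("A", "B", "C", "D", "E"),
  Department = c("Prints", "Prints", "Photography", "Drawings", "Photography"),
  Height     = c(0, 20.3, 27.7, 45.0, 120.5),
  Width      = c(0, 15.2, 25.3, 60.0, 90.1)
)

# Check that the data was read properly: structure, names, first rows.
str(df)
colnames(df)
head(df)

# Min, quartiles, median, mean, and max for each dimension.
summary(df$Height)
summary(df$Width)

# The four requested statistics pulled out individually for Height.
height_stats <- c(
  mean   = mean(df$Height, na.rm = TRUE),
  median = median(df$Height, na.rm = TRUE),
  max    = max(df$Height, na.rm = TRUE),
  min    = min(df$Height, na.rm = TRUE)
)
print(height_stats)
```

summary() already reports all four requested statistics; the named vector is only a convenience when the values are needed separately.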
2.
The Height and Width variables are transformed to a log scale. The summary statistics for the transformed variables are as follows:
> summary(df$log_height)
Min. 1st Qu. Median Mean 3rd Qu. Max.
-Inf ...
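The -Inf minimum arises because log(0) is -Inf in R and the raw Height values include 0.00 (the Min. reported in answer 1). The sketch below, on made-up values and not the Artworks data, shows the transformation and one possible way to keep zeros from distorting the summary: treating them as missing before taking the log. Whether zero-sized records should be excluded is a judgment call that the assignment does not settle. The column name log_height matches the answer, while log_height_clean is a name introduced here for illustration.

```r
# Made-up heights for illustration (NOT MoMA data); note the zero.
df <- data.frame(Height = c(0, 17.9, 27.7, 43.8, 1574.8))

# Direct transformation, as in the answer: log(0) yields -Inf.
df$log_height <- log(df$Height)
summary(df$log_height)  # Min. and Mean are both -Inf here

# One possible fix (an assumption, not part of the original answer):
# treat non-positive sizes as missing before taking the log.
height_pos <- ifelse(df$Height > 0, df$Height, NA)
df$log_height_clean <- log(height_pos)
summary(df$log_height_clean)  # finite statistics, plus a count of NA's
```

A single -Inf also drags the mean to -Inf, so the median and quartiles are the only informative statistics until the zeros are handled.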