Microsoft Word - Assignment2_Q3_2022.doc School of Computer, Data & Mathematical Sciences Visualisation - (COMP7016) Q3 2022: Assignment 2 Multidimensional Data Visualisation (Assignment deadline:...

1 answer below »
Tableau


Microsoft Word - Assignment2_Q3_2022.doc School of Computer, Data & Mathematical Sciences Visualisation - (COMP7016) Q3 2022: Assignment 2 Multidimensional Data Visualisation (Assignment deadline: Sunday 04/09 11:59pm on vUWS) Assignment Details For this assignment, you are required to identify and develop one (or more) visualisation(s) for one of the given multidimensional datasets (We give two data sets, you only need to pick one of them) using existing software and/or programming platform, (e.g., Tableau, TabuVis, R, Python, etc.). Based on the visualisations, you can explore to find insight, patterns, ir(regularity) and interesting properties from the visualisation. Note, it might be required to clean and process the data before using them in visualisation tools. You are also required to write a report with 1500 words (note: your report can be longer than 1500 words if necessary) on the following aspects: • Brief technical details of the used visualisation method(s), • Discussion on the advantages and disadvantages of the visualisation method(s) compared with other methods in the literature. Can the visualisation method(s) be used effectively for large multidimensional data sets and why? Which other analysis techniques could be helpful to support the visualisation in the big data analysis? Such as in data processing, automated analysis, etc. • Discussion on the analysis results and findings on the data sets, • Discussion on other aspects, literature review of related work and your critical thinking on the visualisation(s). Note: images (as figures) are essential and should be included in the report to illustrate the visualisations, results, and findings. Marking criteria for the assignment includes • Development of satisfactorily visualisation(s) for multi-dimensional data (50%). The marking will be based on how well the visualisation method presenting the multidimensional data. Interaction should also be included in the visualisation. • A report on the technical description of the visualisation, analysis results and other aspects (50%) Deliverables Students must individually complete the visualisation(s) and the report. The report should be typed and submitted online through vUWS as a Word or pdf file. A high standard of professional English and neat logical structure (including consistent and complete referencing style) is expected. Declaration You are required to submit a declaration with the following claim (in a text file or world file). DECLARATION I hold a copy of this assignment that I can produce if the original is lost or damaged. I hereby certify that no part of this assignment/product has been copied from any other student’s work or from any other source except where due acknowledgement is made in the assignment. No part of this assignment/product has been written/produced for me by another person except where such collaboration has been authorised by the subject lecturer/tutor concerned. Submission Together with the report document, the declaration, visualisation program(s) and data sets, and the report should be submitted via vUWS before the deadline for marking purpose. All these files should be zipped into one file with your student id as the zipped file name. A submission that does not follow the formats is not acceptable (i.e. Doc or PDF format for the report document, and ZIP file for source codes and others). No hard copy of the work and email submission is acceptable. 19493145.pdf COVID 19 Data Visualisation using Tableau Introduction In our day to day lives, we encounter different and various type of information. This can also vary by size, form and source of obtaining the data. To better understand this, people would leverage on different visualisation techniques. Data visualisation, according to Heitzman (2019), involves gathering of data and converting them into graphical representation to better enhance in terms of presenting insights and meaning about the information. As also discussed by Kelleher and Wagener (2011), visualisation is significant tool in presenting and communicating information as it is able to synthesize huge amount of data into an effective graphical representation. Basically, the purpose of having this can be broadly categorized into two which are data analysis and data presentation. To be able to perform visualisation, Senay and Ignatius (1994) proposed these steps: Data Manipulation, Visual Mapping and Rendering. On the other hand, Fry (2008) provided more steps to consider particularly in dealing with large data set. These seven steps are shown below. Figure 1: Data Visualisation Steps The initial step is, of course, to obtain the data from a certain source. This is basically the data acquisition part. After that data parsing would need to be performed. This can mean exploring the data and organize the data structure. Once the structure is organized, filtering can easily be done to ensure that only data of interest are selected. Next would be data mining or for other they also consider this as data enrichment wherein some additional techniques are implemented to be able to understand any pattern that exists. From there, tools and techniques can be identified on how to better illustrate the data e.g. map, bar, scatter plot etc. That can be enhanced by implementing graphic design methods. One way is by ensuring that colour and shapes are properly used to provide better readability. Then finally, interaction can also be added to enable used to explore the data and/or control the features the user wants to see (Fry, 2008). Tableau as Data Visualisation Tool One of the visualisation tools that can be used to implement those steps discussed is Tableau. Tableau, as introduced its website, is said to be a powerful, flexible and secure data visual analytics platform (Tableau, n.d.). Among its various software available, Tableau Desktop, an easy to use interface, will be used in this paper to implement different visualisation techniques. Aside from that, Acquire Parse Filter Mine Represent Refine Interact Dataflair (2019) also mentioned the other benefits of using this tool such as high quality of visual image capabilities, strong and reliable performance, multiple information source connection among others. In the succeeding parts of this paper the different visualisation generated from Tableau will be discussed including on how the visualisation steps mentioned were considered. COVID 19 Data Set As mentioned earlier, the first part of Data visualisation is to identify the source. In this case, this paper will use the COVID 19 data set from Our World in Data website (Ritchie, 2020). However, it will only consider the data reported until May 21, 2020. As also mentioned in their website. There are four main metrics in data set which are total confirmed cases, total deaths, new confirmed cases, and new deaths. Details on number of tests, population, location, date are also other fields available in the csv source file. Using the VLOOKUP method from Excel, the author has also added new field which is continent wherein the aim is to improve the visualisation that will be performed. Visualisation Implementation After exploring the data set, it would be a nice start to show the global count using a bar chart. On the graph below, it shows there the global summary of total confirmed case and total deaths as of May 21, 2020. Note that Total test is empty hence author simply showed these two to provide a global count for this pandemic. Figure 2: Global COVID Count of Confirmed Case and Death as of May 21 To track the count for both over time, a line chart can provide more details about this as shown below. This line chart is able to provide information regarding the increasing number of cases of both metrics. On below, dual axis was used to plot the on the same graph. It was also helpful to have unique colour on each metric and include a label of the peak of new cases for both. For example, on the week of April 6, there were 52, 531 new death cases which was at its peak. On the other hand, it was on the week of May 11 that the number of new cases was reported to be at its peak of 608,652. Also, another good thing that we can observed on both metrics is the drop on new cases which hopefully would be continuously going down. Figure 3: Global Covid Trend of New Confirmed Case and Death as of May 21 From both bar and line graph, it was shown some significant numbers on COVID confirmed cases and death count. Kiss (2018) discussed line graph would definitely be useful to present trend which indeed serve its purpose in this case; hence, it was effective to visualise the global overview. However, if all of the figures of each countries were included it would not be that effective and would not give a clear picture of the information. As mentioned by Gulbis (2016) and ROM (n.d.), bar and line graphs may not be that suitable for large scale visualisation. Knowing that limitation, another technique that can be explored to somehow show the highlevel information per country is using a map. As mentioned by Gulbis (2016), map can provide an overview (but not an absolute count) of the data across various location in this case country. As also suggested by the same author, using overlay bubble can better improved the visualisation which is implemented in this by adding the blue-green circle. From that, its size can aid in identifying which countries have higher number of confirmed cases. From Figure 4, it can be observed that United States has the most number of cases. By hovering the pointer on the circle, the total case can be checked which is 1,551,853. Furthermore, each continent was given unique colours. In doing so, another insight can be observed from the map in terms of which continent has more countries having high confirmed COVID cases. From the below map, it can be noticed that there are a lot of countries in Europe with high cases of COVID as shown by the blue-green circles. Figure 4: Total COVID Cases per Country using Symbol Map. The good thing on symbol map as discussed by Mace (2014)
Answered 4 days AfterAug 27, 2022

Answer To: Microsoft Word - Assignment2_Q3_2022.doc School of Computer, Data & Mathematical Sciences...

Irfan Haider answered on Aug 29 2022
64 Votes
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here