
Question

1. Model.pdf
2. Database.sql
3. Vaccinations.db
4. Queries.sql
5. Queries.pdf

RMIT Classification: Trusted

School of Science
ISYS1055/1057/3412 (Practical) Database Concepts
Assessment 4: Database Design Project

Assessment type: Take-home assessment
Word limit: N/A
Draft Due Date: 16 October at 23:59 (Melbourne Time), Week 12 (otherwise no 4 mark bonus)
Final Due Date: 30 October at 23:59 (Melbourne Time), Week 14
Weighting: 35%, 35 Marks

Overview

This is a practical and real-world project that puts the knowledge you gained into practice. You are required to investigate and understand a publicly available dataset, design a conceptual model for storing the dataset in a relational database, apply normalisation techniques to improve the model, build the database according to your design and import the data into your database, and develop SQL queries in response to a set of requirements.

The objective of this assignment is to reinforce what you have learned in the whole course. Specifically, it involves how to build a simple application that connects to a database backend, running a simple relational schema.

Part A: Understanding the Data (0 Marks, Preliminary Work)
Part B: Designing the Database (10%)
Part C: Creating the Database and Importing Data (10%)
Part D: Data Retrieval and Visualisation (15%)

Assessment criteria

This assessment will measure your ability to:
- Analyse the requirements outlined in the problem description
- Develop a conceptual model for the design of a database backend required for the system
- Use an industry-standard ER modelling tool to draw the ER model
- Use 7-step mapping process to create relational database schema
- Use normalisation process to evaluate the schema and make sure that all the relations are at least 3NF
- Create tables on SQLite Studio and populate them with data available from the specified sources.
- Write SQL statements required for CRUD (create, read, update and delete) operations on the database you built
- Develop your knowledge further to represent data in a meaningful way using data visualisation.

Course learning outcomes

This assessment is relevant to the following course learning outcomes:
CLO1  Describe the underlying theoretical basis of the relational database model and apply the theories into practice;
CLO2  Explain the main concepts for data modelling and characteristics of database systems.
CLO3  Develop a sound database design using conceptual modeling mechanisms such as entity-relationship diagrams.
CLO4  Develop a database based on a sound database design;
CLO5  Apply SQL as a programming language to define database schemas, update database contents, and to extract data from databases for specific users' information needs.

Assessment details

Part A: Understanding the Data

In this assignment, we are working with the publicly available dataset: A Global Database of COVID-19 Vaccinations. Further details about this dataset are given in the article at the following URL: https://www.nature.com/articles/s41562-021-01122-8. The abstract of the article is as follows.

    An effective rollout of vaccinations against COVID-19 offers the most promising prospect of bringing the pandemic to an end. We present the Our World in Data COVID-19 vaccination dataset, a global public dataset that tracks the scale and rate of the vaccine rollout across the world. This dataset is updated regularly and includes data on the total number of vaccinations administered, first and second doses administered, daily vaccination rates and population-adjusted coverage for all countries for which data are available (169 countries as of 7 April 2021). It will be maintained as the global vaccination campaign continues to progress.
    This resource aids policymakers and researchers in understanding the rate of current and potential vaccine rollout; the interactions with non-vaccination policy responses; the potential impact of vaccinations on pandemic outcomes such as transmission, morbidity and mortality; and global inequalities in vaccine access.

A live version of the vaccination dataset and documentation are available in a public GitHub repository at https://github.com/owid/covid-19-data/tree/master/public/data/vaccinations. These data can be downloaded in CSV and JSON formats.

For the purposes of completing this assignment, we are only using the following files. You are required to review and analyse the dataset available in these files. You will find that reviewing the rest of the files, even if not listed below, will help you to form a better understanding about the big picture.

No  FILE NAME                         DESCRIPTION
1   locations.csv                     Country names and the type of vaccines administered. Each line represents the last observation in a specific country. Refer to README.md for the details.
2   us_state_vaccinations.csv         History of observations for various locations in the US.
3   vaccinations-by-age-group.csv     History of observations for vaccinations of various age groups in each country.
4   vaccinations-by-manufacturer.csv  History of observations for various types of vaccines used in each country.
5   vaccinations.csv                  Country-by-country data on global COVID-19 vaccinations. Each line represents an observation date. Refer to README.md for the details.
6   country_data/Australia.csv        Daily observations of vaccination in Australia.
7   country_data/United States.csv    Daily observations of vaccination in the US.
8   country_data/Germany.csv          Daily observations of vaccination in Germany.
9   country_data/Italy.csv            Daily observations of vaccination in Italy.

Table 1: List of data files

README.md: https://github.com/owid/covid-19-data/blob/master/public/data/vaccinations/README.md

To complete the tasks in the following sections, you are required to review and analyse the dataset that is available in the named files.

Part B: Designing the Database (10%)

Task B.1 Produce an ER diagram for a relational database that will be able to store the given dataset.

It is important to note that the given CSV files do not necessarily represent a good design for a relational database. It is your task to design a database that adheres to the good design principles that were taught throughout the course. This means your database schema will not match the structure of the CSV files and, therefore, you will need to manipulate the structure of the dataset (and not the data itself) to import it into your database. Importing the data is required to complete Task C.2.

The ER diagram must be produced in Lucidchart, similar to the exercises that were completed in the course. UML notation is expected and using other notations will not be acceptable. Including a high-quality image representing your model is important, which can be achieved using the Export function of Lucidchart.

You are also required to transform the ER diagram into a database schema that will be used in the next part of the assignment.

Creating a good database design typically involves some database normalisation activities. You should document your normalisation activities and support them with good reasoning. This typically involves explaining what the initial design was, what the problem was, and what changes have been made to rectify the issue.
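
To illustrate the kind of normalisation step described above (initial design, problem, change), here is a minimal sketch using Python's built-in sqlite3 module. It assumes that locations.csv stores the vaccines for a country as one comma-separated value, which would break first normal form because the attribute is not atomic. The table names (Location, Vaccine, LocationVaccine), the column names and the sample rows are hypothetical and are not taken from the assignment or the dataset.

```python
import sqlite3

# Hypothetical rows shaped like locations.csv. The column meaning
# (location, comma-separated vaccines) and the values are assumptions.
raw_rows = [
    ("Australia", "Oxford/AstraZeneca, Pfizer/BioNTech"),
    ("Germany", "Moderna, Pfizer/BioNTech"),
]

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
    CREATE TABLE Location (
        name TEXT PRIMARY KEY
    );
    CREATE TABLE Vaccine (
        name TEXT PRIMARY KEY
    );
    -- Junction table: one row per (location, vaccine) pair,
    -- replacing the multi-valued vaccines column.
    CREATE TABLE LocationVaccine (
        location TEXT NOT NULL REFERENCES Location(name),
        vaccine  TEXT NOT NULL REFERENCES Vaccine(name),
        PRIMARY KEY (location, vaccine)
    );
""")

for location, vaccines in raw_rows:
    conn.execute("INSERT INTO Location (name) VALUES (?)", (location,))
    # Split the multi-valued field into atomic values.
    for vaccine in (v.strip() for v in vaccines.split(",")):
        conn.execute("INSERT OR IGNORE INTO Vaccine (name) VALUES (?)", (vaccine,))
        conn.execute(
            "INSERT INTO LocationVaccine (location, vaccine) VALUES (?, ?)",
            (location, vaccine),
        )
conn.commit()

pairs = conn.execute(
    "SELECT location, vaccine FROM LocationVaccine ORDER BY location, vaccine"
).fetchall()
print(pairs)
```

In a write-up, the same change could be documented as: the initial relation held a list of vaccines per location, the list made it impossible to reference a single vaccine, and the fix was to move the pairs into a separate relation with a composite key.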
The expected outcome of completing this task is one PDF file named model.pdf containing the following sections.
1. Database ER diagram and, if needed, a reasonable set of assumptions.
2. Explanation of normalisation challenges and the resulting changes.
3. Database schema.

Lucidchart: https://www.lucidchart.com/

Part C: Creating the Database and Importing Data (10%)

Task C.1 Produce one SQL script file named database.sql. This script file requires all the SQL statements necessary to create all the database relations and their corresponding integrity constraints as per your proposed design in Part B. The script file must run without any errors in SQLite Studio and contain necessary commenting to separate various relations. Note that this script is not supposed to store any data into the relations.

The expected outcome of completing this task is one script file with the specific name of database.sql.

Task C.2 Create a database file named Vaccinations.db. Import the given dataset into your database.

To complete this task, you may need to change the format of the CSV files to match the attributes of your designed database. You can use a spreadsheet editor such as Microsoft Excel.

The next step is to import the spreadsheets into the database you create in SQLite Studio. To complete this task, use the menu option Tools > Import in SQLite.

The expected outcome of completing this task is one database file named Vaccinations.db, which must contain all the data that is stored in the CSV files named in Table 1.

Part D: Data Retrieval and Visualisation (15%)

Now that you have created and populated a database, it is time to create some queries to investigate the data in various ways. In addition to writing the required queries, you are also asked to produce data visualisation for the results of your queries.

The tasks in this section represent the queries that must be supported.
Each query must consist of one SQL statement. It would be acceptable to use several nested queries, combine several SELECT statements with various operators etc. However, it would not be acceptable to have multiple and separated queries for each task.

After you have written each query, you are expected to produce a data visualisation for each result set. You have the freedom to choose the tool for creating your visuals (e.g., Excel, Google Charts, Tableau) as well as the visualisation techniques (e.g., charts, plots, diagrams, maps). Completing this portion of the work will require that you understand the nature of the results of each query, undertake research to choose a visualisation tool you are comfortable with, decide about the best technique to visually represent each result set, and produce the visualisation. Answers to tasks in Part D that are not supported by a visualisation can achieve up to 80% of the grade associated with each task.

The expected outcome of completing this task is as follows.
1. One SQL script file named Queries.sql containing all the queries developed for the tasks in this section. It is important that you add comment lines to separate the queries and indicate which task they belong to. Note that valid SQL comments must not generate errors in SQLite Studio. The marker of your work will use this file to execute and test your queries.
2. A PDF file named QuerieResults.pdf containing the following elements for each task.
   a. The SQL query
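
As a rough sketch of the "one SQL statement" requirement in Part D, the example below answers a question with a single SELECT that contains a nested query, instead of running two separate queries. It uses Python's built-in sqlite3 module with an in-memory database. The Observation table, its columns and its rows are hypothetical and only stand in for whatever schema is designed in Part B; the query is not one of the actual assignment tasks.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical table; names and values are illustrative only.
    CREATE TABLE Observation (
        location           TEXT NOT NULL,
        obs_date           TEXT NOT NULL,   -- ISO 8601 date, sorts correctly as text
        total_vaccinations INTEGER,
        PRIMARY KEY (location, obs_date)
    );
    INSERT INTO Observation VALUES
        ('Australia', '2021-04-01', 100),
        ('Australia', '2021-04-02', 150),
        ('Italy',     '2021-04-01', 300),
        ('Italy',     '2021-04-03', 420);
""")

# One SQL statement: the nested SELECT finds the latest date for each
# location, and the outer SELECT returns the row recorded on that date.
latest_per_location = """
    SELECT o.location, o.obs_date, o.total_vaccinations
    FROM Observation AS o
    WHERE o.obs_date = (
        SELECT MAX(i.obs_date)
        FROM Observation AS i
        WHERE i.location = o.location
    )
    ORDER BY o.location
"""
rows = conn.execute(latest_per_location).fetchall()
print(rows)
```

The result set has one row per location, which is a shape that maps directly onto a bar chart or a map in the visualisation step.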

Aditi · Accepted Answer

Model 
 
An entity relationship diagram (ERD), also known as an entity relationship 
model, is a graphical representation of the relationships among people, objects, 
places, concepts and events within an information technology (IT) system.
