
TEQSA: PRV14311 CRICOS: 03836J Australia Advance Education Group Pty Ltd. trading as Sydney International School of Technology and Commerce ABN 74 613 055 440 | ACN 613 055 440 Level 14/233 Castlereagh Street, Sydney NSW 2000

ICT205 Data Analytics
Assessment 1 – Case Study Report

Overview
A data analytics project starts with collecting the data and ends with communicating the results from the data. In between, multiple steps must be followed, and data preprocessing is one of the most important of them. The data preprocessing step itself has multiple steps depending on the nature, type, value etc. of the data. Data visualisation, on the other hand, uses visual representations such as charts, graphs and illustrations to explore, make sense of, and communicate data. Today, a move towards visualisation can be observed among many big companies.

Timelines and Expectations
Students are expected to work individually to prepare a report that details the use and applications of data preprocessing and data visualisation techniques on a selected data set. The aim of this assessment is to enable students to create a report that evaluates the use of data preprocessing and data visualisation techniques applied to a given case. Students are required to select a data set and answer the following questions:
- What is the purpose of the data set, and what kind of insights can be extracted from the chosen data set?
- Have you applied any data cleaning approaches (e.g., missing value handling, noisy data handling) to the chosen data set? Explain in your own words what data cleaning approaches you have performed or why they were not required.
- Have you applied any data transformation techniques (normalisation, attribute creation, discretisation etc.) to the chosen data set? Explain in your own words what data transformation techniques you have performed or why no transformation was required.
- Have you applied any data reduction techniques (reduce dimension, reduce volume, balance data)? If yes, describe the data reduction technique(s) you have followed; otherwise, explain why no reduction techniques were required.
- Design an interactive dashboard using 3-4 charts/graphs/illustrations to represent the data.

Case Study Report (20%)
Individual Report
Due: 20th August 2021, Week 6, Friday 11:59pm
Expected word count: 1,500 words
Students are expected to submit their assessments via Turnitin on Moodle.
Minimum time expectation: 15 hrs

Learning Outcomes Assessed
The following course learning outcomes are assessed by completing this assessment task:
- LO1. review and differentiate between the methods of data analysis and presentation;
- LO2. analyse internal and external sources of data relevant to business environments including technology and service utilisation data to identify relationships and trends;
- LO3. develop and apply skills in spreadsheets to sort, manage, summarise and display data to support managerial decision-making;

Assessment Details
For this assignment, students are required to write a 1,500-word report on a specific case study and explain the use and applications of data preprocessing and data visualisation techniques on a selected data set. Students can choose any suitable data set that is publicly available on the internet. In week 6, students will be required to submit their report on Moodle. Students are expected to work individually and undertake their own research without collaboration with any other student. Students are expected to prepare a comprehensive report on the application of their knowledge of data preprocessing and visualisation to a given case study.
1. All reports must include at least 5 academic references, which must be formatted in APA7 reference style.
2. The case study must assess the value propositions of the chosen data set and discuss what types of business questions can be answered using the data set. It must highlight the suitability of data cleaning approaches for the selected data set. It must highlight the data transformation techniques that are applicable to the data set. Students must also highlight how an interactive dashboard can be designed for the chosen data set to communicate the data effectively.
3. This unit requires you to use the APA system of referencing. See Sydney International's quick reference guide. It should be used in conjunction with the online tool Academic Writer: https://extras.apa.org/apastyle/basics-7e/#/.
4. A passing grade will be awarded to assignments adequately addressing all assessment criteria. Higher grades require better quality and more effort. For example, a minimum is set on the wider reading required. A student reading vastly more than this minimum will be better prepared to discuss the issues in depth, and consequently their report is likely to be of a higher quality. So before submitting, please read through the assessment criteria very carefully.

Submission
All assessments must be submitted through Turnitin on Moodle.

Marking Criteria / Rubric
Refer to the attached marking guide.

Feedback
Feedback will be supplied through Moodle. Authoritative results will be published on Moodle.

Academic Misconduct
To submit your assessment task, you must indicate that you have read and understood, and comply with, the Sydney International School of Technology and Commerce Academic Integrity and Student Plagiarism policies and procedures. You must also agree that your work has not been outsourced and is entirely your own except where work quoted is duly acknowledged. Additionally, you must agree that your work has not been submitted for assessment in any other course or program.

Individual report sample structure
- Coversheet (mandatory)
- Title page
- Table of contents
1. Introduction
2. Overview of the data
3. Data Preprocessing
a. Data Cleaning
b. Data Transformation
c. Data Reduction
4. Dashboard Design
5. Conclusions
6. References
7. Appendix
Note: Students are allowed to include other sections as they deem necessary based on their case study.

Sample data sets for case study:
- Absenteeism at work Data Set
- Bank Marketing Data Set
- Iranian Churn Dataset Data Set
- Productivity Prediction of Garment Employees Data Set
- Real estate valuation data set Data Set
- Apartment for rent classified Data Set
- Chronic_Kidney_Disease Data Set

Case Study Report Marking Guide – Marks 100
Weighting: 20%
Student IDs:
Assessment Criteria and Score (levels: Very Good, Good, Satisfactory, Unsatisfactory)

Presentation / Layout (/05 marks)
- Very Good: Information is well organised, well written, and proper grammar and punctuation are used throughout. Correct layout used.
- Good: Information is organised, well written, with proper grammar and punctuation. Correct layout used.
- Satisfactory: Information is somewhat organised, proper grammar and punctuation mostly used. Correct layout used.
- Unsatisfactory: Information is somewhat organised, but proper grammar and punctuation not always used. Some elements of layout incorrect.

Structure (/05 marks)
- Very Good: Structure guidelines enhanced.
- Good: Structure guidelines followed exactly.
- Satisfactory: Structure guidelines mostly followed.
- Unsatisfactory: Some elements of structure omitted.

Introduction (/10 marks)
- Very Good: Introduces the topic of the report in an extremely engaging manner which arouses the reader's interest. Gives a detailed general background and indicates the overall "plan" of the paper.
- Good: Introduces the topic of the report in an engaging manner which arouses the reader's interest. Gives some general background and indicates the overall "plan" of the paper.
- Satisfactory: Satisfactorily introduces the topic of the report. Gives a general background. Indicates the overall "plan" of the paper.
- Unsatisfactory: Introduces the topic of the report, but omits a general background of the topic and/or the overall "plan" of the paper.

Details (/65 marks)
- Very Good: All topics are discussed in depth coherently. Significant evidence of critical analysis and reflection.
- Good: Consistently detailed discussion. Displays sound understanding with some analysis of topics.
- Satisfactory: A topic has been adequately discussed. Displays some understanding and analysis of issues.
- Unsatisfactory: Inadequate discussion of issues. Little/no demonstrated understanding or analysis of most issues and/or some irrelevant information.

Summary & Conclusion (/05 marks)
- Very Good: An interesting, well written summary of the main points. An excellent final comment on the subject, based on the information provided.
- Good: A good summary of the main points. A good final comment on the subject, based on the information provided.
- Satisfactory: Satisfactory summary of the main points. A final comment on the subject, but introduced new material.
- Unsatisfactory: Poor/no summary of the main points. A poor final comment on the subject and/or new material introduced.

Referencing (/10 marks)
- Very Good: Correct referencing (APA7 Style). All quoted material in quotes and acknowledged. All paraphrased material acknowledged. Correctly set out reference list.
- Good: Mostly correct referencing (APA7 Style). All quoted material in quotes and acknowledged. All paraphrased material acknowledged. Mostly correct setting out of reference list.
- Satisfactory: Mostly correct referencing (APA7 Style). Some problems with quoted material and paraphrased material. Some problems with the reference list.
- Unsatisfactory: Not all material correctly acknowledged. Some problems with the reference list.

SubTotal: /100 marks
Total out of 20
Answered 4 days after Aug 15, 2021 (ICT205)


Neha answered on Aug 19 2021
Contents
Introduction
Overview of the Data
Data pre-processing
Data Cleaning
Data transformation
Data Reduction
Dashboard Design
Conclusion
References
Introduction
Data mining applications are used in many areas and have now been incorporated into most industries. The transition has been slow, but it is growing steadily, and it will take time for people to understand its benefits. Initially it was used by the automotive industry, which applied artificial intelligence and data mining. In this report we analyse absenteeism data for an organization. The dataset has several attributes, which can be visualized to understand the patterns in the data.
Data mining has been used in industry to understand customer behaviour, sales, production and other aspects, but it can also be used to analyse how employees work and to assess their efficiency. It has been applied in areas such as banking, healthcare, automobiles and traffic systems, all of which have employees. Applied correctly, data mining techniques can help a business increase the efficiency of its employees by a significant amount.
In this report we discuss the steps used in data mining, which can help a company analyse its work. Before visualizing data, it is important to pre-process it, because it may contain errors or null values, and data with errors will give wrong output. We analyse the absenteeism records of employees and assess their accuracy, using data collected from an online portal and a descriptive model. The data has many attributes, and there can be multiple reasons for taking leave. A model is therefore required that can describe the reason for leave and help to predict the behaviour of employees. The data was selected from the UCI Machine Learning Repository, which provides data sets free of charge for machine learning work.
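As a minimal sketch of the null-value check described above, the following Python snippet counts missing entries per column and fills them with the column median. The rows are made-up illustration data (not taken from the UCI file), the helper names count_missing and fill_with_median are hypothetical, and median filling is only one possible strategy, not necessarily the one used in this report.

```python
from statistics import median

# Made-up sample rows for illustration only; they are NOT from the UCI file.
rows = [
    {"Id": 1, "Age": 33, "Absenteeism time in hours": 4},
    {"Id": 2, "Age": None, "Absenteeism time in hours": 0},
    {"Id": 3, "Age": 38, "Absenteeism time in hours": None},
    {"Id": 4, "Age": 41, "Absenteeism time in hours": 8},
]


def count_missing(records):
    """Return {column: number of None values} for a list of dict records."""
    counts = {}
    for record in records:
        for column, value in record.items():
            counts[column] = counts.get(column, 0) + (1 if value is None else 0)
    return counts


def fill_with_median(records, column):
    """Return a copy of records with None in `column` replaced by the median
    of the observed (non-missing) values in that column."""
    observed = [r[column] for r in records if r[column] is not None]
    fill_value = median(observed)
    return [
        {**r, column: fill_value if r[column] is None else r[column]}
        for r in records
    ]


missing_before = count_missing(rows)
cleaned = fill_with_median(rows, "Age")
cleaned = fill_with_median(cleaned, "Absenteeism time in hours")
missing_after = count_missing(cleaned)

print("Missing before:", missing_before)
print("Missing after: ", missing_after)
```

The median is used here because it is less sensitive to extreme values than the mean; dropping incomplete rows would be an alternative when only a few records are affected.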
Overview of the Data
The dataset has 740 rows and 21 columns: Id, reason for absence, month of absence, day of the week, seasons, transportation expense, distance from residence to work, service time, age, workload average per day, hit target, disciplinary, education, son, social drinker, social smoker, pet, weight, height, body mass index, and absenteeism time in hours. The dataset was collected from the UCI Machine Learning Repository, an open platform for obtaining data sets. The dataset has multiple rows and...
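A quick way to confirm the stated size of a data set before any pre-processing is to count its rows and header fields. The sketch below does this for CSV text using only the Python standard library. The helper name table_shape, the three-row excerpt, and the semicolon delimiter are assumptions made for illustration; the real file should be checked for its actual delimiter and header spelling.

```python
import csv
import io


def table_shape(csv_text, delimiter=";"):
    """Return (row_count, column_names) for CSV text that has a header row."""
    reader = csv.reader(io.StringIO(csv_text), delimiter=delimiter)
    header = next(reader)                 # first line holds the column names
    row_count = sum(1 for _ in reader)    # remaining lines are data rows
    return row_count, header


# Made-up three-row excerpt for illustration; values are not from the UCI file.
sample = "Id;Age;Absenteeism time in hours\n1;33;4\n2;50;0\n3;38;2\n"

row_count, columns = table_shape(sample)
print(row_count, "rows,", len(columns), "columns")

# For the full data set described above, the same check would be expected
# to report 740 rows and 21 columns.
```

With a real download, the text would be read from the file instead of the inline sample, and a mismatch against the expected shape would indicate a wrong delimiter or a damaged file.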