School of Engineering, Information Technology, and Physical Sciences School of Engineering, Information Technology, and Physical Sciences CRICOS Provider No. 00103D | RTO Code 4909 ITECH7407 – Real...

1 answer below »
It is a data analysis assignment. Have attached the files along with it. Can you guys do it?



School of Engineering, Information Technology, and Physical Sciences School of Engineering, Information Technology, and Physical Sciences CRICOS Provider No. 00103D | RTO Code 4909 ITECH7407 – Real Time Analytics Assessment Task – Data Analytics Assignment Overview For this assessment task, you will work in a group to analyse a selected data set, and provide recommendations to the leadership of the company based on your findings. Timelines and Expectations Percentage Value of Task: 25% Due: Week 11, Sunday 5pm Minimum time expectation: 35 hrs Learning Outcomes Assessed The following course learning outcomes are assessed by completing this assessment task: S1.Integrate data warehouse and business intelligence techniques when using big data. S2.Create flexible analytical models based on real time data, and use connectivity interfaces and tools for reporting purposes. S3.Use real time performance analysis techniques to monitor data, and identify shifts or events occurring in data, as a basis for organisational decision making. S4.Use real time mobile tracking techniques to utilise mobile-specific usage data. K3.Communicate the key drivers for big data in terms of efficiency, productivity, revenue and profitability to global organisations. K4.Identify and describe types of big data, and analyse its differences from other types of data. A1.Communicate security, compliance, auditing and protection of real time big data systems. A2.Adopt problem solving and decision making strategies, to communicate solutions to organisational problems with key stakeholders, based on analysis of big data, in real time settings. Assessment Details This is a business analytics project aimed at generating innovative analytics solutions for a Company. The objective is to analyse the given datasets from a relevant firm’s perspective in terms of implications and strategies which the chosen company could adopt to improve its functions, resources and processes efficiently and effectively. You are expected to work in a group to find a dataset with minimum of 10,000 rows from publicly available sources. You also need to hypothetically identify a company (chosen company) who could get benefits from the analysis of the dataset. You are expected to discuss the selected dataset with your tutor in labs by Week 7. Below is a list of open data sources: 1. World Bank Open Data 2. WHO (World Health Organization) — Open data repository 3. Google Public Data Explorer 4. Registry of Open Data on AWS (RODA) 5. European Union Open Data Portal 6. FiveThirtyEight 7. U.S. Census Bureau 8. Data.gov 9. UNICEF Dataset 10. DBpedia 11. freeCodeCamp Open Data 12. Australian Govt Data Your group is to complete the following tasks: Task 1- Background information Write a description of the selected dataset and project, and its importance for your chosen company. Information must be appropriately referenced. [1 Page] Task 2 – Perform Data Mining on data view Upload the selected dataset on SAP Predictive Analysis. For your dataset, perform the relevant data analysis tasks on data uploaded using data mining techniques such as classification/association/time series/clustering and identify the BI reporting solution and/or dashboards you need to develop for the operational manager of the chosen company. [2-3 Pages] Task 3 – Research Justify why you chose the BI reporting solution, dashboards and data mining technique in Task 3 and why those data sets attributes are present and laid out in the fashion you proposed (feel free to include all other relevant justifications). Note: A BI dashboard is an integrated and interactive tool to display key performance indicators (KPIs) and other important business metrics and data points on one screen, but not a static diagram or graph. To ensure that you discuss this task properly, you must include visual samples of the reports you have produced (i.e. the screenshots of the BI report/dashboard must be presented and explained in the written report; use ‘Snipping tool’), and also include any assumptions that you may have made about the analysis from Task 3. [1-2 Pages] Task 4 – Recommendations for CEO The CEO of the chosen company would like to improve their operations. Based on your BI analysis and the insights gained from your “Dataset” in the lights of analysis performed in previous tasks, make some logical recommendations to the CEO, and justify why/how your proposal could assist in achieving operational/strategic objectives with the help of appropriate references from peer-reviewed sources. [2-3 Pages]. Task 5 – Cover letter Write a cover letter to the CEO of the chosen firm with the important data insights and recommendations to achieve operational/strategic objectives [1 page] Other Tasks – At least 5 references in your report must be from peer-reviewed sources. Include any and all sources of information including any person(s) you interviewed for this project. Please refer to the marking scheme at the end of the assignment for other tasks and expectations. Submission You need to submit your report (about 3000 words not counting cover page, references and Appendix) of this project suing the drop box on Moodle. For each team, only one submission from one member of the team is required. Your submission will be checked for plagiarism using text-matching software. Please note that all references must adhere to APA 7th style. See https://federation.edu.au/library/student-resources/fedcite/content/apa-7th-ed./using-apa-7th-ed./introduction for details on how to format a report and how to cite references. Make sure your follow a formal report structure with cover page, introduction, use of headings, subheadings, conclusion sand reference section. Marking Criteria / Rubric Refer to the attached marking guide. Feedback Feedback will be supplied through Moodle. Authoritative results will be published on fdlMarks. Academic Misconduct To submit your assessment task, you must indicate that you have read and understood, and comply with, the Federation University Australia Academic Integrity and Student Plagiarism policies and procedures (http://policy.federation.edu.au/learning_and_teaching/compliance/academic_integrity/ch02.php). You must also agree that your work has not been outsourced, and is entirely your own except where work quoted is duly acknowledged. Additionally, you must agree that your work has not been submitted for assessment in any other course or program. ITECH7407 – Real Time Analytics Marking Guide – Data Analytics Assignment Criteria Maximum Obtained Comments Background of the project Description of Project, Datasets and firm. The importance of project for the firm [1+1+1+2] 5 Perform Data Mining Techniques Perform various analysis using SAP expert analysis on your uploaded dataset - [Quality and complexity of the analysis – association/classification/clustering/time series models, dashboard designing and relevance for the project] 50 Research Justify why these BI reporting solution/dashboards are chosen and why those attributes are present and laid out in the fashion you proposed (feel free to include all other relevant justifications). Note: To ensure that you discuss this task properly, you must include visual samples of the reports you produce (i.e. the screenshots of the BI report/dashboard must be presented and explained in the written report; use ‘Snipping tool’), and also include any assumptions that you may have made about the analysis in your assignment report (i.e. the report to the operational team of the company). [Each analysis/dashboard and report explanation with relevant research papers, complexity and depth of the justification, use of peer-reviewed sources] 15 Recommendations The CEO of the chosen firm would like to improve the operations. Based on your BI analysis and the insights gained from “Data Set” in the lights of analysis performed in previous tasks, make some logical recommendations to the CEO, and justify why/how your proposal could enhance company operations and could assist in achieving operational/strategic objectives. (Key data insights, recommendations to achieve organisational objectives with theoretical justifications with proper references. ) [2+3+5] 10 Cover Letter (Format, key findings and recommendation ) [1+1+3] 5 Presentation Report is well-written and presented professionally, containing: • Title page • Table of Contents • Introduction • Appropriate use of headings within report • Appropriate use of figures (i.e. graphs, summary tables) and reference to calculations and summaries to justify all observations and recommendations • Overall structure, presentation and formatting. Note that the report has to be presented formally. It must include discussion of calculations, observations and recommendations with graphs and/or tables. [15] 1 2 2 1 4 5 Total 100 (25%) Page 1 of 5 CRICOS Provider No. 00103D | RTO Code 4909Data Analytics AssignmentPage 5 of 5
Answered 15 days AfterMay 10, 2021ITECH7407

Answer To: School of Engineering, Information Technology, and Physical Sciences School of Engineering,...

Payal answered on May 25 2021
129 Votes
ITECH 7407
REAL TIME ANALYTICS
Report Submitted by –
Apoorva Kulkarni
CONTENTS
1. PURPOSE
2. BACKGROUND & INFORMATION
3. DATA MINING & RESEARCH, DATA MODELLING
3.1 DATA MINING
3.2 DATA MODELLING
3.3 RESEARCH & ANALYSIS
    a) Association Technique
    b) Bar Chart (Drug Usage vs. Gender)
c) Pie Chart (Drug Usage & Frequency)
    d) Bar Chart (Health Status vs. Gender)
4. RECOMMENDATION FOR CEO
5. COVER LETTER
6. REFERENCES
1.PURPOSE:
In the current age of Industrialization 5.0, Business Analytics is considered as one of the prime necessity in every sector. We can witness the use o
f Data analysis in almost every field line Manufacturing, Healthcare, Sports, services, etc. Every industry, irrespective of their size, analyse the data of their respective fields for future forecasting & decision making. Few of these Organizations has also adopted the Analytical tools like Tableau, SAP Predictive Analysis, ARIBA cloud, etc.
The field of Business Analytics has grown exponentially over the past few years. The organizations are using this tool to promote their products & services, preparing future business strategies, targeting large customer base, improving their operational efficiency, spend management etc. (Yanqing, 2020)
2.BACKGROUND INFORMATION:
The National Drug Strategy Household Survey (NDSHS) is a survey conducted by the Australian Government on alcohol and other drug use in Australia. The survey conducted by the government also provides estimates of licit and illicit drug use. This survey also measures community attitudes to drug use and community support for various drug-related policies.
In the current case ABC Pharmaceuticals is pharma giant which is planning to rollout a drug that is expected to reduce the drug use among the population so as to lead a better and safer lives. Before launching the particular drug, it wants to carry out market research and analyse the drug use pattern across the people of different age groups and gender. This would in turn help the company to take the decisions that will be crucial for components that are to be there in the drug so as to be successful in the market.
Table 1: Raw Data of Survey done by NDSHS
3.DATA MINING & RESEARCH, DATA MODELLING:
Let us understand it with the help of graphs how classification/association/Time series & clustering is being followed in the selected data set to identify the reporting procedures & relevant dashboards. (Help.sap.com)
3.1 DATA MINING:
Data Mining is a technique of classification of data into simpler form & with a relevant pattern. This relevant data is further categorized into different clusters for further predictions.
This particular dataset is a survey data collected from different people both gender by asking them various question on their drug related usage and their attitude to the drug usage. The people who are a part of the survey are categorizes in different age groups consisting of 18+ to 60+. The Raw data has been cleaned & encoded based on the codification used. This will lead to better analysis & reliable conclusions.
Table 2: Raw Data of Survey done by NDSHS
Table 3: Transformed Data of Survey done by NDSHS
3.2 DATA MODELLING:
Data modelling is the step of configuring the data into measures & dimensions in which the analysis is required to be done. (Aldowah, 2019) There are two basic types of Data modelling
1) Planning
2) Analytical
SAP Predictive Analysis is a advanced Business Intelligence tool that can help the industries to convert their business data into the useful business information & thereby helping them to take better and insightful decisions for their businesses operations. (Evaluation, 2005)
Fig 1: SAP Predictive Analysis Home Screen
We will be using the SAP Predictive Analysis for this particular project and apply various statistical techniques to analyze the data at our hand. There are various statistical techniques that can be applied like Clustering, Regression, Association and etc. But before applying any advanced statistical technique one has to be very cautious as there are certain prerequisite that must be fulfilled before applying any of the above technique. As in our case we have a Drug Usage survey data which consists of response and in turn most of the variables are categorical in nature. Thus, for analyzing this particular dataset we will be using Chi-square Test to analyze the association between the different variables.
3.3 RESEARCH & ANALYSIS:
Scope 1 Association Technique Chi-square Test :
Chi-square Test is a correlation technique which is used to analyses correlation between the two categorical variables. (Ritesh Chugh, 2013) By applying the Chi-square test we will be able to analyze if use of any drug like Tobacco, alcohol is significantly related to age group or Gender.
We will be analyzing the association between the Usage of Tobacco between the people of different age groups that are there in the data. For carrying out this particular analysis we will be applying Chi-square test, which can be considered as a correlation but is used to study the association between the categorical; variables.
The output of the Chi-square analysis can be seen below. We will be looking it one by one and interpreting it so as to reach the final conclusions in order to validate the hypothesis.
A) Analyze the association of Tobacco usage among the people of different age groups
    Case Processing Summary
    Particular
    Cases
    
    Valid
    Missing
    Total
    
    N
    Percent
    N
    Percent
    N
    Percent
    Summary tobacco status * Ten year age groups from 14 to 60+
    1000
    100.00%
    0
    0.00%
    1000
    100.00%
Table 4: Co-Processing Summary (Tobacco vs Age Group)
From the above table it can be seen that the are no missing values in the data. The missing a data can have implications on statistical results and being the survey data, the chances of missing values are even higher. But in the current case as we don’t have any missing data, thus, can...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here