Microsoft Word - T2 2020 BISY3001 A4 Briefing.docx Unit Assessment Type Group Assignment Assessment Number A4 Assessment Name Data Mining & BI Report Weighting 25% Alignment with Unit and Course ULO1,...

1 answer below »
I have attached both old and new assignment briefing. So, a4 New assignment briefing is the task to do.So there should be report and knime workflow as explained in briefing also, two file,harvard referencing style with website link,if any problem call on 0451715567


Microsoft Word - T2 2020 BISY3001 A4 Briefing.docx Unit Assessment Type Group Assignment Assessment Number A4 Assessment Name Data Mining & BI Report Weighting 25% Alignment with Unit and Course ULO1, ULO2, ULO3, ULO4 Due Date and Time Report (10%): Week 11, Friday, 02 October 2020, 11:59 pm via Moodle. Presentation and QA Session (15%): Week 12 In Class. Assessment Description In this assessment, the students will extend their previous work from assessment A3 Business case understanding. Here, the students have to submit a report of the data mining process on a real-world scenario and a presentation and QA Session will be held based on the report written. The report will consist of the details of every step followed by the students. Detailed Submission Requirements Cover Page • Title • Group members Introduction • Importance of the chosen area • Why this data set is interesting • What has been done so far • Which can be done • Description of the present experiment 1. Data preparation and Feature extraction: 1.1 Select data o Task Select data 1.2 Clean data o Task Clean data o Output Data cleaning report 1.3 Construct data/ feature extraction o Task Construct data o Output Derived attributes o Activities: Derived attributes o Add new attributes to the accessed data o Activities Single-attribute transformations o Output Generated records 2 Modeling 2.1 Select modeling technique o Task – Select Modelling Technique 2.2 Output Modeling technique o Record the actual modeling technique that is used. 2.3 Output Modeling assumption o Activities Define any built-in assumptions made by the technique about the data (e.g. quality, format, distribution). Compare these assumptions with those in the Data Description Report. Make sure that these assumptions hold and step back to the Data Preparation Phase if necessary. You can explain the data file here, even when it is pre prepared. 3 Generate test design 3.1 Task Generate test design o Activities Check existing test designs for each data mining goal separately. Decide on necessary steps (number of iterations, number of folds etc.). Prepare data required for test. (You can use 66% of records for model Building and rest for Testing) 3.2 Build model o Task - Build model Run the modeling tool on the prepared dataset to create one or more models. (Using Knime Tool as shown in the lab). 3.3 Output Parameter settings o Activities - Set initial parameters. Document reasons for choosing those values. o Activities - Run the selected technique on the input dataset to produce the model. Post-process data mining results (e.g. editing rules, display trees). 3.4 Output Model description o Activities - Describe any characteristics of the current model that may be useful for the future. Give a detailed description of the model and any special features. o Activities - State conclusions regarding patterns in the data (if any); sometimes the model reveals important facts about the data without a separate Assessment process (e.g. that the output or conclusion is duplicated in one of the inputs). 4 Evaluation and Conclusion Previous evaluation steps dealt with factors such as the accuracy and generality of the model. This step assesses the degree to which the model meets the business objectives and seeks to determine if there is some business reason why this model is deficient. It compares results with the evaluation criteria defined at the start of the project. A good way of defining the total outputs of a data mining project is to use the equation: RESULTS = MODELS + FINDINGS In this equation we are defining that the total output of the data mining project is not just the models (although they are, of course, important) but also findings which we define as anything (apart from the model) that is important in meeting objectives of the business (or important in leading to new questions, line of approach or side effects (e.g. data quality problems uncovered by the data mining exercise). Note: although the model is directly connected to the business questions, the findings need not be related to any questions or objective, but are important to the initiator of the project. ~ End of Assessment Details ~ Marking Criteria Activities Rank the possible actions. Select one of the possible actions. Document reasons for the choice. Content Marks Cover Page Table of contents 0.5 Executive Summary 0.5 Introduction 0.5 Data Pre-processing and feature extraction 2.5 Experiment 3 Result analysis 2.5 Conclusion 0.5 Presentation and QA 15 Rubrics Marking criteria HD D C P F ULO1: Demonstrate broad understanding of data mining and business intelligence and their benefits to business practice ULO 2: Choose and apply models and key methods for classification, prediction, reduction, exploration, affinity analysis, and customer segmentation that can be applied to data mining as part of a business intelligence strategy ULO3: Analyse appropriate models and methods for classification, prediction, reduction, exploration, affinity analysis, and customer segmentation to data mining ULO4: Propose a data mining approach using real business cases as part of a business intelligence strategy Report, presentation and QA outcome address all the tasks. Report consists of no/minor mistakes. (21-25 marks) Report, presentation and QA outcome address all the tasks. Report consists of a few number of mistakes. (18-20 marks) Report, presentation and QA outcome address most of the contents. Report consists of a few number of mistakes. (15-17 marks) Report, presentation and QA outcome address a few of the contents. Report consists of a good number of mistakes. (13-14 marks) Incomplete report. Unable to perform the experiment/dat a pre- processing/ conclude result. Unable to answer to the question of QA Session and Unable to present the work that has been done. (0-12.5 marks) Misconduct • Engaging someone else to write any part of your assessment for you is classified as misconduct. • To avoid being charged with Misconduct, students need to submit their own work. • Remember that this is a Turnitin assignment and plagiarism will be subject to severe penalties. • The AIH misconduct policy and procedure can be read on the AIH website (https://aih.nsw.edu.au/about-us/policies-procedures/). Late Submission • Late submission is not permitted, practical submission link will close after 1 hour. Special consideration • Students whose ability to submit or attend an assessment item is affected by sickness, misadventure or other circumstances beyond their control, may be eligible for special consideration. No consideration is given when the condition or event is unrelated to the student's performance in a component of the assessment, or when it is considered not to be serious. • Students applying for special consideration must submit the form within 3 days of the due date of the assessment item or exam. • The form can be obtained from the AIH website (https://aih.nsw.edu.au/current- students/student-forms/) or on-campus at Reception. • The request form must be submitted to Student Services. Supporting evidence should be attached. For further information please refer to the Student Assessment Policy and associated Procedure available on (https://aih.nsw.edu.au/about-us/policies-procedures/).
Answered 4 days AfterFeb 10, 2021BISY3001

Answer To: Microsoft Word - T2 2020 BISY3001 A4 Briefing.docx Unit Assessment Type Group Assignment Assessment...

Swapnil answered on Feb 11 2021
132 Votes
75653/Knime Workflow.docx
KNIME Workflow:
75653/Presentation.pptx
DATA MINING CLUSTERING FOR CREDIT CARD CORP
Table of Contents
Introduction
Data Mining Concepts
The Data Mining Process
Data Preparation
Data Clustering
Applications of Clustering in Text Mining
Text Feature Extraction
Data Preparation for k-Means Algorithm
Results
Conclusion
Introduction
The Credit Card Corp uses SAS enterprise miner, a commercial data mining tool in their credit card business applications, a part of CRMD, for tasks like fraud detection, risk minimization, anticipation of resource demands, seeking increase response rates for marketing campaigns and curbing customer attrition etc.
Aware of growing industry and academia support for JDM and ODM, The Credit Card Corp wants us to build an in-house data mining capability to perform tasks mentioned above, for them, using JDM and/or ODM.
The Credit Card Corp offers an online call center application to service millions of their cred
it card calls. The call center representatives enter call notes and save them in the database.
The Credit Card Corp management team is interested in knowing the top reasons for calls in real time, especially whether there are new issues that generate a large call volume.
Data Mining Concepts
Automatic Discovery: Data mining is accomplished by building models. A model uses an algorithm to act on a set of data.
Prediction: Many forms of data mining are predictive. For example, a model might predict income based on education and other demographic factors.
Grouping: Other forms of data mining identify natural groupings in the data.
Actionable Information: Data mining can derive actionable information from large volumes of data. For example, a town planner might use a model that predicts income based on demographics to develop a plan for low-income housing.
A car leasing agency might a use model that identifies customer segments to design a promotion targeting high-value customers.
Data Preparation
Data preparation is something that for data for mining that must exist within the table of view.
The more information can be stored to the separate rows.
A unique capability that can be used to the data mining and its supports for the dimensioned data though table transformations.
Additionally, the data mining that can be used to mine the unstructured data.
The preparation for that data must be a key factor for the data mining projects.
The data should be used to clean the eliminating the inconsistencies and support to the data mining applications.
Data Clustering
When you use the clustering the data then the numbers indicate how often each document can contain each word.
You can check the table to that documents that are probably about one of three main themes: Astronomy and animals.
However, the last two documents are a bit strange.
Documents M contains words with the astronomy and movie stars and Document N contains to the many words having the both astronomy and animals.
The Data Mining Process
This initial phase of a data mining project focuses on understanding the project objectives and requirements. Once you have specified the project from a business perspective, you can formulate it as a data mining problem and develop a preliminary implementation plan.
The data understanding phase involves data collection and exploration. As you take a closer look at the data, you can determine how well it addresses the business problem. You might decide to remove some of the data or add additional data.
In this phase, you select and apply various modelling techniques and calibrate the parameters to optimal values. If the algorithm requires data transformations, you will need to step back to the previous phase to implement them.
Applications of Clustering in Text Mining
Simple clustering: This refers to the creation of clusters of text features. For example: grouping the hits returned by a search engine.
Taxonomy generation: This refers to the generation of hierarchical groupings. For example: a cluster that includes text about car manufacturers is the parent of child clusters that include text about car models.
Topic extraction: This refers to the extraction of the most typical features of a group. For example: the most typical characteristics of documents in each document topic.
Text Feature Extraction
Feature extraction is used for text transportation at two different stages using the text mining process:
A feature extraction process must be performed on the text document. If you can mine the data that should be documented to the feature extraction. This is basically a pre-processing step transforms to the text documents. And that text documents will be converted into the small units of text called features or terms for mining the data.
Basically the transformation process generates the large data numbers of text features from the text documents.
Data Preparation for k-Means Algorithm
k-Means Algorithm: The k-means algorithm basically focus on the distance based clustering algorithm that is basically used to partitions onto the data to predetermine the number of clusters provided to their distinct cases.
Data Preparation for k-Means: The automatic data preparation can be used to performing onto the k-means normalizations. The missing values can be used for the columns with the simple data types that can be used for the interpreted randomly.
Results
This chapter mentions the intermediary results and final output during the data mining process carried out in the project work.
The user logs in on the admin console and chooses the application, chooses the algorithm and clicks on Demo Text Clusters.
Based on nature of comment, the prediction on the customer service quality and product acceptability by customers for usage of the statistics and histogram of comments in Oracle Data Miner.
Conclusion
An effort has been made to investigate the proof-of-concept work for building text clustering infrastructure in a call center application using data mining concepts.
The efforts involved include learning data mining concepts. Since this work is being done for our client, The Credit Card Corp whose policy mentions about ensuring the confidentiality of their data, the actual text clustering process and real-time data used are masked.
The clustering process and the test data referred to the dissertation report and that are mostly generic.
75653/Report.docx
DATA MINING CLUSTERING FOR CREDIT CARD CORP
10
Executive Summary
The demand for the credit cards is growing over the time. It will give the number of credit cards in the system. The distribution has been increased for the local banks that can give the aggressive for the debit cards. Using the data mining techniques, it will give the cover-up for the duplication of users and it will concern the industry, which can give the number of active users. The research has been carried out to the card industry so that can include the types of providers and feature of that cards will give the major acceptability for the cards among to the consumers. It also involves the finding consumer’s perception towards the different providers. You can use the data mining techniques that can find the consumers perception towards the different age group and occupation and it will help to get the credit card corporation technique.
Table of Contents
Executive Summary    2
Introduction    4
Data Mining Concepts    5
Automatic Discovery    5
Prediction    5
Grouping    5
Actionable Information    5
The Data Mining Process    6
Problem Definition    6
Data Gathering and Preparation    6
Model Building and Evaluation    6
Data Preparation    6
Clustering Data    7
Applications of Clustering in Text Mining    8
Simple clustering    8
Taxonomy generation    8
Topic extraction    8
Text Feature Extraction    9
Algorithms    10
k-Means    10
Data Preparation for k-Means    10
Features of a DME Connection    10
Execute Mining Tasks    10
Retrieve DME Capabilities and Metadata    11
Retrieve Version Information    12
Sample Data    12
Result    13
Conclusion    14
Reference    15
Introduction
The Credit Card Corp uses SAS enterprise miner, a commercial data-mining tool in their credit card business applications, a part of CRMD, for tasks like fraud detection, risk minimization, anticipation of resource demands, seeking increase response rates for marketing campaigns and curbing customer attrition etc. Aware of growing industry and academia support for JDM and ODM, The Credit Card Corp wants us to build an in-house data mining capability to perform tasks mentioned above, for them, using JDM and/or ODM. The Credit Card Corp offers an online call center application to service millions of their credit card calls. A software application that can cluster the call notes periodically and show the report with cluster statistics is being developed. In this dissertation, we talk about an investigation into the proof of concept development of text clustering infrastructure using JDM and ODM.
Data Mining Concepts
Automatic Discovery
The building models of data science techniques can accomplish the data mining concepts. The model can be uses a different types of algorithms to act on a set of data. This process can be applying to the model to new data is called scoring in a data mining.
Prediction
In data mining, the many forms are predictive. The prediction statement can be given as the associated probability. The prediction is also called as the confidence to the model. The predictive data mining can be generated to the rules, which are implying to given the outcome.
Grouping
In data mining to identify the other forms, we focus on identifying on the natural groupings into the data.
Actionable Information
In the data mining process, the actionable information gives the more volumes of data. For example, a town planner may be used to the model can predicts the...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here