course is data mining. for question 1, 2, 3 put your answer in one file make a legible file (finalexam.pdf). for problem 4 write R code (problem4.R). place your two files (finalexam.pdf and...

1 answer below »
course is data mining. for question 1, 2, 3 put your answer in one file make a legible file (finalexam.pdf). for problem 4 write R code (problem4.R). place your two files (finalexam.pdf and problem4.R) in the directory named after you zip.


COMP 541 Final Exam Spring 2022 Please write your answers for problems 1, 2, 3 and make a legible pdf file(finalExam.pdf). Write R code for problem 4 ( problem4.R ). Place your two files (finalExam.pdf & problem4.R) in a directory named after you, zip them and submit them to canvas. 1) a) Briefly describe how does k-means clustering algorithm work? b) What type of learning algorithm is K-means clustering? (supervised/ unsupervised/ semi- supervised) 2) The following table consists of training data from an accelerometer database (x, y, z). Let anomaly be the class label attribute. Design a feed-forward neural network with one hidden layer for the given data. Label the nodes in the input and output layers. x y z Anomaly 0.44 - 0.31 - 0.89 yes 0.12 0.32 0.42 no - 0.7 0.86 - 0.92 yes 3) The following table consists of training data from an employee database. The data have been generalized. For example, “31 . . . 35” for age represents the age range of 31 to 35. For a given row entry, count represents the number of data tuples having the values for department, status, age, and salary given in that row. Let status be the class label attribute. Given a data tuple having the values “Sales”, “31. . . 35”, and “31K–35K” for the attributes department, age, and salary, respectively, what would a naive Bayesian classification of the status for the tuple be? Show enough work to convince me that you are using the algorithm and that your answer isn’t just a guess. Please write R code for problem 4 ( problem4.R ) using RStudio. 4) Iris is a built-in dataset that comes with R. It contains 150 observations of flowers from 3 different types of iris species. Load Iris dataset and select only the first four columns into your data-frame and name it “data”. Implement K-means clustering algorithm with k=3 on your data. a) Print the location of each centroid. b) Plot your clusters graph.
Answered 2 days AfterMay 18, 2022

Answer To: course is data mining. for question 1, 2, 3 put your answer in one file make a legible file...

Subhanbasha answered on May 20 2022
80 Votes
2.
Ans: Neural Network with one hidden layer.
3.
Ans:
Let A be the class label attribute
Let B
be the department event
Let C be the age event
Let D be the salary event
Here the P(A/BUCUD) represents the probability of a person be a senior or junior.
P(A/BUCUD) = P(BUCUD/A) P(A)/P(BUCUD)
By...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here