Question:
(Learning outcome 6: Will be able to apply data mining algorithms.)

1. (25) You listed pastries according to the different recipes you found on the internet and their ingredients, as in the table below. You want to determine whether the recipe you have found is a Su pastry or a Kol pastry. Suggest a data mining method and show which class the recipe marked with question marks belongs to.

(WARNINGS: No NORMALIZATION is needed. The amounts of the ingredients are fictitious. You have to prove the class with a mathematical calculation, not with your insights or intuitions. You can use pen/pencil and paper and upload your calculation.)

PLEASE replace X with the last digit of your student number. (If your last digit is 0, use 5.)

(PS: You can write your answer on a sheet of paper and insert an image of it into your answer Word document.)

Butter (g)   Milk (ml)   Flour (g)   Water (ml)   Class
X00          200         250         200          Su pastry
200          100         300         200          Kol pastry
150          100         300         200          Kol pastry
X00          50          350         300          Kol pastry
275          150         325         X00          Su pastry
275          175         275         X00          Su pastry
225          150         275         400          ????

(Learning outcome 2: Will be able to relate data mining models with each other.)

2. (These questions are NOT true/false questions. You have to state and discuss your answer. If you just write TRUE / FALSE and do not describe the REASONS, your answer WON'T be scored.)
a. (5) We get rid of missing and inconsistent data with a preprocessing step called data editing, which is one of the data mining preprocessing steps.
b. (5) Decision trees or Naive Bayes produce more understandable and interpretable results for the modeled dataset than Artificial Neural Networks (including Deep Learning).
c. (5) I get information about the performance of the classification model through the confusion matrix, with which the R2 statistic is calculated.
d. (5) I can also use the metrics that measure the similarity of two data objects for the clustering process.
e. (5) For association analysis, thresholding is done with data objects whose values are above the Support value, and the confidence values are checked for the rest.
f. (5) I can also evaluate the results of association analysis with the Silhouette method.

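For question 1, one method that fits the "mathematical calculation" requirement is k-nearest neighbours with Euclidean distance on the raw, unnormalized amounts. The sketch below is only an illustration under assumptions that are not in the question: k = 3, simple majority voting, and hypothetical helper names (`build_training_set`, `knn_classify`). X is passed in as a parameter, so X00 becomes `x * 100`.

```python
import math
from collections import Counter

# The unlabelled recipe: Butter, Milk, Flour, Water
QUERY = (225, 150, 275, 400)


def build_training_set(x):
    """Return the six labelled recipes with X00 replaced by x * 100."""
    x00 = x * 100
    return [
        ((x00, 200, 250, 200), "Su pastry"),
        ((200, 100, 300, 200), "Kol pastry"),
        ((150, 100, 300, 200), "Kol pastry"),
        ((x00, 50, 350, 300), "Kol pastry"),
        ((275, 150, 325, x00), "Su pastry"),
        ((275, 175, 275, x00), "Su pastry"),
    ]


def euclidean(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def knn_classify(x, k=3):
    """Classify QUERY by majority vote among its k nearest recipes.

    k = 3 is an assumption; the question does not fix a value.
    Returns the winning label and the full (distance, label) ranking.
    """
    ranked = sorted(
        ((euclidean(features, QUERY), label)
         for features, label in build_training_set(x)),
        key=lambda pair: pair[0],
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0], ranked


if __name__ == "__main__":
    label, ranked = knn_classify(5)  # example digit only; use your own X
    for distance, cls in ranked:
        print(f"{distance:8.2f}  {cls}")
    print("Predicted class:", label)
```

The predicted class depends on X, because X00 appears in both Su pastry and Kol pastry rows. With X = 5, for example, the three nearest rows are the two Su pastry recipes with Water = 500 and one Kol pastry recipe, so the vote gives Su pastry. With X = 1, the three nearest rows are all Kol pastry.
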
3. (30) According to the confusion matrix given below,
a) Calculate the overall accuracy, overall recall value, overall precision value, and overall F1 score.
b) Comment on the RESULTS and performance of the classification model.

PLEASE replace X with the last digit of your student number. (If your last digit is 0, use 5.)

(PS: You can write your answer on a sheet of paper and insert an image of it into your answer Word document.)

               Prediction
Real states    A      B      C
A              X0     5      0
B              6      0      0
C              0      0      X

4. (17) Insert your Udemy course completion screenshot.
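For the confusion matrix in question 3, the arithmetic can be checked with a short script. This is a sketch under stated assumptions: rows are the real classes and columns the predictions (as the table is labelled), "overall" recall, precision, and F1 are taken as macro averages over the classes A, B, and C, and a metric with a zero denominator is counted as 0. The question does not say which averaging is expected, so these choices and the function names are assumptions.

```python
def build_matrix(x):
    """Confusion matrix from the question: X0 becomes 10 * x, X becomes x."""
    return [
        [10 * x, 5, 0],  # real A
        [6, 0, 0],       # real B
        [0, 0, x],       # real C
    ]


def safe_div(numerator, denominator):
    """Return 0.0 instead of failing when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def confusion_metrics(matrix):
    """matrix[i][j] = count of samples of real class i predicted as class j."""
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    accuracy = sum(matrix[i][i] for i in range(n)) / total

    recalls, precisions, f1s = [], [], []
    for i in range(n):
        tp = matrix[i][i]
        actual = sum(matrix[i])                          # row total
        predicted = sum(matrix[r][i] for r in range(n))  # column total
        recall = safe_div(tp, actual)
        precision = safe_div(tp, predicted)
        recalls.append(recall)
        precisions.append(precision)
        f1s.append(safe_div(2 * precision * recall, precision + recall))

    return {
        "accuracy": accuracy,
        "macro_recall": sum(recalls) / n,
        "macro_precision": sum(precisions) / n,
        "macro_f1": sum(f1s) / n,
        "per_class_recall": recalls,
        "per_class_precision": precisions,
    }


if __name__ == "__main__":
    metrics = confusion_metrics(build_matrix(5))  # example digit only
    for name, value in metrics.items():
        print(name, value)
```

Whatever digit is used for X, class B has no correct predictions (its diagonal cell is 0), so its recall and precision are both 0. That pulls the macro averages well below the accuracy, which is a useful point for the comment asked for in part b.
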
