Question: In this task you are required to classify the data into one of the ten classes using a decision tree

Edit: added dataset: https://ufile.io/c99q1 [YEAST.DATA]

In this task you are required to classify the data into one of the ten classes using a decision tree. When splitting your data into training and test data, and for your classification process, use a seed of 1234. Then classify the data using the training data and report statistics for your test data. You have the following 4 sub-tasks:

(a) Use a 70-30 split to create your training and test data.
(b) Use your training data to train a model.
(c) Use your model to predict previously unseen data using the test data.
(d) Produce a confusion matrix showing your predictions and report the accuracy of your model.
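The four classification sub-tasks could be approached along the following lines. This is only a sketch under stated assumptions, not a verified solution: it assumes the dataset is a whitespace-delimited file with 10 columns and no header row, that the first column is an identifier and the last is the class label, and that `rpart` is an acceptable decision-tree package. The column names in `yeast_cols`, the file name `yeast.data`, and the helper names `load_yeast` and `classify_yeast` are illustrative choices, not part of the question.

```r
library(rpart)

# Assumed labels for the 10 columns (hypothetical names, not from the question).
yeast_cols <- c("seq_name", "mcg", "gvh", "alm", "mit",
                "erl", "pox", "vac", "nuc", "class")

# Read the whitespace-delimited file and make the class label a factor.
load_yeast <- function(path) {
  yeast <- read.table(path, header = FALSE, col.names = yeast_cols,
                      stringsAsFactors = FALSE)
  yeast$class <- factor(yeast$class)
  yeast
}

classify_yeast <- function(yeast, seed = 1234, train_frac = 0.7) {
  classes <- levels(yeast$class)

  # (a) 70-30 split, seeded so the split is reproducible.
  set.seed(seed)
  train_idx <- sample(nrow(yeast), size = round(train_frac * nrow(yeast)))
  train <- droplevels(yeast[train_idx, ])  # drop classes absent from training
  test  <- yeast[-train_idx, ]

  # (b) Train a classification tree on the eight numeric attributes.
  set.seed(seed)  # rpart's internal cross-validation also uses random numbers
  model <- rpart(class ~ mcg + gvh + alm + mit + erl + pox + vac + nuc,
                 data = train, method = "class")

  # (c) Predict the class of the held-out test rows.
  pred <- predict(model, newdata = test, type = "class")

  # (d) Confusion matrix over all classes, and overall accuracy.
  cm <- table(Predicted = factor(as.character(pred), levels = classes),
              Actual    = factor(as.character(test$class), levels = classes))
  accuracy <- sum(diag(cm)) / sum(cm)

  list(model = model, train = train, test = test,
       confusion = cm, accuracy = accuracy)
}

# Usage, only if the file is present in the working directory.
if (file.exists("yeast.data")) {
  res <- classify_yeast(load_yeast("yeast.data"))
  print(res$confusion)
  cat("Accuracy:", res$accuracy, "\n")
}
```

Forcing both factors onto the same set of levels keeps the confusion matrix square even when a rare class is missing from the predictions, so the diagonal really does count the correct predictions.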
Task 2: Visualization

This task requires you to produce appropriate visualizations of your classification and results.

(a) Produce a visualization of your classification model and how it makes decisions, when using a 70-30 split. You may change the size of the plotting window in RStudio by using {r, fig.width=X, fig.height=Y}, where X and Y are numbers, so that nodes and labels in the tree do not overlap.
(b) Produce a visualization of your confusion matrix as a heatmap. Your heatmap should visualize the predicted variables and normalize these predictions between 0 and 1. Do this task using the 70-30 split, and use the ggplot packages to produce the heatmap visualization.
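The two visualizations might be sketched as follows. The code assumes a fitted `rpart` classification tree and a confusion table with predictions in rows and actual classes in columns. The helper names `plot_tree`, `normalise_confusion`, and `confusion_heatmap` are hypothetical. "Normalize between 0 and 1" is read here as dividing each actual-class column by its total, which is one possible interpretation; dividing by the largest cell would be another.

```r
library(rpart)
library(ggplot2)

# (a) Draw the tree with base rpart plotting; text() adds split labels and counts.
plot_tree <- function(model) {
  plot(model, uniform = TRUE, margin = 0.1)
  text(model, use.n = TRUE, cex = 0.8)
}

# Scale each actual-class column to proportions in [0, 1].
# Columns with no test rows would give 0/0, so those cells are set to 0.
normalise_confusion <- function(cm) {
  cm <- as.table(cm)
  names(dimnames(cm)) <- c("Predicted", "Actual")  # assumed orientation
  norm <- prop.table(cm, margin = 2)
  norm[is.nan(norm)] <- 0
  norm
}

# (b) Heatmap of the normalised confusion matrix with ggplot2.
confusion_heatmap <- function(cm) {
  df <- as.data.frame(normalise_confusion(cm), responseName = "Proportion")
  ggplot(df, aes(x = Actual, y = Predicted, fill = Proportion)) +
    geom_tile(colour = "white") +
    geom_text(aes(label = sprintf("%.2f", Proportion)), size = 3) +
    scale_fill_gradient(low = "white", high = "steelblue", limits = c(0, 1)) +
    labs(x = "Actual class", y = "Predicted class",
         title = "Normalised confusion matrix (70-30 split)") +
    theme_minimal()
}
```

In an R Markdown document, the tree can be given more room with a chunk header such as `{r, fig.width=10, fig.height=8}`; the values 10 and 8 are arbitrary examples. With a fitted model `model` and confusion table `cm`, calling `plot_tree(model)` and `print(confusion_heatmap(cm))` would render the two figures.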
