Question: This is R studio code Hand in an R script file for this assignment. You are asked some questions below that amount to is one

This is R studio code

Hand in an R script file for this assignment. You are asked some questions below that amount to "is one model better than another", so add comments to your R file that answer those questions. In the text you write to answer those questions, you need to include the metric(s) and reasoning you're using to decide one model should be preferred over another.

In Chapter 5 of the textbook, the author builds decision trees using the German credit data and a decision table or rule list using the mushroom dataset. For this assignment, let's flip that around.

Build a C5.0 decision tree for the mushroom dataset. Compare a regular tree (trials = 1, the default value) with a boosted tree (trials = 10 or so), and then compare the best of these two trees with the decision table from the book for this dataset. Be sure to include plots and summaries of each tree. Do the regular and boosted trees agree on which feature is most important?

Build a decision table using JRip for the German credit dataset. Is it a better or worse model than the best decision tree the author builds in the text? Be sure to print out the rule list from JRip in your report.

I have been fighting with this code my teacher says its a problem with class and -ncol(mushrooms) but wont tell me how to fix it please help

#install.packages(c("mlbench", "C50", "OneR")) #credit_df <- read.table("mushrooms.csv") #read.cvs("mushrooms.csv", stringAsFactors = TRUE) install.packages("arules")

library(C50) library(mlbench) library(RWeka)

# Load the mushroom dataset #data(mushrooms) mushrooms <- read.csv("https://raw.githubusercontent.com/PacktPublishing/Machine-Learning-with-R-Third-Edition/master/Chapter05/mushrooms.csv")

#str(mushrooms) summary(mushrooms)

# The Id column has no predictive value since it's unique for each # row and would be different for any new data the model encounters # this is a new way to drop a column in R. Just set it to NULL. for (i in 1: ncol(mushrooms)) { mushrooms[,i] <- as.factor(mushrooms[,i]) } #mushrooms('type', 'cap_shape', 'cap_surface', 'cap_color', 'bruises', 'oder', 'gill_attachment') #mushrooms <-as.factor(mushrooms) # class feature is last, so it's index is ncol(mushroom) # drop it from x and call it y training_inds <- sort(sample(nrow(mushrooms), nrow(mushrooms)*0.7)) x_train <- mushrooms[training_inds, -ncol(mushrooms)] y_train <- mushrooms$class[training_inds] x_test <- mushrooms[-training_inds, -ncol(mushrooms)] y_test <- mushrooms$class[-training_inds]

# C5.0 decision trees library(C50)

c50_model <- C5.0(x_train, y_train) c50_preds <- predict(c50_model, x_test)

table(c50_preds, y_test)

mean(c50_preds == y_test)

summary(c50_model)

c50_model <- C5.0(x_train, y_train, trials=10) c50_preds <- predict(c50_model, x_test)

table(c50_preds, y_test)

mean(c50_preds == y_test) # FYI, this is supposed to work but doesn't. So it's commented out. #plot(c50_model)

c50_model <- C5.0(x_train, y_train, trials=100) c50_preds <- predict(c50_model, x_test)

table(c50_preds, y_test)

mean(c50_preds == y_test)

library("OneR")

training_data <- mushroom[training_inds, ] testing_data <- mushroom[-training_inds, ]

oner_model <- OneR(Class ~ ., data=training_data) oner_preds <- predict(oner_model , testing_data)

table(oner_preds, testing_data[,ncol(mushroom)]) mean(oner_preds == y_test)

# let's see the rule! summary(oner_model)

# Weka is a Java library (and nice standalone data mining software) # it includes a _different_ oneR model, so note the warning message # when we install it library("RWeka")

jrip_model <- JRip(Class ~ ., data=training_data) jrip_preds <- predict(jrip_model, testing_data)

table(jrip_preds, y_test) mean(jrip_preds == y_test)

# let's see the decision table jrip_model

# just FYI - Weka has a decision tree model, too. We can actually plot it. # The book tells us how C4.8 is a predecessor of C5.0 j50_model <- J48(Class ~ ., data=training_data) plot(j50_model) j50_model

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

What is the R Studio code that I would put into R Studio for each of these questions? Thank you - Baths: The number of bathrooms - Beds: The number of bedrooms - Area: The livable area, in square...

Solve the below code exactly in R studio. you will write an R script to calculate the relative volatility for the three given stocks: enb, baba, and aapl The following are the names of the files:...

This assignment will give you a very brief introduction to R What is R? R is a software platform and computer programming language for dealing with data, statistical analyses, and visualizations. It...

Please answer me page 51 to page 56 on the attachment. is a multiple choice questions. Thank you FAC1502/101/3/2016 Tutorial letter 101/3/2016 Financial accounting concepts, principles and procedures...

2. Crypto In contrast with the small isolated exercises you have been doing so far, the goal of this assignment is to give you the opportunity to create something a little larger and more complex....

I am wondering if anyone has corrected solutions to fnce 300 assignments 1 2 and 3. I am concered about my answers and would love to compare. Pretty sure on my answers for 1 and 2 mainly interested...

MNG3701/101/3/2016 Tutorial Letter 101/3/2016 Strategic Planning MNG3701 Semesters 1 and 2 Department of Business Management IMPORTANT INFORMATION: Please activate your myUnisa and myLife email...

S10_Chap7_RQ_WPC-Database Lab This assignment is from the Kroenke book Chap 7 Review Questions (RQ). Create a new folder, WPC-Database, in the Projects folder, found in the SQL Server Management...

This SQL project in three Phases and the instruction as below: Instructions Follow the steps and let me know if you have any questions. The intent of the project is to create a project you can use in...

I have to create a program in C and I can't figure it out. The program has to read a source file. Please help. /******************************************************************** PROJECT: Glossary...

Explain why the carbocation shown in Figure 8.8 has a longer lifetime than it does under the conditions shown in Figure 9.8.

Propose a synthesis of p-(dim-ethylamine) azobenzene from benzene as your only organic starting material.

Attend a handover and write down the risks you identified tgat was not addressed?

Questions Q1. Write a Python program to retrieve the first and last colors from the following list: color_list = ["red", "green", "white", "blue", "black") Q2. Given the following dictionary,...

| What is it that I am authentically interested in accomplishing?

5. Brainstorm a list of principles or lessons learned from the examples discussed in your group. How might these principles and lessons help you with future job searches and career planning?

- Does the mission or purpose of your company make you feel that your job is important?