Question: risk modelling Question 5 Consider the iris dataset for this question. Note that the response of this dataset (Species) has three levels, i.e., setosa, versicolor,

risk modelling Question 5 Consider the iris dataset for this question.

risk modelling

Question 5 Consider the iris dataset for this question. Note that the response of this dataset (Species) has three levels, i.e., setosa, versicolor, and virginica. However, we will only consider twolevel response here. Thus, we will omit the rows with the response of setosa by running the codes below. Then, we split the dataset into training and validation sets with a ratio of 7:3. \begin{tabular}{|l|} \hline Rdat=iris[51:150, ] \#1-50 is setosa, 51-100 is versicolor, and 101-150 is virginica \\ Spec=rep(0, nrow(Rdat)) \\ Spec[Rdat\$Species=="virginica"] =1#0 is versicolor, 1 is virginica \\ Rdat=data.frame(Rdat, Spec) \\ Rdat=Rdat[, -5] \#remove the original column of Species \\ set.seed(1) \\ train=sample(1:nrow(Rdat), 0.7*nrow(Rdat)) \\ train_set=Rdat[train, ] \\ test_set=Rdat[-train, ] \end{tabular} Tut 7_Page 2 BAS2083 Risk Modelling (a) Develop a binary logistic regression model (Model I) with the training set and obtain the test error rate based on the validation set. (b) Note that not all the predictors are significant in Model I, drop the insignificant predictor(s) and obtain Model II. Then, estimate the test error rate. (c) Develop an unpruned tree (Model III) with the training set and estimate the test error rate. [Hint: we need to use the function as.factor( ) for all the classification trees since the response is now 0 and 1] (d) From Model III, prune the tree by the CV approach and only a pruned tree (Model IV). Hence, estimate the test error rate. (e) Use the training set, develop a bagging tree (Model V), and estimate the test error rate. (f) To this end, which model (Models I - V) appears to be the best? Do the results meet your expectation? Justify

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

ECO987Q3 1. (10) Briey discuss the following statements (keep your answers short and concise): (a) The consumption-based capital asset pricing model is inconsistent with high volatility of stock...

Union and Intersection with R Introduction The goal of this project is to reinforce the concepts learned in class using R and RStudio. i Note R is the name of the programming language itself and...

Menu DS1000 - Assignment 3.p.. X + Create Sign in X All tools Edit Convert E-Sign Find text or tools Q Al Assistant Part 1 - Written Answer (Be sure to show all your work) Looking for key takeaways?...

Math 338 Lab Assignment In this lab, we will be working with subsets of a much larger data set commonly known as Fisher's or Anderson's Iris data set, see Fisher, R.A. (1936), The use of multiple...

Please help code the following inn python. Thank you All code must be executable from scratch. Code should be written properly (i.e. put code in functions as needed, declare variables as needed, and...

Attempt Q.1The lengths of pregnancies in a little provincial town are ordinarily appropriated with a mean of 262 days and a standard deviation of 15 days. In what reach would you hope to track down...

1 2 3 4 7 8 9 12 13 14 15 16 17 18 19 20 21 22 23 24 28 29 30 31 38 40 41 44 47 48 49 50 51 62 63 64 66 67 68 69 70 71 73 74 76 77 78 79 80 81 82 85 86 87 88 89 90 91 92 93 94 95 99 100 101 104 105...

FORUM: QUALITATIVE SOCIAL RESEARCH SOZIALFORSCHUNG Volume 2, No. 3, Art. 22 September 2001 Qualitative Data Analysis: Common Phases, Strategic Differences Ian Baptiste Key words: Abstract: This paper...

Help with writing a short analytical summary of 150-200 words on each of the 2 articles below. Article 1: Exploring community-based options for reducing youth crime. The BackTrack program was...

Hi, I need someone to do summary for the article I upload AUDITING: A JOURNAL OF PRACTICE & THEORY Vol. 28, No. 2 November 2009 pp. 1-34 American Accounting Association DOI: 10.2308 / aud.2009.28.2.1...

Describe what is meant by a distributed system.

In Figure 10-E, two examples of a metric vernier micrometer are shown. The micrometer is graduated in hundredths of a millimeter (0.01 mm), and an additional reading in two-thousandths of a...

equity and debtholders share the same investor perspective in regard to ; maxium loss; investment risk; return potential or shaort term profit

Review the unadjusted trial balance below and prepare adjusting journal entries to record the various described items below. Record in the space provided at the bottom of this spreadsheet. After...

4. Ask about how much training you would receive in how to use the companys data products.

15-2 What are the alternative strategies for developing global businesses?

15-3 What are the challenges posed by global information systems and management solutions for these challenges?