Project 2: Please answer each question after the blue "Answer".

Classification

In this classification setting, we use a wine dataset of chemical measurements of two variables, Color_intensity and Alcalinity_of_ash, on 130 wines from two cultivars in a region in Italy. The dataset is a subset of the dataset at https://archive.ics.uci.edu/ml/datasets/Wine; see that page or http://archive.ics.uci.edu/ml/machine-learning-databases/wine/wine.names for information on the source of the data.

First, read in the original data and keep the response class and the predictors Alcalinity_of_ash and Color_intensity. We use only 2 classes (there are 3 classes in the original dataset) and re-code them as y = 0 or 1. We also rename Alcalinity_of_ash and Color_intensity to x1 and x2. Then we plot the data to visualize the relation between the variables. Look at the pairwise correlation between x1, x2, and y.

    library(ggplot2)
    library(GGally)

    # Read the raw UCI wine data (no header row, comma-separated)
    wine = read.table(file = "http://archive.ics.uci.edu/ml/machine-learning-databases/wine/wine.data", sep = ",", head = F)
    colnames(wine) = c("class", "Alcohol", "Malic_acid", "Ash", "Alcalinity_of_ash", "Magnesium", "Total_phenols", "Flavanoids", "Nonflavanoid_phenols", "Proanthocyanins", "Color_intensity", "Hue", "OD280/OD315_of_diluted_wines", "Proline")

    # Keep classes 1 and 2 only, and columns class, Alcalinity_of_ash, Color_intensity
    wine = wine[which(wine$class != 3), c(1, 5, 11)]

    # Re-code class 1/2 as 0/1 and rename the columns
    wine$class = as.factor(wine$class - 1)
    colnames(wine) = c("y", "x1", "x2")

    # Pairs plot colored by class
    ggpairs(wine, ggplot2::aes(color = y)) + theme_bw(18)

[Figure: ggpairs plot matrix of y, x1, and x2, colored by y. The x1-x2 panel reports Corr: -0.433***, with within-class values 0: -0.211 and 1: -0.086.]

The plot shows that the dataset is roughly balanced between the two classes. Next, we would like to use logistic regression, LDA, and KNN to estimate the test error rates. To do so, we will use the validation set approach, so we now split the dataset into train and test datasets. n
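The validation set approach described above can be sketched as follows. This is only an illustration under stated assumptions, not the assignment's own solution: the helper name `validation_errors`, the 50/50 split, the seed, and k = 5 are all hypothetical choices, since the question text is cut off before it gives its own values. It assumes a data frame with a two-level factor `y` and numeric columns `x1` and `x2`, as built above.

```r
library(MASS)   # lda()
library(class)  # knn()

# Hypothetical helper: estimate test error rates from one train/test split.
# train_frac, k, and seed are illustrative assumptions, not values from the assignment.
validation_errors <- function(dat, train_frac = 0.5, k = 5, seed = 1) {
  set.seed(seed)
  n <- nrow(dat)
  train_idx <- sample(n, size = floor(train_frac * n))
  train <- dat[train_idx, ]
  test  <- dat[-train_idx, ]

  # Logistic regression: predict the second factor level when P(y = 1 | x) > 0.5
  fit_glm  <- glm(y ~ x1 + x2, data = train, family = binomial)
  p_hat    <- predict(fit_glm, newdata = test, type = "response")
  pred_glm <- ifelse(p_hat > 0.5, levels(dat$y)[2], levels(dat$y)[1])

  # Linear discriminant analysis
  fit_lda  <- lda(y ~ x1 + x2, data = train)
  pred_lda <- predict(fit_lda, newdata = test)$class

  # KNN on predictors standardized with the training means and sds
  ctr <- colMeans(train[, c("x1", "x2")])
  scl <- apply(train[, c("x1", "x2")], 2, sd)
  train_x  <- scale(train[, c("x1", "x2")], center = ctr, scale = scl)
  test_x   <- scale(test[, c("x1", "x2")], center = ctr, scale = scl)
  pred_knn <- knn(train_x, test_x, cl = train$y, k = k)

  # Test error rate = share of misclassified test observations
  c(logistic = mean(pred_glm != test$y),
    lda      = mean(pred_lda != test$y),
    knn      = mean(pred_knn != test$y))
}
```

With the `wine` data frame from the code above, `validation_errors(wine)` would return one estimated test error rate per method. Because the estimate depends on a single random split, changing `seed` will generally change the numbers.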
