Question: 5. In this problem, we will analyze the spam email dataset. These data were collected by Hewlett-Packard (HP) in the 90's to build system for

 5. In this problem, we will analyze the spam email dataset.These data were collected by Hewlett-Packard (HP) in the 90's to build

5. In this problem, we will analyze the spam email dataset. These data were collected by Hewlett-Packard (HP) in the 90's to build system for automatically identifying and dis- carding spam email. Each row of this dataset corresponds to a single email. The response spamEmail is a binary variable taking the value 1 if the email was spam and taking the value O if the email was legitmate. For predictors, we will use: charHash: the percentage of characters in the email that were "#", wordMoney: the percentage of words in the email which were "money". To load this data into R, use: # # Load spam data # install.packages ("kernlab") library(kernlab) data (spam) wordMoney

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!