Question: 5. In this problem, we will analyze the spam email dataset. These data were collected by Hewlett-Packard (HP) in the 90's to build system for


5. In this problem, we will analyze the spam email dataset. These data were collected by Hewlett-Packard (HP) in the 90's to build system for automatically identifying and dis- carding spam email. Each row of this dataset corresponds to a single email. The response spamEmail is a binary variable taking the value 1 if the email was spam and taking the value O if the email was legitmate. For predictors, we will use: charHash: the percentage of characters in the email that were "#", wordMoney: the percentage of words in the email which were "money". To load this data into R, use: # # Load spam data # install.packages ("kernlab") library(kernlab) data (spam) wordMoney
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
