Question: You can read about the ECML/PAKDD discovery challenge 2006 which dealt with email spam detection here: http://www.ecmlpkdd2006.org/challenge.html. Your task is to download the dataset for
You can read about the ECML/PAKDD discovery challenge 2006 which dealt with email spam detection here: http://www.ecmlpkdd2006.org/challenge.html. Your task is to download the dataset for task A from http://www.ecmlpkdd2006.org/data_task_a.zip.
Train a Naive Bayes classifier using the data found in task_a_labeled_train.tf file. Divide the data into a training (70%) and a test sample (30%) and test the performance of your classifier on the test sample.
Also test the performance of your classifier on the data found in task_a_u00_tune.tf and comment on your finding.
In your report, briefly describe how you approached the problem, what results you obtained, what practical difficulties you faced, and how you overcame these difficulties.
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
