Question: You can read about the ECML/PAKDD discovery challenge 2006 which dealt with email spam detection here: http://www.ecmlpkdd2006.org/challenge.html. Your task is to download the dataset for

You can read about the ECML/PAKDD discovery challenge 2006 which dealt with email spam detection here: http://www.ecmlpkdd2006.org/challenge.html. Your task is to download the dataset for task A from http://www.ecmlpkdd2006.org/data_task_a.zip.

Train a Naive Bayes classifier using the data found in task_a_labeled_train.tf file. Divide the data into a training (70%) and a test sample (30%) and test the performance of your classifier on the test sample.

Also test the performance of your classifier on the data found in task_a_u00_tune.tf and comment on your finding.

In your report, briefly describe how you approached the problem, what results you obtained, what practical difficulties you faced, and how you overcame these difficulties.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!