Question: Problem 2 : Spam Email Classification begin { tabular } { | l | l | l | l | l | l |
Problem : Spam Email Classification
begintabularlllllll
hline & begintabularl
Sender
Domain
Nominal
endtabular & begintabularl
Email Length
Continuous
endtabular & begintabularl
Attachment
Nominal
endtabular & begintabularl
Spam Link
Count
Discrete
endtabular & begintabularl
Urgency
Level
Ordinal
endtabular & begintabularl
Is
Sampam
endtabular
hline & example.com & & No & & Low & No
hline & spammer.net & & Yes & & High & Yes
hline & legitimate.org & & No & & Medium & No
hline & example.com & & Yes & & High & Yes
hline & legitimate.org & & No & & Low & No
hline & spammer.net & & No & & Medium & Yes
hline & example.com & & Yes & & Medium & No
hline & spammer.net & & No & & High & Yes
hline & legitimate.org & & Yes & & High & Yes
hline & legitimate.org & & & & Now
hline
endtabular
You are tasked with classifying emails as spam or not spam using Naive Bayes classification. You need to estimate the conditional probabilities for different attributes and the class variable Is Spam."
Exercises:
Calculate Prior Probabilities: Calculate the prior probabilities of an email being spam or not spam based on the provided dataset.
Conditional Probabilities for Nominal Attributes: Calculate the conditional probabilities of an email being spam or not spam for different sender domains Nominal attribute
Conditional Probabilities for Continuous Attributes: Calculate the conditional probabilities of an email being spam or not spam based on the email length Continuous attribute You can assume a Gaussian distribution.
Conditional Probabilities for Nominal Attributes with Multiple Categories: Calculate the conditional probabilities of an email being spam or not spam based on whether it has an attachment Nominal attribute
Conditional Probabilities for Discrete Attributes: Calculate the conditional probabilities of an email being spam or not spam based on the number of spam links Discrete attribute
Conditional Probabilities for Ordinal Attributes: Calculate the conditional probabilities of an email being spam or not spam based on the urgency level Ordinal attribute
Spam Classification: Given a new email with the following attributes, use Naive Bayes to classify it as spam or not spam:
Sender Domain: "spammy.biz"
Email Length:
Attachment: Yes
Spam Link Count:
Urgency Level: Medium
These exercises will help you practice estimating probabilities for a Naive Bayes classification problem with different types of attributes.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
