Question: (PYTHON) (PLEASE SHOW PROOF OF OUTPUT) Your task this week is to write a very simple spam classifier in Python. It will classify messages as

(PYTHON) (PLEASE SHOW PROOF OF OUTPUT)

Your task this week is to write a very simple spam classifier in Python. It will classify messages as either SPAM (unwanted) or HAM (wanted).

The program will have a set of SPAM_WORDS, words that are known to appear in spam messages.

You will also define a spam threshold which reflects the allowed percentage of spam words in the message. You'll compute a 'spam indicator', which is the ratio of spam words to the total number of unique words in the message. You will round the spam indicator to two decimals. If the spam indicator exceeds the spam threshold, the message is classified as spam. Otherwise it is classified as ham. We'll assume that the spam threshold is a constant and has a value of 0.10.

Your program will prompt the user for a message and then will print the corresponding classification.

The program will be case insensitive. The spam words are detected whether they are in lower case or upper case or mixed case.

For simplicity, we'll ignore punctuation.

Testing: (MAKE SURE TO FOLLOW THIS)

Make sure that you test your solution before you submit it. Here are a few test cases with the expected output. Feel free to add your own.

Test case 1 - classify message correctly as SPAM - Make sure the SPAM indicator is correct

Please enter your message: The widow of a deposed dictator wants your help in getting his money out of the country

SPAM indicator: 0.27

This message is: SPAM

Test case 2 - classify message correctly as HAM

Please enter your message: I got a new job offer today. It looks good. Are you free for lunch tomorrow? We can meet downtown at noon.

SPAM indicator: 0.09

This message is: HAM

Test case 3 - classify message correctly regardless of the case

Please enter your message: Do not miss out on this once in a lifetime OPPORTUNITY call NOW

SPAM indicator: 0.23

This message is: SPAM

Test case 4 - classify message correctly based on the number of unique words

Please enter your message: It is urgent that you call us immediately yada yada yada yada yada yada

SPAM indicator: 0.11

This message is: SPAM

Test case 5 - A message with a SPAM indicator 0.1 is classified as HAM.

Please enter your message: Congratulations on your new job! I hope you like it.

SPAM indicator: 0.1

This message is: HAM

Use the following template:

# ----------------------------------------------------------------------------- # Name: spam # Purpose: # # Author: # Date: # ----------------------------------------------------------------------------- """ Enter your module docstring with a one-line overview here  and a more detailed description here. """ SPAM_WORDS = {'opportunity', 'inheritance', 'money', 'rich', 'dictator', 'discount', 'save', 'free', 'offer', 'credit', 'loan', 'winner', 'warranty', 'lifetime', 'medicine', 'claim', 'now', 'urgent', 'expire', 'top', 'plan', 'prize', 'congratulations', 'help', 'widow'} def spam_indicator(text): """  Enter your function docstring here  """  # This function returns the spam indicator rounded to two decimals  def classify(indicator): """  Enter your function docstring here  """  # This function prints the spam classification  def get_input(): """  Enter your function docstring here  """  # Prompt the user for input and return the input  def main(): # Get the user input and save it in a variable  # Call spam_indicator to compute the spam indicator and save it  # Print the spam_indicator  # Call classify to print the classification  if __name__ == '__main__': main()

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

(PYTHON) Your task this week is to write a very simple spam classifier in Python. It will classify messages as either SPAM (unwanted) or HAM (wanted). The program will have a set of SPAM_WORDS, words...

Python assignment with this format: --------------------- """ Enter your module docstring with a one-line overview here and a more detailed description here. """ import string SPAM_WORDS =...

Machine learning-based SMS Spam Filtering Project Statements - Objective For this project, you are asked to implement a detection program supporting Short Message Service (SMS) spam filtering. The...

Please answer this. B D 1 Residential Rate Exercise 3 Calculate a residential rate using an inverted block rate structure. Required data is provided below. Rates should be stated in dollars per...

Hello, hope all is well. Please I need your help with a project, specifically questions 2 and 10 below. I'm going to attach below a screenshot of the data needed to solve the questions Context for...

Hello! hope all is well. Please I need your help with a project, specifically questions 2 and 10 below. I'm going to attach below a screenshot of the data needed to solve the questions Context for...

Hello, hope all is well. Please I need your help with a project, specifically questions 2 and 10 below. I'm going to attach below a screenshot of the data needed to solve the questions Context for...

In Python.. Please show program and screenshot of output and i will upvote and leave positive comment. Thanks. You are the manager of a team of ten programmers who have just completed a seminar in...

1. What is "polymorphism" in python? Please show examples when analyzing. 2. What is "immutability" in python? Please list the objects that are immutable and mutable in python. 3. How could you show...

an uncle that you have never heard off who was living in an island in Thailand died, you are his unique heir and he left you $500M. Since you are a very down to earth student, you have decided that...

What is the net income ratio of a property where the potential gross income is $110,000; the vacancy and collection loss is estimated at $10,000, and the operating expenses are estimated at $30,000?...

Exercise 3.2 Hexadecimal (base 16) is also a commonly used numbering system for representing values in computers. In fact, it has become much more popular than octal. The following table shows pairs...

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

2. Describe the business intelligence capabilities of the PDC portal.

Eliciting data and reporting requirements using interviews, document analysis, requirements workshops, site visits, use cases, data analysis, and workflow analysis.

4. How did Anthems new data analytics capabilities change the Human Resources function at the company?