The Police Department has asked you to assist with the interrogation of several suspects accused of...
Question:
The Police Department has asked you to assist with the interrogation of several suspects accused of cheating in an AI course. Each suspect makes a single statement during interrogation, and from this statement you are to determine whether they are Guilty (G) or Innocent (I). To assist you, the police department provides you with records of past cheating cases: statements made by suspects, labeled with whether the suspect ended up being found Guilty (G) or Innocent (I).

Training Statements                      (G/I)
I am definitely innocent, officer        (I)
Officer, I swear I am not lying          (I)
I am not lying, I swear                  (I)
I am innocent, officer, I swear          (G)
Officer, I am definitely not lying       (G)

(a) You plan to apply a Naive Bayes classifier to this problem. Given the class label, this model treats each word in the sentence as an independent feature. The parameters of this model take the form of a prior and conditional probabilities, e.g. P(G), P(Word = "am" | G). Using the training data, find the maximum likelihood estimate of the parameters (they will be the class-conditional relative frequencies of each word). Ignore punctuation and capitalization.

Word          P(Word | G)    P(Word | I)
I
am
definitely
innocent
officer
swear
not
lying

Prior    Prob
G
I

(b) Using the probabilities found above, compute the following probabilities:
P(G, "Officer, I am not lying")
P(I, "Officer, I am not lying")
What is the most likely classification for the above sentence?

(c) Another suspect has been caught; this time, he gives the statement "I am honest, officer". Using the same Naive Bayes model, compute the probability P(G, "I am honest, officer").

(d) Instead of maximum likelihood, use Laplace (add-one) smoothing to find new values for the model parameters (using the same training data as in (a)) and use these new parameters to compute the probability P(G | "I am honest, officer"). Assume for the purposes of smoothing that every word you will see is contained in your training set and test sentence (but do not use the test sentence when computing parameter estimates). Make sure to smooth both the prior P(G), P(I) and the conditional distributions.
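One way to check the hand calculations for parts (a) through (d) is a short script. The following is a minimal sketch, not the course's reference solution. It assumes the flattened training text splits into the five statements listed in `TRAIN` and that the labels read I, I, I, G, G (the "(1)" marks are taken to be a misread "(I)"). The names `TRAIN`, `tokenize`, `fit`, `joint`, and `posterior` are hypothetical helpers introduced here. Exact fractions are used so the results can be compared directly with hand-computed values.

```python
from collections import Counter
from fractions import Fraction
import re

# ASSUMPTION: this split into five statements and the labels I, I, I, G, G
# are reconstructed from the flattened question text.
TRAIN = [
    ("I am definitely innocent, officer", "I"),
    ("Officer, I swear I am not lying", "I"),
    ("I am not lying, I swear", "I"),
    ("I am innocent, officer, I swear", "G"),
    ("Officer, I am definitely not lying", "G"),
]


def tokenize(text):
    """Lowercase and drop punctuation, as the question instructs."""
    return re.findall(r"[a-z]+", text.lower())


def fit(train, alpha=0, extra_vocab=()):
    """Estimate (prior, cond) as exact fractions.

    alpha=0 gives maximum likelihood; alpha=1 gives add-one smoothing.
    extra_vocab widens the vocabulary without contributing any counts.
    """
    labels = sorted({label for _, label in train})
    doc_counts = Counter(label for _, label in train)
    word_counts = {label: Counter() for label in labels}
    for text, label in train:
        word_counts[label].update(tokenize(text))

    vocab = set(extra_vocab)
    for counts in word_counts.values():
        vocab.update(counts)

    prior = {
        c: Fraction(doc_counts[c] + alpha, len(train) + alpha * len(labels))
        for c in labels
    }
    cond = {}
    for c in labels:
        total = sum(word_counts[c].values()) + alpha * len(vocab)
        cond[c] = {w: Fraction(word_counts[c][w] + alpha, total) for w in vocab}
    return prior, cond


def joint(prior, cond, label, sentence):
    """P(label, sentence) = P(label) * product of P(word | label)."""
    p = prior[label]
    for w in tokenize(sentence):
        # A word outside the vocabulary has probability zero.
        p *= cond[label].get(w, Fraction(0))
    return p


def posterior(prior, cond, label, sentence):
    """P(label | sentence), obtained by normalizing the joints over labels."""
    joints = {c: joint(prior, cond, c, sentence) for c in prior}
    return joints[label] / sum(joints.values())


# Parts (a)-(c): maximum likelihood estimates.
mle_prior, mle_cond = fit(TRAIN)
for c in ("G", "I"):
    print(c, joint(mle_prior, mle_cond, c, "Officer, I am not lying"))
print(joint(mle_prior, mle_cond, "G", "I am honest, officer"))

# Part (d): add-one smoothing; the test sentence only extends the vocabulary.
test_sentence = "I am honest, officer"
sm_prior, sm_cond = fit(TRAIN, alpha=1, extra_vocab=tokenize(test_sentence))
print(joint(sm_prior, sm_cond, "G", test_sentence))
print(posterior(sm_prior, sm_cond, "G", test_sentence))
```

Under these assumptions, the unsmoothed joint in part (c) is zero because "honest" never occurs in the training statements, which is the situation add-one smoothing in part (d) is meant to avoid. Passing the test sentence through `extra_vocab` enlarges the vocabulary used in the smoothing denominator without adding any counts, in line with the instruction not to use the test sentence when estimating parameters.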
Expert Answer:
(a) Maximum Likelihood Estimation of Parameters. Count words: count the occurrences of each word in the training data. "am": 4 occurrences out of 20 total words. d...
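For reference, the estimates the question asks for can be written in the standard form below. The notation is introduced here for illustration and is not taken from the original answer: $N_c$ is the number of training statements labeled $c$, $N$ the total number of statements, $K = 2$ the number of classes, $n_c(w)$ the number of times word $w$ occurs in statements of class $c$, and $V$ the vocabulary (for part (d), every training word plus the words of the test sentence).

```latex
% Maximum likelihood (relative frequency) estimates, parts (a)-(c)
\hat{P}_{\mathrm{ML}}(c) = \frac{N_c}{N},
\qquad
\hat{P}_{\mathrm{ML}}(w \mid c) = \frac{n_c(w)}{\sum_{w' \in V} n_c(w')}

% Laplace (add-one) smoothed estimates, part (d)
\hat{P}_{\mathrm{Lap}}(c) = \frac{N_c + 1}{N + K},
\qquad
\hat{P}_{\mathrm{Lap}}(w \mid c) = \frac{n_c(w) + 1}{\sum_{w' \in V} n_c(w') + |V|}

% Naive Bayes joint and posterior for a sentence w_1, \dots, w_m
P(c, w_1, \dots, w_m) = P(c) \prod_{i=1}^{m} P(w_i \mid c),
\qquad
P(c \mid w_1, \dots, w_m) = \frac{P(c, w_1, \dots, w_m)}{\sum_{c'} P(c', w_1, \dots, w_m)}
```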