Question: Assignment Exercise 2: PART A: Q1: Using a data set of your choice, do the following. Remember that the only pre-requisite before setting about with

Assignment Exercise 2: PART A: Q1: Using a data set of your choice, do the following. Remember that the only pre-requisite before setting about with Naive classification is to have an existing set of examples for each category/ class [Training data] that we wish to bucket / categorize pieces of text into Classifier A: Train a Naive Bayes classifier using Bag of words approach. Compare performance for the above model [confusion matrix + performance metrics accuracy, precision and recall]. Q2: Design and implement Hidden Markov Model (HMM) based Part-of-Speech (POS) tagger with the following assumptions: - Markov assumption length 1 - Probability of any state sk depends on its previous state only, i.e., P(sk | sk-1) Evaluate the following in your HMM and report: - Precision, recall and F1-score. - Confusion matrix (Each element Aij of matrix A denotes the number of times tag i classified as tag j) Note: The brown corpus in NLTK comes with its data POS tagged, so you can use the brown tagged POS data set to train your HMM PART B: Given a tweet and an emotion X, determine the intensity or degree of emotion X felt by the speaker -- a real-valued score between 0 and 1. The maximum possible score 1 stands for feeling the maximum amount of emotion X (or having a mental state maximally inclined towards feeling emotion X). The minimum possible score 0 stands for feeling the least amount of emotion X (or having a mental state maximally away from feeling emotion X). The tweet along with the emotion X will be referred to as an instance. Features to implement: You are provided with a paper pdf. You need to construct the feature vector as given in the paper ((Section 2.3.1) + N-grams features (N= 1, 2)) DATA SET: https://saifmohammad.com/WebPages/EmotionIntensity-SharedTask.html You are required to use the training sets of the two emotion classes - Anger and Joy, for training the model and the test sets of the emotion classes for testing the model. Machine learning algorithms: A. Naive Bayes: Implement the algorithm from scratch on the provided dataset. Train the model on the train set and report the performance metrics on the test set. Evaluation: a. Report accuracy, precision, recall

with code in python solve any one part and please send code in executable form

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

03-3 Using previously learned techniques load the data for Exercise 1.5 (page 16) into the CrunchIt spreadsheet environment. Click the \"Graphics\" tab and select \"Histogram\". In the \"Sample\" box...

PSY 230 Module 3 Assignment (A-3) For this assignment you will use multiple data sets, each embedded within the assignment below. Q1) Find the z values that form the boundaries of the critical region...

Download and analyze the data in the CreditCard dataset. The following variables are included in the dataset: 1. card: was the application for a card accepted? (Binary: 1/0) Response Variable 2....

CSC 411 / CSC 2515 Introduction to Machine Learning ASSIGNMENT # 1 Due at NOON on: Oct. 19 (CSC 411) / Oct. 20 (CSC 2515) 1 Logistic Regression (40 points) 1.1 (10 points) Bayes' Rule Suppose you...

PSYC 354 HOMEWORK 5 Z-Scores When submitting this file, be sure the filename includes your full name, course and section. Example: HW5_JohnDoe_354B01 Be sure you have reviewed this module/week's...

The HamilGine Piston Decision This case addresses many issues that affect insourcing/outsourcing decisions. A complex and important topic facing businesses today is whether to produce a component,...

PSYC 354 HOMEWORK 8 Single-Sample T-Test When submitting this file, be sure the filename includes your full name, course and section. Example: HW8_JohnDoe_354B01 Be sure you have reviewed this...

Please solve the edit me parts using R coding ##### Homework tips: 1. Recall the following useful RStudio hotkeys. Keystroke | Description ----------|------------- ` ` | Autocompletes commands and...

Direct Toiletries manufactures several varieties of toiletries utilized by small and large hotels around the world, You are the cost analysis for Direct Toiletries. It is your responsibility to...

A sales manager for a major pharmaceutical company analyzes last years sales data for her 96 sales representatives, grouping them by region (1 = East Coast United States; 2 = Mid West United States;...

Assume Manuela's FCF today ( FCFO ) is $ 5 4 8 2 8 million and the FCF ' s are expected to grow at a constant rate of 6 . 1 2 % per year in the future. The market value of Manuela's outstanding debt...

Currently a company that designs Web sites PLEASE ANSWER FOR ALL SECTIONS A AND B Currently a company that designs Web sites has five customers in its backlog. The time since the order arrived,...

2. What, according to Sergey, was strange at this meeting?

1. Opportunities for face-to-face contact will be diminished, and information from nonverbal cues will be reduced. Consequently, opportunities for random spontaneous information sharing will be...

3. [From your subordinates] We really think that we deserve more money for doing this job. a. Thats silly. Youre really not worth what were paying you now. b. You know that the legislature wont give...