Question: 5.1) (10 points) Consider a logistic regression model to classify a document into two classes: postive class (sports), negative class (not sports). The model is

5.1) (10 points) Consider a logistic regression model to classify a document into two classes: postive class (sports), negative class (not sports). The

5.1) (10 points) Consider a logistic regression model to classify a document into two classes: postive class (sports), negative class (not sports). The model is trained and the weights are wi-In(), w - In(2) and w - In(3) with a bias term uo = 0. A given document has feature vector i = 1, 21, and = 1. Based on this logistic regression model, what is the probability that this document is about sports? Please provide your answer in the form of a fraction, you do not need a calculator for this question, logs for the weights just to make the math easier. 5.2) (11 points) For this question, you are interested in modelling customer response to food deals offered by different restaurants. The data shown in the table below lists 10 data points: the training data with labels. The first column in the table lists the deal id. The second column indicates the label, whether the user got the deal and ordered the food or not. The remaining three attributes describe the deals: Price Category specifies where the price level of the deal, Food type indicates whether the deal is on is Pizan or on Pide, Delivery speed indicates whether the restaurant's delivery speed is Fast or Slow. You will try to determine whether the customer will get the deal based on the attributes of the food heal (Price Category, Food type, Delivery Speed) by building a decision tree classifier. Deal id Ordered (Y) Price Category (X1) Food Type(X2) Delivery Speed(X) 1 No High Pide Slow 2 Yes Low Pizza Fast 3 No High Pizza Slow 4 No Medium Pide Fast 5 Yes Low Pizza Slow 6 No High Pide Slow 7 Yes Low Pide Slow 8 Yes Medium Pizza Fast 9 Yes Medium Pizza Fast 10 No Medium Pide Fast Table 1: Deal order data, 5.2a) (4 pts. What is the initial entropy of Ordered based on the training data? Show your work. 5.2b) [7 pts.) Assume that Price Category is chosen for the root of the decision tree. What is the information gain associated with this attribute? 5.3) (5 points, -2 for wrong answor] Which of the following methods will cluster the data in panel (*) of the figure below into the two clusters (red circle and blue horizontal line) shown in panel (b)? Every dot in the circle and the line is a data point. In all the options that involve hierarchical clustering, the algorithm is run until we obtain two clusters. Choose ALL correct choices: there may be more than one correct choice, but there is alwnys at least one correct choice. No explanation is required. (a) Unclustered (b) Desired clustering ************************************ ******** a) Hierarchical agglomerative clustering with Euclidean distance and complete linkage (MAX) b) Hierarchical agglomerative clustering with Euclidean distance and single linkage (MIN) c) Hierarchical agglomerative clustering with Euclidean distance and centroid linkage d) k-means clustering with k = 2

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Question 5 - Machine Learning [26 pts.] Parts 5.1, 5.2 and 5.3 are unrelated. 5.1) [10 points] Consider a logistic regression model to classify a document into two classes: postive class (sports),...

CSC 411 / CSC 2515 Introduction to Machine Learning ASSIGNMENT # 1 Due at NOON on: Oct. 19 (CSC 411) / Oct. 20 (CSC 2515) 1 Logistic Regression (40 points) 1.1 (10 points) Bayes' Rule Suppose you...

Task 2: Multinomial logistic regression (softmax classifier) on MNIST dataset In this task, we will implement the generalization of binary logistic regression to classify multiple classes (10 digits)...

Need help getting started on these questions. I am supposed to add code where it says "implement me" and write the answer where it says answer in one or two line. Need to fill in the "Implement me"...

Need to fill in all parts that say "Implement me" and answer in one or two lines here. The following cell contains code that will be referred to as the Preprocessing Block from now on. It contains a...

Exercises Chapter 2 2.1 Marginal and conditional probability: The social mobility data from Section 2.5 gives a joint probability distribution on (Y1 , Y2 )= (father's occupation, son's occupation)....

STAB22H3 Statistics I 2020 1.A researcher used a linear regression model to investigate the relationship between sexually transmitted disease rate (y) and poverty rate (x) in the U.S.A. The...

(REALLY NEED HELP CREATING THIS CODE IN FULL AND ITS COMPLETE ENTIRETY... ALL OF THE DETAILS ARE PROVIDED AND THE CODE SHOULD HAVE EACH PART FOR EACH QUESTION LABELED SEPARATELY... PLEASE HELP ME AND...

Download and analyze the data in the CreditCard dataset. The following variables are included in the dataset: 1. card: was the application for a card accepted? (Binary: 1/0) Response Variable 2....

hi, please help me with these questions Question 1. [3 marks] Consider the problem of predicting the amount of rainfall that will take place on a given day using linear regression [not logistic...

Hazel is a quoted company which operates in the IT industry. In the year ended 30 June 2019, Hazel reported a profit after tax of 2,320,000. On 1 July 2018 Hazel had 4 million 0.50 shares in issue....

Write a Rock Paper Scissors program as below: I. Ask for user's choices by using the Scanner class II. Send both choices into a method name determine as parameters III. Have the method print out who...

When the cost - of - goods - solid method is used adjust cout to "net realicable value in welower - ur uns ( LCNRV ) approach what account is debited? Alowance to Redoce Imentory to Market Value....

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

Give an example of a Composite Primary Key use in a HCM Payroll Table.

How are Third Normal Form rules disregarded in Dimensional Database Design?

Provide examples of Dimensional Tables.