Question: Part I: Similarity Measures ( 5 points ) What do you mean by similarity metric? List and explain any two. ( 0 3 points )
Part I: Similarity Measures points
What do you mean by similarity metric? List and explain any two. points
Find Euclidean distance between two points in D space plane where point
is denoted by and point is denoted by point
Find Jaccard similarity between two sets:
Set :
Set : point
Part II: Classification trees points
Dataset description: A simulated data set containing sales of child car seats at different
stores. It has observations on the following variables.
Variables with description:
High: A factor variable to denote if a store is a high sales store or lowsales store. It is
equal to "Yes" if a store sells more than units else No
CompPrice: Price charged by a competitor at each location in dollars
Income: Community income level in thousands of dollars
Advertising: Local advertising budget for the company at each location in thousands of
dollars
Price: The price company charges for car seats at each site in dollars
ShelveLoc: A factor with levels Bad, Good, and Medium indicating the quality of the
shelving location for the car seats at each site
Age: Average age of the local population in years
swer the following questions:
Out of the total observations in the node within the orange box, what is the proportion
of observations with the target variable label "Yes"? points
Calculate the total number of observations in the node inside the box. points
What is the entropy of the root node in this dataset? points
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
