Question: Consider a dataset with data points each having 3 features, e . g . , x 1 = f Atlanta ; house
Consider a dataset with data points each having features, eg x fAtlanta; house; kg
and x fHouston; house; kg Dene a proper similarity function dxi; xj for this kind of data,
and argue why it is a reasonable choice. Hint: The feature vector consists of categorial and realvalued
features; for categorical variables, it is better to convert them into onehotkeying binary vectors and
use Hamming distance, and for realvalued features, you may use Euclidean distance, for instance. And
then you can combine the similarity measure in some way.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
