Question: ( a ) Please identify the appropriate data transformation methods for the following situations. Give a brief description about your answers: [ 4 ] Consider
a Please identify the appropriate data transformation methods for the following situations.
Give a brief description about your answers:
Consider a dataset containing information about student performance in two
subjects: Math and English. The Math scores range from and mean
standard deviation while the English scores range from to mean
standard deviation
For each feature, apply normalization transformed data has: and
calculate the new mean and new standard deviation of the normalized feature.
Compare their means and standard deviations. And
for each feature, apply standardization to it and show the range of transformed
data and compare their ranges.
During the design of an artificial neural network, we sometimes need to transform
a variable that has a range of to an open set zin Note that
monotonically increases as increases in this transformation. Please specify a
proper function for such transformation.
b In natural language processing NLP there are diverse ways to represent words such
as onehot encoding, bag of words, IDF, and distributed word representations. In
one hot encoding, a bit vector whose length is the size of the vocabulary of words is
created, where only the associated word bit is on ie while all other bits are off ie
Here is a toy example: suppose there is a dimensional feature vector to represent
a vocabulary of five words: king queen, man, woman, power In this case, 'king' is
encoded into 'queen' is encoded into etc. Due to the nature of this
representation, the feature vector encodes the vocabulary of a sentence where all words
are equally distant. On the other hand, in distributed word vectors, a realvalued
vector whose length is defined by some common properties of words is created, then
each word can be represented as a linear combination of the defined properties. Using
the toy example above, given a dimensional feature vector of man woman, power as
the common properties, then words such as 'king', 'queen', 'man', and 'woman' could be
encoded into
and respectively.
In this case, if you subtract a vector of 'man' from a vector of 'king', and add a vector
of 'woman', then you will get a vector close to a vector of 'queen'.
What is a major advantagedisadvantage of one hot encoding as compared to
distributed word vectors. Briefly justify your answer.
What is a major advantagedisadvantage of distributed word vectors as com
pared to one hot encoding. Briefly justify your answer.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
