Question: In this problem we will build our intuition for constructing optimal decision boundaries. Consider a natural language processing ( NLP ) task where we are
In this problem we will build our intuition for constructing optimal decision boundaries.
Consider a natural language processing NLP task where we are trying to determine whether an Amazon
book review is positive or negative. We have checked each review to see if they contain the words def
initelygreat and bestdisregarding capitalization which we encoded into a feature vector in the
following way: x x x xT where xi is if the word exists in a review and otherwise in the order
definitelygreat and best This is a form of text representation known as a bag of words model.
Our training dataset consists of the following reviews and corresponding ratings yi represents the label
for review i yi is for positive reviews and for negative reviews:
Review yi
My Professor said this book was great, but I was definitely disappointed
My sister Radhika loved this book! So great! Its definitely the best gift.
This author is the best. Definitely my favorite.
Just bad. So so bad. Will definitely never read again.
Table : Dataset for Question
For example, the review My sister Radhika loved this book! So great! Its the best gift. would be encoded
in the bag of words model as x T We aim to find parameters theta b such that for all xi in our
training set signtheta xi yi
apt Provide a mathematical representation of the feature space ie all possible feature vectors
bpt Provide a mathematical representation of the label space ie all possible output classes for this
classification problem.
cpts If b ie no explicit offset would it be possible to learn such a theta Provide an example of
such theta or explain why its not possible.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
