Question: Q 1 . You are performing text mining on a customer review dataset containing 2 0 0 customer reviews. Answer the following questions: 1 .
Q You are performing text mining on a customer review dataset containing customer reviews. Answer the following questions:
Suppose each review was limited to no more than words. In the termdocument matrix, which dimension is more likely to be larger, the number of documents or the number of terms? Explain your choice in one sentence.
You are considering to use stemming or lemmatization for processing the review text. The term 'increasing' appeared in many reviews. What are the results of stemming and lemmatization of this term, respectively?
In addition to the review text data, each customer also provided a rating score, with star representing poor and star representing excellent. Suppose your text mining task is to predict ratings based on the customer reviews. Which of the three techniques below is NOT appropriate for your task? Choose only one answer.
i J decision tree algorithm
ii support vector regression
iii kmeans algorithm
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
