Question: Trivago is a German technology company that is essentially a hotel price comparison website. It claims to be the world’s largest online hotel search site.

As part of the information on its website, Trivago displays the Trivago Rating Index (TRI), a number between 0 and 100 for every hotel (with 100 being the highest rating and 0 the lowest).

In the data provided are over 4,500 text comments from Trivago for various hotels and lodges. Also in the data is a variable titled “Score High/Low” that translates to a “1” if the TRI is above 49 and a “0” if the TRI is 49 or below.
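The labeling rule can be written out as a short Python sketch. The function name `score_high_low` is hypothetical; only the cutoff of 49 and the 0 to 100 range come from the problem statement.

```python
def score_high_low(tri: float) -> int:
    """Return 1 if the Trivago Rating Index is above 49, otherwise 0."""
    if not 0 <= tri <= 100:
        raise ValueError("TRI must lie between 0 and 100")
    return 1 if tri > 49 else 0


# A TRI of exactly 49 falls in the low group; 50 is the first high value.
labels = [score_high_low(t) for t in (12, 49, 50, 87)]
print(labels)  # [0, 0, 1, 1]
```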

Use a sample of 500 of the text comments to attempt to classify a hotel as either a “1” or “0.” After mining the text, you will need to apply a classification model to the SVD-derived concept document matrix. The target for the classification model will be the “Score High/Low” variable.
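One possible way to set up the workflow described above is sketched here in Python with scikit-learn. This is an assumption about tooling, not the textbook's prescribed software. The helper names `sample_rows` and `build_model`, the TF-IDF weighting, the logistic regression classifier, and the number of SVD concepts are all illustrative choices, and the comments are invented placeholders rather than the Trivago data.

```python
import random

from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline


def sample_rows(comments, labels, n=500, seed=0):
    """Draw up to n comment/label pairs without replacement."""
    rng = random.Random(seed)
    idx = rng.sample(range(len(comments)), min(n, len(comments)))
    return [comments[i] for i in idx], [labels[i] for i in idx]


def build_model(n_concepts=2):
    """Term weighting -> SVD concept matrix -> classifier."""
    return make_pipeline(
        TfidfVectorizer(stop_words="english"),
        TruncatedSVD(n_components=n_concepts, random_state=0),
        # class_weight="balanced" is one way to handle the scarce "0" class.
        LogisticRegression(class_weight="balanced", max_iter=1000),
    )


# Invented placeholder comments; replace with the real data set.
comments = [
    "Clean room and friendly staff",
    "Great location, lovely breakfast",
    "Dirty bathroom and rude reception",
    "Comfortable bed, helpful staff",
    "Noisy room, broken shower, terrible service",
    "Wonderful view and excellent breakfast",
    "Friendly staff and a great pool",
    "Excellent service and clean rooms",
]
labels = [1, 1, 0, 1, 0, 1, 1, 1]  # hypothetical "Score High/Low" values

X, y = sample_rows(comments, labels, n=500)
model = build_model(n_concepts=2).fit(X, y)
preds = model.predict(X)
```

With the real data, the number of concepts would normally be much larger than 2, and the model should be evaluated on held-out comments rather than on the rows used for fitting.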

Be aware that there are few negative reviews, and so the naive model would be to simply rate every hotel as a “1.” Try to estimate a model using the text comments alone that does a better job than a naive model. Use standard diagnostic statistics to evaluate the model (C11P8).
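To see why accuracy alone flatters the naive model, the following Python sketch computes a few common diagnostic statistics from a confusion matrix. The `diagnostics` helper is hypothetical, the label vector is invented for illustration, and the exercise may intend other statistics as well.

```python
def diagnostics(y_true, y_pred):
    """Accuracy, sensitivity and specificity for 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "balanced_accuracy": (sensitivity + specificity) / 2,
    }


# Invented example: nine high-rated hotels and one low-rated hotel.
y_true = [1] * 9 + [0]
naive_pred = [1] * len(y_true)  # the naive model rates every hotel "1"

naive = diagnostics(y_true, naive_pred)
print(naive["accuracy"], naive["specificity"], naive["balanced_accuracy"])
# 0.9 0.0 0.5
```

In this made-up example the naive model reaches 90% accuracy yet never identifies a low-rated hotel, so a text-based model should be judged on specificity or balanced accuracy as well as on raw accuracy.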
