Question: Complete a regression analysis on the data set flight price.csv . The dataset contains information on flights for Indian airline companies. The goal is to

Complete a regression analysis on the data set flight price.csv. The dataset contains information on flights for Indian airline companies. The goal is to predict flight prices, which is the last column in the data set.
Review the notes found in full_classification.html to learn how to preprocess the data.
Apply the following steps.
Delete the two columns 'Unnamed: 0' and 'flight'.
Split the dataset using the code
flight_train, flight_test = train_test_split(flight, random_state =2, train_size =250000, shuffle = True)
Show statistical values for numerical attributes.
Show frequency values for categorical attributes.
Preprocess the training data similarly to what was done in full_classification.html.
Train a regression model on the training data. Use the algorithm RandomForestRegressor(random_state=1).
Apply the same preprocessing on the test data.
Make predictions on the test data.
Output the log mean squared error: np.log(np.mean(np.square(y_pred - y_test)))
The value I obtain is 15.8.
 Complete a regression analysis on the data set flight price.csv. The

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!