Question: Regression Models 1. Create a python file named myregressor.py. Import the following package. import pickle import numpy as np from sklearn import linear_model import sklearn.metrics

Regression Models

1. Create a python file named myregressor.py. Import the following package.

import pickle

import numpy as np

from sklearn import linear_model

import sklearn.metrics as sm

import matplotlib.pyplot as plt

2. Add the following lines. Read these lines and explain their purpose?

input_file = ' regressor_data.txt'

data = np.loadtxt(input_file, delimiter=',')

X, y = data[:, :-1], data[:, -1]

num_training = int(0.8 * len(X))

num_test = len(X) - num_training

X_train, y_train = X[:num_training], y[:num_training]

X_test, y_test = X[num_training:], y[num_training:]

3. Add the following lines. What is the purpose for these added lines?

regressor = linear_model.LinearRegression()

regressor.fit(X_train, y_train)

y_test_pred = regressor.predict(X_test)

4. Add the following lines. Run the program. Save the plot diagram to your local computer and insert the diagram below.

plt.scatter(X_test, y_test, color='green')

plt.plot(X_test, y_test_pred, color='black', linewidth=4)

plt.xticks(())

plt.yticks(())

plt.show()

5. Explain what have been drawn in the graph?

6. Modify the above code to display in a same diagram the scatter plots of (1) training data set in blue, (2) testing data set in green, and predicted data set in red. Please show your code and insert the diagram you saved.

7. Add the following lines and run your program. Please show the printout.

print("Linear regressor performance:")

print("Mean absolute error =", round(sm.mean_absolute_error(y_test, y_test_pred), 2))

print("Mean squared error =", round(sm.mean_squared_error(y_test, y_test_pred), 2))

print("Median absolute error =", round(sm.median_absolute_error(y_test, y_test_pred), 2))

print("Explain variance score =", round(sm.explained_variance_score(y_test, y_test_pred), 2))

print("R2 score =", round(sm.r2_score(y_test, y_test_pred), 2))

8. Use the equations to explain what are mean_absolute_error and mean_squared_error?

9. From the provided document, learn what is explained variation and what is R squared?

10. Add the following lines and run your program. What is the printout?

output_model_file = 'myregressor.pkl'

with open(output_model_file, 'wb') as f:

pickle.dump(regressor, f)

with open(output_model_file, 'rb') as f:

regressor_model = pickle.load(f)

y_test_pred_new = regressor_model.predict(X_test)

print(" New mean absolute error =", round(sm.mean_absolute_error(y_test, y_test_pred_new), 2))

11. Read these above lines and consider what the intent of these lines?

12. According to the previous labs, consider how to use model_selection to split the training and testing data set. Answer the following questions: (1) Which library package should be imported? (2) Which function is used for splitting the data set? (3) Write the code to replace the last four lines in the previous question No.2. (4) Run your code and show the printout only.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Logistic Regression 1. Create a python file named mylogistic-0.py. Import the following package. import numpy as np from sklearn import linear_model import matplotlib.pyplot as plt 2. Add the...

Perceptron and Simple Neural Network 1. Create a python file named myperceptron.py. Import the following packages. import numpy as np import matplotlib.pyplot as plt import neurolab as nl 2. Load the...

Support Vector Machine 1. Create a python file named mysvm.py. Import the following packages. import numpy as np import matplotlib.pyplot as plt from sklearn import preprocessing from sklearn.svm...

Tips: In order to work on this lab, you have to get some software packages such as numpy and sklearn installed on your computer. In python environment (non-anaconda), here is the installation steps...

i want this python code in C# import csv import pandas as pd import numpy as np from sklearn.naive_bayes import GaussianNB from sklearn.model_selection import train_test_split from sklearn import...

PLEASE RESPOND COMPLETE ANSWER I WILL RATE Problem #2: Data processing In this problem, you will be working with a sample of data recorded from an electrocardiogram (ECG) amplifier. An ECG is a test...

I'm running into problems with questions following this initial question. I don't know where I'm getting it wrong. And I also don't know how to print the correlation coefficient for the data. This is...

Data Science, Python, Jupyter Notebook I have a term project for my Capstone class in Data Science. Below is the syllabus, dataset, and the Jupiter Notebook. I am creating a Classification model to...

Answer using sci-kit learn here is the dataset https://www.kaggle.com/rush4ratio/video-game-sales-with-ratings In this part, you will work with scikit-learn, an industry standard package for machine...

Python language please. Thank you! #%% md # Setup First, let's import a few common modules, ensure MatplotLib plots figures inline and prepare a function to save the figures. We also check that...

Brett is a 25-year-old researcher at Texas Parks and Wildlife. He makes $60,000 per year. Your firm uses an end age of 95 and a top-down ratio of 78%. Brett has 1 kid and wants to have another. He'd...

Matt injured Buddy in an automobile accident. The court awarded Buddy $ 30,000 in damages, but Matt was only able to pay Buddy $ 12,000. They both then considered the matter closed. Under these...

I am looking for the correct answer to this general accounting problem using valid accounting standards. Assume that the current ratio for Omega Corporation is 3.5, its quick ratio is 1.8, and its...

Compared with half a century ago, adoption has become _ _ _ _ _ _ _ _ _ common, but it is more open and acceptabl e , so we probably discuss it _ _ _ _ _ _ _ . fill in the blanks more or much less or...

KEY QUESTION Assume that in a particular year the natural rate of unemployment is 5 percent and the actual rate of unemployment is 9 percent. Use Okuns law to determine the size of the GDP gap in...

KEY QUESTION Complete the following table: a. Show the consumption and saving schedules graphically. b. Find the break-even level of income. Explain how it is possible for households to dissave at...

LAST WORD What is the central economic idea humorously illustrated in Art Buchwalds piece, Squaring the Economic Circle? How does the central idea relate to recessions, on the one hand, and vigorous...