Please help..
Adaline and Logistic Classifiers. You are required to implement Adaline and Logistic Regression classifiers from scratch.
All the code you need to implement is quoted between ### begin coding and ### end coding.
Between these markers, placeholders are provided for you. If, say, a variable J is a numerical value for you to compute, a placeholder like J=0 or J=0.0 is there; if, say, a variable delta is an object, a placeholder like delta=None is used.
In [2]:
%matplotlib inline
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
Read the Iris data: We will use the Iris flower dataset in this project. The Iris flower data set, or Fisher's Iris data set, is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper "The use of multiple measurements in taxonomic problems" as an example of linear discriminant analysis. The data set consists of 50 samples from each of three species of Iris (Iris setosa, Iris virginica, and Iris versicolor). Four features were measured from each sample: the length and the width of the sepals and petals, in centimeters. Based on the combination of these four features, Fisher developed a linear discriminant model to distinguish the species from each other.
df = pd.read_csv('https://archive.ics.uci.edu/ml/'
'machine-learning-databases/iris/iris.data', header=None)
df.iloc[0::10]
Out[3]:
| | 0 | 1 | 2 | 3 | 4 |
|---|---|---|---|---|---|
| 0 | 5.1 | 3.5 | 1.4 | 0.2 | Iris-setosa |
| 10 | 5.4 | 3.7 | 1.5 | 0.2 | Iris-setosa |
| 20 | 5.4 | 3.4 | 1.7 | 0.2 | Iris-setosa |
| 30 | 4.8 | 3.1 | 1.6 | 0.2 | Iris-setosa |
| 40 | 5.0 | 3.5 | 1.3 | 0.3 | Iris-setosa |
| 50 | 7.0 | 3.2 | 4.7 | 1.4 | Iris-versicolor |
| 60 | 5.0 | 2.0 | 3.5 | 1.0 | Iris-versicolor |
| 70 | 5.9 | 3.2 | 4.8 | 1.8 | Iris-versicolor |
| 80 | 5.5 | 2.4 | 3.8 | 1.1 | Iris-versicolor |
| 90 | 5.5 | 2.6 | 4.4 | 1.2 | Iris-versicolor |
| 100 | 6.3 | 3.3 | 6.0 | 2.5 | Iris-virginica |
| 110 | 6.5 | 3.2 | 5.1 | 2.0 | Iris-virginica |
| 120 | 6.9 | 3.2 | 5.7 | 2.3 | Iris-virginica |
| 130 | 7.4 | 2.8 | 6.1 | 1.9 | Iris-virginica |
| 140 | 6.7 | 3.1 | 5.6 | 2.4 | Iris-virginica |
Plotting the Iris data
In [4]:
# extract sepal length and petal length
X = df.iloc[:, [0, 2]].values
# plot data
plt.scatter(X[:50, 0], X[:50, 1],
color='red', marker='o', label='setosa')
plt.scatter(X[50:100, 0], X[50:100, 1],
color='green', marker='x', label='versicolor')
plt.scatter(X[100:, 0], X[100:, 1],
color='blue', marker='x', label='virginica')
plt.xlabel('sepal length [cm]')
plt.ylabel('petal length [cm]')
plt.legend(loc='upper left')
# plt.savefig('images/02_06.png', dpi=300)
plt.show()
In [22]:
# select versicolor and virginica
y = df.iloc[50:, 4].values
y = np.where(y == 'Iris-versicolor', -1, 1)
X = df.iloc[50:, [0,2]].values
In [23]:
#show some data
print(X.shape, y.shape)
print(X[::25, :])
print(y[::25])
(100, 2) (100,)
[[7.  4.7]
 [6.6 4.4]
 [6.3 6. ]
 [7.2 6. ]]
[-1 -1  1  1]
Implementing an adaptive linear neuron (ADALINE)
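As a hedged reminder of the standard Adaline formulation (the assignment does not spell the formulas out, so this is an assumption about what the placeholders below expect): with X1 the bias-augmented data matrix, the net input, sum-of-squared-errors cost, gradient, and weight update are

```latex
z = X_1 w, \qquad
J(w) = \tfrac{1}{2}\sum_{i=1}^{n}\bigl(y^{(i)} - z^{(i)}\bigr)^{2}, \qquad
\nabla_w J = -X_1^{\top}(y - z), \qquad
w \leftarrow w - \eta\,\nabla_w J .
```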
In [38]:
class AdalineGD(object):
    def __init__(self, eta=0.01, n_iter=50, random_state=None):
        self.eta = eta
        self.n_iter = n_iter
        self.random_state = random_state

    def fit(self, X, y):
        rgen = np.random.RandomState(self.random_state)
        n, m = X.shape
        X1 = self.stack1(X)
        self.w_ = rgen.normal(loc=0.0, scale=0.01, size=X1.shape[1])
        self.cost_ = []
        for i in range(self.n_iter):
            # Please note that the "activation" method has no effect
            # in the code since it is simply an identity function. We
            # could write `output = self.net_input(X)` directly instead.
            # The purpose of the activation is more conceptual, i.e.,
            # in the case of logistic regression (as we will see later),
            # we could change it to a sigmoid function to implement a
            # logistic regression classifier.
            cost, d = self.J_Delta(X1, y)
            ### begin coding
            self.w_ = None
            ### end coding
            self.cost_.append(cost)
        return self

    def stack1(self, X):
        """Stack a column of constant 1s to the left of X"""
        n, m = X.shape
        return np.hstack((np.ones((n, 1)), X))

    def net_input(self, X1):
        """Calculate net input"""
        ### begin coding
        netinput = None
        ### end coding
        return netinput

    def activation(self, X1):
        """Compute linear activation"""
        return X1

    def J_Delta(self, X1, y):
        """Compute the cost function and the gradient function"""
        n, m = X1.shape
        ### begin coding
        cost = 0.0
        d = None
        ### end coding
        return cost, d

    def predict(self, X):
        """Return class label after unit step"""
        X1 = self.stack1(X)
        ### begin coding
        output = None
        ### end coding
        return np.where(output >= 0.0, 1, -1)
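If it helps, here is one possible way to fill in the `### begin coding` blocks, written as a separate standalone sketch so the graded scaffold above stays untouched. The method names and shapes match the scaffold; the specific rule used (batch gradient descent on the sum-of-squared-errors cost) is my assumption about what is expected, and the class name `AdalineGDSketch` is my own.

```python
# Hedged sketch: one plausible completion of the AdalineGD placeholders,
# assuming batch gradient descent on the sum-of-squared-errors cost.
# Relies on numpy (imported above) and the AdalineGD scaffold being defined.
class AdalineGDSketch(AdalineGD):
    def net_input(self, X1):
        """Net input z = X1 @ w_ (X1 already carries the bias column of 1s)."""
        return X1.dot(self.w_)

    def J_Delta(self, X1, y):
        """Sum-of-squared-errors cost and its gradient with respect to w_."""
        errors = y - self.activation(self.net_input(X1))
        cost = 0.5 * (errors ** 2).sum()
        d = -X1.T.dot(errors)                  # gradient of the cost
        return cost, d

    def fit(self, X, y):
        rgen = np.random.RandomState(self.random_state)
        X1 = self.stack1(X)
        self.w_ = rgen.normal(loc=0.0, scale=0.01, size=X1.shape[1])
        self.cost_ = []
        for _ in range(self.n_iter):
            cost, d = self.J_Delta(X1, y)
            self.w_ = self.w_ - self.eta * d   # gradient-descent step
            self.cost_.append(cost)
        return self

    def predict(self, X):
        """Class label after a unit step on the linear activation."""
        output = self.activation(self.net_input(self.stack1(X)))
        return np.where(output >= 0.0, 1, -1)
```

For example, `AdalineGDSketch(n_iter=20, eta=0.01).fit(X_std, y)` (using the standardized features defined further below) should produce a steadily decreasing `cost_` list.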
In [39]:
fig, ax = plt.subplots(nrows=1, ncols=2, figsize=(10, 4))
ada1 = AdalineGD(n_iter=20, eta=0.1).fit(X, y)
ax[0].plot(range(1, len(ada1.cost_) + 1), np.log10(ada1.cost_), marker='o')
ax[0].set_xlabel('Epochs')
ax[0].set_ylabel('log(Sum-squared-error)')
ax[0].set_title('Adaline - Learning rate 0.1')
ada2 = AdalineGD(n_iter=100, eta=0.03).fit(X, y)
ax[1].plot(range(1, len(ada2.cost_) + 1), ada2.cost_, marker='o')
ax[1].set_xlabel('Epochs')
ax[1].set_ylabel('Sum-squared-error')
ax[1].set_title('Adaline - Learning rate 0.03')
# plt.savefig('images/02_11.png', dpi=300)
plt.show()
In [40]:
from matplotlib.colors import ListedColormap
def plot_decision_regions(X, y, classifier, resolution=0.02):
    print(X.shape)
    # setup marker generator and color map
    markers = ('s', 'x', 'o', '^', 'v')
    colors = ('red', 'blue', 'lightgreen', 'gray', 'cyan')
    cmap = ListedColormap(colors[:len(np.unique(y))])

    # plot the decision surface
    x1_min, x1_max = X[:, 0].min() - 1, X[:, 0].max() + 1
    x2_min, x2_max = X[:, 1].min() - 1, X[:, 1].max() + 1
    xx1, xx2 = np.meshgrid(np.arange(x1_min, x1_max, resolution),
                           np.arange(x2_min, x2_max, resolution))
    Z = classifier.predict(np.array([xx1.ravel(), xx2.ravel()]).T)
    Z = Z.reshape(xx1.shape)
    plt.contourf(xx1, xx2, Z, alpha=0.3, cmap=cmap)
    plt.xlim(xx1.min(), xx1.max())
    plt.ylim(xx2.min(), xx2.max())

    # plot class samples
    for idx, cl in enumerate(np.unique(y)):
        plt.scatter(x=X[y == cl, 0],
                    y=X[y == cl, 1],
                    alpha=0.8,
                    c=colors[idx],
                    marker=markers[idx],
                    label=cl,
                    edgecolor='black')
In [41]:
plot_decision_regions(X, y, classifier=ada1)
plt.title('Adaline - Gradient Descent')
plt.xlabel('sepal length [cm]')
plt.ylabel('petal length [cm]')
plt.legend(loc='upper left')
plt.tight_layout()
# plt.savefig('images/02_14_1.png', dpi=300)
plt.show()

plot_decision_regions(X, y, classifier=ada2)
plt.title('Adaline - Gradient Descent')
plt.xlabel('sepal length [cm]')
plt.ylabel('petal length [cm]')
plt.legend(loc='upper left')
plt.tight_layout()
# plt.savefig('images/02_14_1.png', dpi=300)
plt.show()
Standardize features
In [42]:
# standardize features
X_std = np.copy(X)
X_std[:, 0] = (X[:, 0] - X[:, 0].mean()) / X[:, 0].std()
X_std[:, 1] = (X[:, 1] - X[:, 1].mean()) / X[:, 1].std()
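One practical caveat, not stated in the assignment: if you later score new samples, they should be standardized with the training data's mean and standard deviation rather than their own. A minimal sketch (the new-sample values below are made up):

```python
# Hypothetical example: reuse the training mean/std when standardizing new samples.
mu, sigma = X.mean(axis=0), X.std(axis=0)
X_new = np.array([[6.1, 4.7]])      # made-up sepal/petal lengths in cm
X_new_std = (X_new - mu) / sigma    # same transform that produced X_std above
```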
In [43]:
ada = AdalineGD(n_iter=20, eta=1.)
ada.fit(X_std, y)
plot_decision_regions(X_std, y, classifier=ada)
plt.title('Adaline - Gradient Descent')
plt.xlabel('sepal length [standardized]')
plt.ylabel('petal length [standardized]')
plt.legend(loc='upper left')
plt.tight_layout()
# plt.savefig('images/02_14_1.png', dpi=300)
plt.show()

plt.plot(range(1, len(ada.cost_) + 1), ada.cost_, marker='o')
plt.xlabel('Epochs')
plt.ylabel('Sum-squared-error')
plt.tight_layout()
# plt.savefig('images/02_14_2.png', dpi=300)
plt.show()
Implement Logistic Regression From Scratch
Note that unlike Adaline, the default binary class labels for logistic regression are 0 and 1 rather than -1 and 1.
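If you reuse the versicolor/virginica subset from above, the targets therefore need to be re-encoded to 0/1 before fitting the logistic classifier; a minimal sketch (the variable name `y01` is my own, not part of the assignment):

```python
# Hedged sketch: 0/1 targets for LogisticGD (versicolor -> 0, virginica -> 1).
y01 = np.where(df.iloc[50:, 4].values == 'Iris-versicolor', 0, 1)
```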
In [44]:
class LogisticGD(object):
    def __init__(self, eta=0.01, n_iter=50, random_state=None):
        self.eta = eta
        self.n_iter = n_iter
        self.random_state = random_state

    def fit(self, X, y):
        rgen = np.random.RandomState(self.random_state)
        n, m = X.shape
        X1 = self.stack1(X)
        self.w_ = rgen.normal(loc=0.0, scale=0.01, size=X1.shape[1])
        self.cost_ = []
        for i in range(self.n_iter):
            cost, d = self.J_Delta(X1, y)
            ### begin coding
            self.w_ += None
            ### end coding
            self.cost_.append(cost)
        return self

    def stack1(self, X):
        """Stack a column of constant 1s to the left of X"""
        n, m = X.shape
        return np.hstack((np.ones((n, 1)), X))

    def net_input(self, X1):
        """Calculate net input"""
        ### begin coding
        netinput = None
        ### end coding
        return netinput

    def activation(self, X1):
        """Compute sigmoid (logistic) activation"""
        ### begin coding
        result = None
        ### end coding
        return result

    def J_Delta(self, X1, y):
        """Compute the cost function and the gradient function"""
        n, m = X1.shape
        ### begin coding
        cost = 0.0
        d = None
        ### end coding
        return cost, d

    def predict_proba(self, X):
        """Return probability"""
        X1 = self.stack1(X)
        ### begin coding
        prob = None
        ### end coding
        return prob

    def predict(self, X):
        """Return class label after unit step"""
        ### begin coding
        prob = None
        ### end coding
        return np.where(prob >= 0.5, 1, 0)
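Analogously to the Adaline sketch above, here is one plausible completion of the `LogisticGD` placeholders, assuming a sigmoid activation and the negative log-likelihood (cross-entropy) cost minimized by batch gradient descent. The class name `LogisticGDSketch` and the clipping constant are my own choices; treat this as a reference sketch rather than the required answer.

```python
# Hedged sketch: one plausible completion of the LogisticGD placeholders.
# Relies on numpy (imported above) and the LogisticGD scaffold being defined.
class LogisticGDSketch(LogisticGD):
    def net_input(self, X1):
        """Net input z = X1 @ w_."""
        return X1.dot(self.w_)

    def activation(self, z):
        """Sigmoid activation, with z clipped for numerical stability."""
        return 1.0 / (1.0 + np.exp(-np.clip(z, -250, 250)))

    def J_Delta(self, X1, y):
        """Negative log-likelihood cost and its gradient (y in {0, 1})."""
        output = self.activation(self.net_input(X1))
        cost = -y.dot(np.log(output)) - (1 - y).dot(np.log(1 - output))
        d = -X1.T.dot(y - output)       # gradient of the cost w.r.t. w_
        return cost, d

    def fit(self, X, y):
        rgen = np.random.RandomState(self.random_state)
        X1 = self.stack1(X)
        self.w_ = rgen.normal(loc=0.0, scale=0.01, size=X1.shape[1])
        self.cost_ = []
        for _ in range(self.n_iter):
            cost, d = self.J_Delta(X1, y)
            self.w_ += -self.eta * d    # matches the scaffold's `self.w_ += ...`
            self.cost_.append(cost)
        return self

    def predict_proba(self, X):
        """Probability of class 1."""
        return self.activation(self.net_input(self.stack1(X)))

    def predict(self, X):
        """Class label 0/1 using a 0.5 threshold on the probability."""
        return np.where(self.predict_proba(X) >= 0.5, 1, 0)
```

With the 0/1 labels sketched earlier and the standardized features, something like `LogisticGDSketch(n_iter=1000, eta=0.01, random_state=1).fit(X_std, y01)` can be used to check that `cost_` trends downward and that `predict` separates the two species reasonably well.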