3. Consider the perceptron learning algorithm using (batch) gradient descent, discussed in class. We will assume that the data is defined using two features (x1 and x2). We are interested in learning the perceptron boundary by training on the following training data:
  x1     x2     y
 0.80   0.20    1
 0.90   0.25    1
 0.30   0.70   -1
 0.25   0.65   -1
Initialize the weight vector to all 1s and then run the gradient descent algorithm with learning rate eta = 0.2. After two updates of the weight vector, which of the following will be true? (Round values to two decimal places.) Assume that the intercept term is handled by absorbing it into the beginning of the weight vector, i.e., each input is augmented with a leading 1.
a) The gradient of the objective function with respect to the augmented weights, just before the second update is [-1.95, 0.14]
b) The squared loss value for the updated perceptron after second update on the training data is 2.05
c) The weight vector w after two iterations will be [-0.61, 0.31, 0.11]
d) The perceptron after second update makes 0 mistakes on the training data
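A minimal sketch of the batch update described above, assuming a squared-error objective J(w) = (1/2) * sum_i (y_i - w . x_i)^2 (the squared loss the options refer to) and the intercept absorbed as a leading 1 in each input; the exact objective and iteration convention used in the course may differ:

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

# Training data with the intercept feature prepended: x = [1, x1, x2].
X = [[1, 0.80, 0.20], [1, 0.90, 0.25],   # positive examples (y = +1)
     [1, 0.30, 0.70], [1, 0.25, 0.65]]   # negative examples (y = -1)
Y = [1, 1, -1, -1]

w = [1.0, 1.0, 1.0]   # weight vector initialized to all 1s (intercept first)
eta = 0.2             # learning rate

for step in range(2):  # two batch gradient-descent updates
    # Residuals e_i = y_i - w.x_i under the assumed squared-error objective.
    errors = [y - dot(w, x) for x, y in zip(X, Y)]
    # dJ/dw_j = -sum_i e_i * x_ij  (full batch gradient)
    grad = [-sum(e * x[j] for e, x in zip(errors, X)) for j in range(3)]
    w = [wi - eta * gi for wi, gi in zip(w, grad)]

# Count training mistakes made by the updated perceptron sign(w.x).
mistakes = sum(1 for x, y in zip(X, Y) if y * dot(w, x) <= 0)
print([round(wi, 2) for wi in w], mistakes)
```

Under this assumed objective, the first update lands on w ≈ [-0.61, 0.31, 0.11], which can be checked against option (c), and after the second update the mistake count can be checked against option (d).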
