Ex 5.5: Function optimization. Consider the function f(x, y) = x^2 + 20y^2 shown in Figure 5.63a. Begin by solving for the following (a worked sketch follows this list):
- Calculate ∇f, i.e., the gradient of f.
- Evaluate the gradient at x = -20, y = 5.
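As a short worked sketch of this part (straightforward partial derivatives of the given function):

```latex
% Gradient of f(x, y) = x^2 + 20 y^2
\nabla f(x, y) =
\begin{pmatrix} \partial f / \partial x \\ \partial f / \partial y \end{pmatrix}
=
\begin{pmatrix} 2x \\ 40y \end{pmatrix},
\qquad
\nabla f(-20, 5) = \begin{pmatrix} -40 \\ 200 \end{pmatrix}.
```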
Implement some of the common gradient descent optimizers, which should take you from the starting point x = -20, y = 5 to near the minimum at x = 0, y = 0. Try each of the following optimizers (a code sketch of all three follows below):
- Standard gradient descent.
- Gradient descent with momentum, starting with the momentum term as μ = 0.99.
- Adam, starting with decay rates of β1 = 0.9 and β2 = 0.999.
Play around with the learning rate α. For each experiment, plot how x and y change over time, as shown in Figure 5.63b.
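One minimal NumPy sketch of the three update rules, using the hyperparameter values given above; the helper names (f, grad_f, gradient_descent, momentum, adam) are illustrative choices, not prescribed by the exercise:

```python
import numpy as np

def f(p):
    """f(x, y) = x^2 + 20 y^2, with p = [x, y]."""
    return p[0] ** 2 + 20.0 * p[1] ** 2

def grad_f(p):
    """Analytic gradient: (2x, 40y)."""
    return np.array([2.0 * p[0], 40.0 * p[1]])

def gradient_descent(p0, lr, steps):
    """Standard gradient descent; returns the trajectory of iterates."""
    p = np.array(p0, dtype=float)
    traj = [p.copy()]
    for _ in range(steps):
        p = p - lr * grad_f(p)
        traj.append(p.copy())
    return np.array(traj)

def momentum(p0, lr, steps, mu=0.99):
    """Gradient descent with a heavy-ball momentum term mu."""
    p = np.array(p0, dtype=float)
    v = np.zeros_like(p)
    traj = [p.copy()]
    for _ in range(steps):
        v = mu * v - lr * grad_f(p)
        p = p + v
        traj.append(p.copy())
    return np.array(traj)

def adam(p0, lr, steps, beta1=0.9, beta2=0.999, eps=1e-8):
    """Adam with bias-corrected first and second moment estimates."""
    p = np.array(p0, dtype=float)
    m = np.zeros_like(p)
    v = np.zeros_like(p)
    traj = [p.copy()]
    for t in range(1, steps + 1):
        g = grad_f(p)
        m = beta1 * m + (1.0 - beta1) * g
        v = beta2 * v + (1.0 - beta2) * g ** 2
        m_hat = m / (1.0 - beta1 ** t)
        v_hat = v / (1.0 - beta2 ** t)
        p = p - lr * m_hat / (np.sqrt(v_hat) + eps)
        traj.append(p.copy())
    return np.array(traj)
```

Each helper returns the full trajectory as a (steps + 1, 2) array, so the x and y columns can be plotted directly against the step index.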
How do the optimizers behave differently? Is there a single learning rate that makes all the optimizers converge towards x = 0, y = 0 in under 200 steps? Does each optimizer monotonically trend towards x = 0, y = 0?

[Figure 5.63 Function optimization: (a) the contour plot of f(x, y) = x^2 + 20y^2, with the function being minimized at (0, 0); (b) ideal gradient descent optimization that quickly converges towards the minimum at x = 0, y = 0.]
Would batch normalization help in this case?
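One possible driver for these experiments, assuming the helpers sketched above: it runs each optimizer for 200 steps from (-20, 5) and plots the x and y trajectories with matplotlib. The learning rate shown is only a starting guess to tune.

```python
import matplotlib.pyplot as plt

start, steps, lr = [-20.0, 5.0], 200, 0.01  # lr is a starting guess; play with it

# Run each optimizer and keep its trajectory of (x, y) iterates.
runs = {
    "gradient descent": gradient_descent(start, lr, steps),
    "momentum (mu=0.99)": momentum(start, lr, steps),
    "adam (b1=0.9, b2=0.999)": adam(start, lr, steps),
}

fig, axes = plt.subplots(1, 2, figsize=(10, 4))
for name, traj in runs.items():
    axes[0].plot(traj[:, 0], label=name)  # x over time
    axes[1].plot(traj[:, 1], label=name)  # y over time
axes[0].set(xlabel="step", ylabel="x")
axes[1].set(xlabel="step", ylabel="y")
axes[0].legend()
plt.show()
```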
Note: the following exercises were suggested by Matt Deitke.
