
## Part 3. Model Building: Backwards Pass
We are ready to complete the function that computes the backward pass of
our model!
You should start by reviewing the lecture slides on backpropagation.
One difference between the slides and our implementation here is that the
slides express the computations required to obtain the gradients of the
loss for a *single data point*.
Our implementation of backpropagation, however, is further vectorized to
compute gradients of the loss for a *batch of multiple data points*.
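To see what this vectorization looks like, suppose the $N$ examples in a batch
are stacked as rows, so that the $i$-th rows of matrices $\bar{Z}$ and $H$ hold
$\overline{{\bf z}}^{(i)}$ and ${\bf h}^{(i)}$ for example $i$ (a common
convention, though it is an assumption here; your starter code may use a
different layout). Then, for instance, the per-example outer products that make
up the weight gradient $\overline{W^{(2)}} = \overline{{\bf z}}\,{\bf h}^T$
derived below accumulate into a single matrix multiplication:
\begin{align*}
\overline{W^{(2)}} = \sum_{i=1}^{N} \overline{{\bf z}}^{(i)} \left({\bf h}^{(i)}\right)^T = \bar{Z}^T H
\end{align*}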
We begin by applying the backpropagation algorithm to the forward-pass
steps from earlier. Recall that our model's forward pass is as follows:
\begin{align*}
{\bf x_a} &=\textrm{the one-hot vector for word 1}\\
{\bf x_b} &=\textrm{the one-hot vector for word 2}\\
{\bf x_c} &=\textrm{the one-hot vector for word 3}\\
{\bf v_a} &=W^{(word)}{\bf x_a}\\
{\bf v_b} &=W^{(word)}{\bf x_b}\\
{\bf v_c} &=W^{(word)}{\bf x_c}\\
{\bf v} &=\textrm{concatenate}({\bf v_a},{\bf v_b},{\bf v_c})\\
{\bf m} &=W^{(1)}{\bf v}+{\bf b}^{(1)}\\
{\bf h} &=\textrm{ReLU}({\bf m})\\
{\bf z} &=W^{(2)}{\bf h}+{\bf b}^{(2)}\\
{\bf y} &=\textrm{softmax}({\bf z})\\
L &=\mathcal{L}_\textrm{Cross-Entropy}({\bf y},{\bf t})
\end{align*}
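To keep the shapes straight, here is a minimal NumPy sketch of this forward
pass in the batched setting. The variable names (`W_word`, `W1`, `b1`, `W2`,
`b2`), the examples-as-rows layout, and the shapes are assumptions for
illustration and may not match the starter code; in particular, the embedding
matrix is stored here as `(vocab_size, embedding_dim)` so that each one-hot
product $W^{(word)}{\bf x}$ becomes a row lookup.

```python
import numpy as np

def forward(X, W_word, W1, b1, W2, b2):
    """Batched forward pass for N contexts of 3 words each (assumed shapes).

    X      : (N, 3) integer word indices
    W_word : (vocab_size, embedding_dim) -- one row per word
    W1, b1 : (hidden_dim, 3*embedding_dim), (hidden_dim,)
    W2, b2 : (vocab_size, hidden_dim), (vocab_size,)
    """
    # Indexing rows of W_word implements the one-hot products W^(word) x.
    v_a, v_b, v_c = W_word[X[:, 0]], W_word[X[:, 1]], W_word[X[:, 2]]
    v = np.concatenate([v_a, v_b, v_c], axis=1)   # (N, 3*embedding_dim)
    m = v @ W1.T + b1                             # (N, hidden_dim)
    h = np.maximum(m, 0.0)                        # ReLU
    z = h @ W2.T + b2                             # (N, vocab_size)
    # Numerically stable softmax along the vocabulary axis.
    e = np.exp(z - z.max(axis=1, keepdims=True))
    y = e / e.sum(axis=1, keepdims=True)
    return v, m, h, y
```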
Following the steps discussed in this week's lecture, we should get
the following backward-pass computation (verify this yourself!):
\begin{align*}
\overline{{\bf z}} &={\bf y}-{\bf t}\\
\overline{W^{(2)}} &=\overline{{\bf z}}\,{\bf h}^T \\
\overline{{\bf b}^{(2)}} &=\overline{{\bf z}}\\
\overline{{\bf h}} &={W^{(2)}}^T\overline{{\bf z}}\\
\overline{{\bf m}} &=\overline{{\bf h}}\circ \textrm{ReLU}'({\bf m})\\
\overline{W^{(1)}} &=\overline{{\bf m}}\,{\bf v}^T \\
\overline{{\bf b}^{(1)}} &=\overline{{\bf m}}\\
\overline{{\bf v}} &={W^{(1)}}^T \overline{{\bf m}}\\
\overline{{\bf v_a}} &=\dots \\
\overline{{\bf v_b}} &=\dots \\
\overline{{\bf v_c}} &=\dots \\
\overline{W^{(word)}} &=\dots
\end{align*}
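Under the same assumed shapes and names as the forward sketch above, these
per-example equations turn into the batched sketch below: weight and bias
gradients sum over the batch, while the error signals keep one row per
example. The final four quantities are intentionally left out, since
computing them is the task that follows.

```python
def backward(T, v, m, h, y, W1, W2):
    """Batched backward pass; T is the (N, vocab_size) one-hot target matrix.

    Returns gradients of the cross-entropy loss summed over the batch
    (some implementations average instead, which scales these by 1/N).
    """
    z_bar = y - T                 # (N, vocab_size)
    W2_bar = z_bar.T @ h          # sums z_bar h^T over the batch
    b2_bar = z_bar.sum(axis=0)
    h_bar = z_bar @ W2            # (N, hidden_dim)
    m_bar = h_bar * (m > 0)       # ReLU'(m) is 1 where m > 0, else 0
    W1_bar = m_bar.T @ v
    b1_bar = m_bar.sum(axis=0)
    v_bar = m_bar @ W1            # (N, 3*embedding_dim)
    # va_bar, vb_bar, vc_bar and the W^(word) gradient are the task below.
    return W1_bar, b1_bar, W2_bar, b2_bar, v_bar
```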
**Task**: What is the error signal $\overline{{\bf v_a}}$?
How does this quantity relate to $\overline{{\bf v}}$?
To answer this question, reason about the scalars that make up the elements of
$\overline{{\bf v}}$. Which of these scalars also appear in $\overline{{\bf v_a}}$?
Express your answer by computing `va_bar` (representing the quantity $\overline{{\bf v_a}}$)
given `v_bar` (representing the quantity $\overline{{\bf v}}$).
