Question: 2 - Consider the following code, which represents the operation Y = ax + Y for a vector length of 100.

2 - Consider the following code, which represents the operation Y = ax + Y for a vector length of 100. Assume the pipeline latencies shown below and a 1-cycle delay branch that is resolved in the ID stage. In addition, the pipeline uses branch forwarding that forwards the result of an ALU operation from the MEM stage to the ID stage (i.e., MEM-to-ID forwarding).
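For orientation, the operation Y = ax + Y can be written as a plain element-wise loop. The following is a minimal Python sketch of that computation only, not the assembly loop the question asks about; the function name `daxpy` and the sample values are illustrative assumptions.

```python
def daxpy(a, x, y):
    """Update y in place so that y[i] = a * x[i] + y[i] for every index i."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    for i in range(len(y)):
        # One loop iteration: load x[i] and y[i], multiply, add, store y[i].
        y[i] = a * x[i] + y[i]
    return y


# Sample data (assumed for illustration): vector length 100, as in the question.
n = 100
x = [float(i) for i in range(n)]
y = [1.0] * n
daxpy(2.0, x, y)
```

Each iteration performs two loads, one multiply, one add, and one store, plus the index update and branch that the pipeline must also execute.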

Latencies of FP operations

(a) Show how this loop would execute without any scheduling. Maximize the performance of this code by applying both instruction reordering (also known as pipeline scheduling) and delay branch techniques. Ignoring the startup delays and assuming the loop executes 100 times, determine the number of cycles required to execute the code before and after the optimizations. Do not be concerned about what happens after the loop.

(b) Unroll the loop once (i.e., make two copies) to schedule it without stalls and show the instruction schedule. Again, assuming the loop executes 100 times, determine the number of cycles required to execute the code before and after unrolling.
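Unrolling once means the loop body holds two copies of the element update, so the index update and branch are paid once per two elements. The following Python sketch shows that transformation at source level only; it is not the scheduled assembly the question asks for, and the name `daxpy_unrolled2` and the even-length restriction are assumptions made for illustration.

```python
def daxpy_unrolled2(a, x, y):
    """Compute y[i] = a * x[i] + y[i] with the loop body unrolled once (two copies)."""
    n = len(y)
    if len(x) != n:
        raise ValueError("x and y must have the same length")
    if n % 2 != 0:
        # Assumption for this sketch: no cleanup loop, so the length must be even.
        raise ValueError("this sketch requires an even vector length")
    for i in range(0, n, 2):
        # Copy 1 and copy 2 are independent, so a scheduler can interleave them.
        y[i] = a * x[i] + y[i]
        y[i + 1] = a * x[i + 1] + y[i + 1]
    return y


# Sample data (assumed for illustration): vector length 100, as in the question.
x = [float(i) for i in range(100)]
y = [1.0] * 100
daxpy_unrolled2(2.0, x, y)
```

For a vector of length 100 this loop runs 50 times instead of 100. The two copies do not depend on each other, which is what gives a scheduler independent instructions to place in the slots that would otherwise be stalls.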


