Question: Please solve all parts In this exercise we compare the performance of 1 - issue and 2 - issue processors, taking into account program transformations

Please solve all parts

In this exercise we compare the performance of

1 -

issue and

2 -

issue processors, taking into account program transformations that can

be made to optimize for

2 -

issue execution. Problems in this exercise refer to the following loop

(

written in C

)

for

(i = 0

;

i j

;

i + = 2)

b [i] = a [i] - a [i + 1]

;

A compiler doing little or no optimization might produce the following MIPS assembly code:

addi

x 12, x 0, 0

jal ENT

TOP: slli

x 5, x 12, 3

add

x 6, x 10, x 5

x 7, 0 (x 6)

x 29, 8 (x 6)

sub

x 30, x 7, x 29

add

x 31, x 11, x 5

x 30, 0 (x 31)

addi

x 12, x 12, 2

ENT: bne

x 12, x 13,

TOP

The code above uses the following registers:

Assume the two

-

issue, statically scheduled processor for this exercise has the following properties:

One instruction must be a memory operation; the other must be an arithmetic

/

logic instruction or a branch.

The processor has all possible forwarding paths between stages

(

including paths to the ID stage for branch resolution

) .

The processor has perfect branch prediction.

Two instruction may not issue together in a packet if one depends on the other.

)

If a stall is necessary, both instructions in the issue packet must stall.

As you complete these exercises, notice how much effort goes into generating code that will produce a near

-

optimal speedup.

(

)

Draw a pipeline diagram showing how RISC

-

V code given above executes on the two

-

issue processor. Assume that the loop exits

after two iterations.

(

)

What is the speedup of going from a one

-

issue to a two

-

issue processor?

(

Assume the loop runs thousands of iterations.

)

(

)

Rearrange

/

rewrite the RISC

-

V code given above to achieve better performance on the one

-

issue processor. Hint: Use the

instruction "beqz $

s 1,

Done" to skip the loop entirely if

j = 0 .

(

)

Rearrange

/

rewrite the RISC

-

V code given above to achieve better performance on the two

-

issue processor.

(

Do not unroll the

loop, however.

)

(

)

Repeat Exercise

4.31.1,

but this time use your optimized code from Exercise

4.31.4 .

(

)

What is the speedup of going from a one

-

issue processor to a two

-

issue processor when running the optimized code from

Exercises

4.31.3

and

4.31.4 .

(

)

Unroll the RISC

-

V code from Exercise

4.31.3

so that each iteration of the unrolled loop handles two iterations of the original loop.

Then, rearrange

/

rewrite your unrolled code to achieve better performance on the one

-

issue processor. You may assume that

j

is a

multiple of

4 .

(

)

Unroll the RISC

-

V code from Exercise

4.31.4

so that each iteration of the unrolled loop handles two iterations of the original loop.

Then, rearrange

/

rewrite your unrolled code to achieve better performance on the two

-

issue processor. You may assume that

j

is a

multiple of

4 . (

Hint: Re

-

organize the loop so that some calculations appear both outside the loop and at the end of the loop. You

may assume that the values in temporary registers are not needed after the loop.

)

(

)

What is the speedup of going from a one

-

issue processor to a two

-

issue processor when running the unrolled, optimized code

from Exercises

4.31.7

and

4.31.8 ?

(

)

Repeat Exercises

4.31.8

and

4.31.9,

but this time assume the two

-

issue processor can run two arithmetic

/

logic instructions

together.

(

In other words, the first instruction in a packet can be any type of instruction, but the second must be an arithmetic or

logic instruction. Two memory operations cannot be scheduled at the same time.

)

Please solve all parts In this exercise we compare the performance

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

In this exercise we compare the performance of 1-issue and 2-issue processors, taking into account program transformations that can be made to optimize for 2-issue execution Problems in this exercise...

In this exercise we compare the performance of 1-issue and 2-issue processors, taking into account program transformations that can be made to optimize for 2-issue execution. Problems in this...

4.31 In this exercise we compare the performance of 1-issue and 2-issue processors, taking into account program transformations that can be made to optimize for 2-issue execution. Problems in this...

Exercise 4.28 In this exercise we compare the performance of 1-issue and 2-issue processors, taking into account program transformations that can be made to optimize for 2-issue execution. Problems...

Rearrange your code from 4.28.1 to achieve better performance on a 2-issue statically scheduled processor from Figure 4.69. Exercise 4.28.1 Translate this C code into MIPS instructions. Your...

Repeat 4.28.2, but this time use your MIPS code from 4.28.3. Exercise 4.28.3. Rearrange your code from 4.28.1 to achieve better performance on a 2-issue statically scheduled processor from Figure...

In this exercise we compare the performance of 1-issue and 2-issue processors, taking into account program transformations that can be made to optimize for 2-issue execution. Problems in this...

i (3 4i) (1 + 2i) hand. Give an exact answer, including square roots, if needed. (1) Find No extensive calculations required. Do not use Matlab/Octave, do it by (2) a) Find (1 - i)50 by first...

(a) Only a small amount (less than 0.01%) of the enol form of diethyl malonate is present at equilibrium. Write a structural formula for this enol. (b) Enol forms are present to the extent of about...

The journal voucher control system is said to be one of the most effective means of internal control when designed to fit a specific enterprise and properly administered. (a) Explain how the voucher...

Capitalism and Socialism: Case Study: Uber You have four tasks for your initial post. In order to present an organized post, address each one of these tasks in a separate paragraph and in the...

From a Comparable Worth Standpoint, what is the situation with regard to Federal Gender-based Employee Pay Equity?

Provide an example of how drilling down further into information can yield new results.

What do Dimensions represent in OLAP Cubes?