( 3 0 points ) Consider the following instruction sequence running on the five stage pipelined processor beq x 1 1 , x 1 2 , Label sd x 1 5 , 0 ( 2 3 ) ld x 1 5 , 0 ( x 2 4 ) add 1 1 , 6 , 1 2 sub x 1 1 , x 1 3 , x 1 2 Assume x 1 1 x 1 2 Note that in the following questions, structural hazards are considered only in ( a ) and ( b ) ( a ) ( 1 0 points ) Assume the processor predicts each branch instruction to be not taken If we only have one memory ( for both instructions and data ) , there is a structural hazard every time we need to fetch an instruction in the same cycle in which another instruction accesses data To guarantee the processor to work correctly, this structural hazard must always be resolved in favor of the instruction that accesses data In other words, there is a hazard detection unit in the IF stage, and if a structural hazard occurs, the instruction in the IF stage needs to stall for that cycle What is the total execution time of this instruction sequence We have learned that data hazards can be eliminated by adding NOPs to the code, so can you do the same with this structural hazard and why ( b ) ( 5 points ) Assume we use the same processor in ( a ) What is the minimum number of cycles you can achieve by adjusting the order of the instructions without losing the correctness Also give the new sequence of instructions after re ordering ( c ) ( 5 points ) Assuming Stall on Branch ( i e , wait until the branch outcome is determined before fetching next instruction ) , what speedup is achieved on this instruction sequence if branch outcomes are determined in the ID stage, relative to the execution where branch outcomes are determined in the MEM stage ( d ) ( 5 points ) Assume the processor predicts each branch instruction to be not taken Also assume each individual pipeline stage of IF , ID , EX , MEM, and WB has the latency of 2 1 0 p s , 1 6 0 p s , 2 2 0 p s , 1 8 0 p s , and 1 0 0 p s , respectively If we change load store instructions to use a register ( without an offset ) as the address, these instructions no longer need to use the ALU As a result, MEM and EX stages can be overlapped and the pipeline has only 4 stages Assuming this change does not affect the clock period, what speedup is achieved in this instruction sequence compared to the original five stage one ( e ) ( 5 points ) Given the pipeline stage latencies in ( d ) , repeat the speedup calculation of ( d ) by considering the ( possible ) change in the clock period as follows When EX and MEM are done in a single stage ( called EX MEM stage ) , most of their work can be done in parallel As a result, the EX MEM stage now has a latency that is the larger of the original two, plus 2 5 p s needed for the work that could not be done in parallel Show all images Show all images Show all images done loading

The Answer is in the image, click to view ...

Question: ( 3 0 points ) Consider the following instruction sequence running on the five - stage pipelined processor: beq x 1 1 , x 1

(30

points

)

Consider the following instruction sequence running on the five

-

stage pipelined

processor:

beq x

11,

12,

Label

x 15, 0 (23)

x 15, 0 (x 24)

add

11, 6, 12

sub

x 11, x 13, x 12

Assume

x 11 x 12 .

Note that in the following questions, structural hazards are considered only in

(

)

and

(

) .

(

) (10

points

)

Assume the processor predicts each branch instruction to be not taken. If we only

have one memory

(

for both instructions and data

),

there is a structural hazard every time we

need to fetch an instruction in the same cycle in which another instruction accesses data. To

guarantee the processor to work correctly, this structural hazard must always be resolved in

favor of the instruction that accesses data. In other words, there is a hazard detection unit in

the IF stage, and if a structural hazard occurs, the instruction in the IF stage needs to stall for

that cycle. What is the total execution time of this instruction sequence? We have learned

that data hazards can be eliminated by adding NOPs to the code, so can you do the same with

this structural hazard and why?

(

) (5

points

)

Assume we use the same processor in

(

) .

What is the minimum number of cycles

you can achieve by adjusting the order of the instructions without losing the correctness?

Also give the new sequence of instructions after re

-

ordering.

(

) (5

points

)

Assuming Stall on Branch

(

.

.,

wait until the branch outcome is determined before

fetching next instruction

),

what speedup is achieved on this instruction sequence if branch

outcomes are determined in the ID stage, relative to the execution where branch outcomes are

determined in the MEM stage?

(

) (5

points

)

Assume the processor predicts each branch instruction to be not taken. Also assume

each individual pipeline stage of IF

,

,

,

MEM, and WB has the latency of

210 p s, 160

p s, 220 p s, 180 p s,

and

100 p s,

respectively. If we change load

/

store instructions to use a

(

without an offset

)

as the address, these instructions no longer need to use the ALU.

As a result, MEM and EX stages can be overlapped and the pipeline has only

4

stages.

Assuming this change does not affect the clock period, what speedup is achieved in this

instruction sequence compared to the original five

-

stage one?

(

) (5

points

)

Given the pipeline stage latencies in

(

),

repeat the speedup calculation of

(

)

considering the

(

possible

)

change in the clock period as follows. When EX and MEM are

done in a single stage

(

called EX

/

MEM stage

),

most of their work can be done in parallel. As

a result, the EX

/

MEM stage now has a latency that is the larger of the original two, plus

25 p s

needed for the work that could not be done in parallel.

(30 points) Consider the following instruction sequence running on the five-stage

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Computer Organization and Networks Practicals 2021/22 October 9, 2021 Computer Organization and Networks Practicals 2021/22 b68495714b Contents Contents 0 Introduction 3 0.1 Registration . . . . . ....

Consider the following instruction sequence that runs on our MIPS 5-stage pipeline: ori $5, $0, 1 or $6, $0, $0 sll $2, $0, 31 srl $2, $0, 3 add $8, $4, $4 lw $9 44($2) beq $6, $7, exit Assume that...

27. Critically evaluate the use of Internet or online advertising. Say whether you think this method is an effective communications tool or not. 28. Evaluate the potential marketplace for the Ice...

6. (2 points Completeness) Forwarding Unit Logic. Consider the following code being executed on a 5-stage MIPS pipelined processor (with a forwarding unit, as shown in the figure): addi $3, $1, 15...

3. [12 points] Consider the following pseudo assembly language code: I1: SUB R9- R3 R4 12: ADD R4 R5+R6; 13: LW R2 = MEM [R3 100]; 4: LW R2-MEM [R2+0] I5: SW MEM [R4 100] R2; 16: AND R2- R2 &R1 17:...

Portray in words what transforms you would have to make to your execution to some degree (a) to accomplish this and remark on the benefits and detriments of this thought.You are approached to compose...

"Fortran, Algol and Lisp invented most programming language concepts 50 years ago; adding the concept of object-orientation suffices to explain all programming languages to date". To what extent is...

3. Draw a pipeline execution diagram for the following diagram using the 5 stage pipelined processor. SUB r5, r4, r0 ADD r3, r2, r4 DIV r2, r5, r2 BEQ r8, #0,r10 ASH r12, r13, r14 4. If we use the...

Prolog You are approached to compose a Prolog program to work with twofold trees. Your code shouldn't depend on any library predicates and you ought to expect that the mediator is running without...

plz ans ASAP Question 10.0 points possible (graded, results hidden) Assume a pipelined processor with five pipeline stages where each stage takes one clock cycle. Further, assume that the processor...

For each of the following values for the MPC, determine the size of the simple spending multiplier and the total change in real GDP demanded following a $10 billion decrease in autonomous spending:...

The following two statements have been taken directly or with some modification from the accounting literature. Each of them is either taken out of context, involves circular reasoning, and/or...

DescriptionItemsA. Records and tracks the bondholders' names.B . Is unsecured; backed only by the issuer's credit standing.C . Maintains a separate asset account from which bondholders are paid at...

5. Develop a scenario comparing two PH programs and involving the use of a CBA.

A specialist rather than generalist understanding of managerial work. Management is seen as an element of a job, not the job itself, with the emphasis being placed on the specialism and the knowledge...

Neither bureaucratic nor authoritarian. Formal modes of behaviour, and the weakness of an informal system, does not mean a love of bureaucracy and concern for authority. Studies have suggested the...

Predominance of subject-based education in management training. Managers tend to be graduates, with possession of a doctorate common, and managers are typically qualified in law, economics or...