Question:

Consider the superpipelined CPU design shown below, in which instruction fetch takes two cycles (IF1 and IF2), data loads and stores take two cycles (ME1 and ME2), and execution takes two cycles (EX1 and EX2). Branches are handled in EX2 and are always predicted untaken by hardware.

Consider the following program, which searches an area of memory and counts the number of times a memory word is equal to a key word:

SEARCH:   LW    R5, 0(R3)
          SUB   R6, R5, R2
          BNEZ  R6, NOMATCH
          ADDI  R1, R1, #1
NOMATCH:  ADDI  R3, R3, #4
          BNE   R4, R3, SEARCH
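
As a reading aid, the loop's behaviour can be sketched in Python. This is only an illustration under assumptions not stated in the question: the register roles (R1 = match count, R2 = key, R3 = current address, R4 = end address) are inferred from the listing, R1 is assumed to start at 0, and the function name and the dictionary-based memory are hypothetical.

```python
def count_matches(memory, start, end, key):
    """Mirror the SEARCH loop; memory maps a byte address to a word.

    Assumed register roles (inferred, not given in the question):
    R1 = match count, R2 = key, R3 = current address, R4 = end address.
    """
    count = 0                    # R1, assumed to start at 0
    addr = start                 # R3
    while True:                  # the loop body runs at least once
        word = memory[addr]      # LW   R5, 0(R3)
        if word - key == 0:      # SUB  R6, R5, R2 ; BNEZ R6, NOMATCH
            count += 1           # ADDI R1, R1, #1
        addr += 4                # NOMATCH: ADDI R3, R3, #4
        if addr == end:          # BNE  R4, R3, SEARCH falls through when equal
            break
    return count


# Hypothetical test data: four words starting at address 0x100.
words = [7, 3, 7, 9]
memory = {0x100 + 4 * i: w for i, w in enumerate(words)}
result = count_matches(memory, 0x100, 0x100 + 4 * len(words), key=7)
print(result)
```

The "match" path executes all six instructions; the "no match" path skips the ADDI that increments R1 because BNEZ is taken.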

Branches are always predicted untaken and, if needed, are taken in EX2. Hardware support for branches is included in all cases. Consider several possible pipeline interlock designs for data hazards and answer the following questions for each loop iteration except the last.

a) Assume first that the pipeline has no forwarding unit and no hazard detection unit. Values are not even forwarded inside the register file. Rewrite the code, inserting NOOPs wherever needed so that it executes correctly.

b) Assume no forwarding at all, but a hazard detection unit that stalls instructions in ID to avoid hazards. How many clocks does it take to execute one iteration of the loop (1) on a match and (2) on no match?

c) Assume full forwarding and a hazard detection unit that stalls instructions in ID to avoid hazards. How many clocks does it take to execute one iteration of the loop (1) on a match and (2) on no match?

d) Identify basic blocks (using instruction numbers). Is it possible to save cycles by local optimizations? Why?
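
Parts (a) to (c) depend on the read-after-write (RAW) dependences in the loop, and part (d) on its basic blocks. The Python sketch below lists both. It rests on assumptions: instructions are numbered 1 to 6 in listing order (the question mentions instruction numbers but the listing here carries none), the tuple encoding and function names are hypothetical, and only dependences within one iteration are reported. It does not compute cycle counts, which depend on the pipeline diagram.

```python
# Each entry: (label, mnemonic, destination register, source registers,
#              branch target label). Encoding is illustrative only.
PROGRAM = [
    ("SEARCH",  "LW",   "R5", ["R3"],       None),
    (None,      "SUB",  "R6", ["R5", "R2"], None),
    (None,      "BNEZ", None, ["R6"],       "NOMATCH"),
    (None,      "ADDI", "R1", ["R1"],       None),
    ("NOMATCH", "ADDI", "R3", ["R3"],       None),
    (None,      "BNE",  None, ["R4", "R3"], "SEARCH"),
]


def raw_dependences(program):
    """Return (producer, consumer, register) triples within one iteration.

    Instruction numbers are 1-based. Loop-carried dependences (for example
    R3 written by instruction 5 and read by instruction 1 of the next
    iteration) are deliberately not reported.
    """
    deps = []
    last_writer = {}
    for i, (_, _, dest, srcs, _) in enumerate(program, start=1):
        for reg in srcs:
            if reg in last_writer:
                deps.append((last_writer[reg], i, reg))
        if dest is not None:
            last_writer[dest] = i
    return deps


def basic_blocks(program):
    """Split the program into basic blocks using the usual leader rule:
    the first instruction, every branch target, and every instruction
    that follows a branch each start a new block."""
    labels = {label: i
              for i, (label, *_rest) in enumerate(program, start=1) if label}
    leaders = {1}
    for i, (_, _, _, _, target) in enumerate(program, start=1):
        if target is not None:
            leaders.add(labels[target])
            if i < len(program):
                leaders.add(i + 1)
    bounds = sorted(leaders) + [len(program) + 1]
    return [list(range(lo, hi)) for lo, hi in zip(bounds, bounds[1:])]


deps = raw_dependences(PROGRAM)
blocks = basic_blocks(PROGRAM)
print(deps)
print(blocks)
```

Under this numbering, each reported dependence has its producer and consumer in the same basic block, which is the situation a local (within-block) optimization would have to work with.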
