Question: Your branch target buffer (and branch history buffer) each have 8 slots. Your CPU executes the following code fragment. Assume that prediction takes place in

Your branch target buffer (and branch history buffer) each have 8

Your branch target buffer (and branch history buffer) each have 8 slots. Your CPU executes the following code fragment. Assume that prediction takes place in the Fetch cycle, and that the result of a branch is actually known after the eXecute cycle. Assume full forwarding (MX, WX, WM) addi x12, x, 5 add x11, x0, x add x19, x0, x addi x2, x 150 // x190 LI: bge x11, x12, Exit s111 x13, x11, 3 add x14, x16, x13 ld x15, 0(x14) blt x15, x20, L2 sd x0 0(x14) add x19, x15, x19 //x16 address of array [0] 2 addi x15, x15, 1 sd x15, 0(x14) L3: addi x11, x11, 1 jal x L1 Exit: (a) Draw a pipeline diagram for the execution of the program, assuming a static predict-not-taken policy (yes, this means that the BTB and BHT aren't used... yet). Circle the stages in which the outcome of a branch is known. Make sure to include stalls due to branch misprediction. How many cycles are lost due to stalls? (b) Assume that you are using a 1-bit branch predictor, and redraw the pipeline diagram. How many cycles are lost due to stalls? Make sure that you incorporate the cycle in which the outcome of a branch is known, if necessary (c) Show the values read and written from the BTB and BHT (from 2b, and the cycles in which the read/write occurs, You need not show reads due to fetches of non-branch instructions, (d) Assume that you are using a 2-bit branch predictor, and redraw the pipeline diagram. How many cycles are lost due to stalls? Make sure that you incorporate the cycle in which the outcome of a branch is known, if necessary Show the values read and written from the BTB and BHT (from 2d, and the cycles in which the read/write occurs. You need not show reads due to fetches of non-branch instructions. (e) Your branch target buffer (and branch history buffer) each have 8 slots. Your CPU executes the following code fragment. Assume that prediction takes place in the Fetch cycle, and that the result of a branch is actually known after the eXecute cycle. Assume full forwarding (MX, WX, WM) addi x12, x, 5 add x11, x0, x add x19, x0, x addi x2, x 150 // x190 LI: bge x11, x12, Exit s111 x13, x11, 3 add x14, x16, x13 ld x15, 0(x14) blt x15, x20, L2 sd x0 0(x14) add x19, x15, x19 //x16 address of array [0] 2 addi x15, x15, 1 sd x15, 0(x14) L3: addi x11, x11, 1 jal x L1 Exit: (a) Draw a pipeline diagram for the execution of the program, assuming a static predict-not-taken policy (yes, this means that the BTB and BHT aren't used... yet). Circle the stages in which the outcome of a branch is known. Make sure to include stalls due to branch misprediction. How many cycles are lost due to stalls? (b) Assume that you are using a 1-bit branch predictor, and redraw the pipeline diagram. How many cycles are lost due to stalls? Make sure that you incorporate the cycle in which the outcome of a branch is known, if necessary (c) Show the values read and written from the BTB and BHT (from 2b, and the cycles in which the read/write occurs, You need not show reads due to fetches of non-branch instructions, (d) Assume that you are using a 2-bit branch predictor, and redraw the pipeline diagram. How many cycles are lost due to stalls? Make sure that you incorporate the cycle in which the outcome of a branch is known, if necessary Show the values read and written from the BTB and BHT (from 2d, and the cycles in which the read/write occurs. You need not show reads due to fetches of non-branch instructions. (e)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

2. [10 points] You may wish to refer to the RISC-V ISA manual (https://content.riscv. org/wp-content/uploads/2017/05/riscv-spec-v2.2.pdf). Your branch target buffer (and branch history buffer) each...

2. [10 points] Your branch target buffer (and branch history buffer) each have 8 slots. Your CPU executes the follovw ing code fragment. Assume that prediction takes place in the Fetch cycle, and...

Computer Architecture question: Your branch target buffer (and branch history buffer) each have 8 slots. Your CPU executes the follow- ing code fragment. Assume that prediction takes place in the...

Provide a summary technical report with your own words about Pipelined Execution which is also named as Instruction Level Parallelism, addressing mainly the following areas: 1. What is Pipelined...

1 . 2 Complete the timeline of the program assuming that you have an always - taken branch predictor. Assume that the branch prediction happens within the ID stage, and the full branch outcome is...

Question: (a) from a uniform pack Derive the reason components of a uniform quadratic B-spline by starting with a uniform pack vector. To put it another way, get N1,3 from the bundle vector, and a...

Provide a summary technical report with about Pipelined Execution which is also named as Instruction Level Parallelism, addressing mainly the following areas: 1. What is Pipelined Execution and its...

In C++ please. Assignment 4 This assignment is the first in a sequence of three. It is not strictly necessary to complete this one in order to do the other two, but the understanding you gain in...

The following code is for an LC3 Simulator written in C. The assignment is to complete the sections labeled FILL ME IN as described * Fill in code where there are FILL ME IN comments. * */ #include...

Johnson and Wilson were the principal shareholders in XYZ Corporation, located in the city of Jonesville, Wisconsin. This corporation was engaged in the business of manufacturing paper novelties,...

Solve for x. log x + log(x-15) = 2 Select the correct choice below and, if necessary, fill in the answer box to complete your choice. A. The solution (s) is/are x = [ (Type an integer or a simplified...

22. The figure shows a circle with centre O passing through points A and B. AC is a tangent to the circle at A and OBC is a straight line. Given that AC = 18 cm and BC = 12 cm, find (i) the radius of...

When products are sold, the account , Finished Goods Inventory Cost of Goods Sold Accounts Payable Work - in - Process Inventory is debited

2. What do threats to validity have to do with training evaluation? Identify internal and external threats to validity. Are internal and external threats similar? Explain.

3. What are the strengths and weaknesses of each of the following designs: posttest-only, pretest/posttest with comparison group, and pretest/posttest only?

4. Trainees and their managers provide estimates of training benefits.