Assume that you are given a dual-issue pipeline for RISC-V ISA. You can schedule ALU or...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Assume that you are given a dual-issue pipeline for RISC-V ISA. You can schedule ALU or branch instruction in the first issue slot, and load/store instruction in the second issue slot of the issue package. You are given the following sequence of instructions. Loop: lw add SW addi bne x7, 0(x10) x7, x7, x11 x7, 0(x10) x10, x10, 0x4 x10, x12, Loop #read from memory #update value #store it back to memory #update the address #Loop until the address is not equal to the content of x12 a) [10 points] Assume that the loop index for the given loop above is divisible by 3. Unroll the given loop 3 times. Use register renaming as necessary to avoid any name dependencies. Package 1 Package 2 Package 3 Package 4 Package 5 Package 6 b) [10 points] How would the unrolled loop in (a) be scheduled on the given static dual-issue pipeline? You may use the following template to show the schedule. 1st Slot (ALU or branch) 2nd Slot (load or store) c) [5 points] Calculate and report the IPC of the system after the loop unrolling, separately. How much speedup achieved after loop unrolling? Assume that you are given a dual-issue pipeline for RISC-V ISA. You can schedule ALU or branch instruction in the first issue slot, and load/store instruction in the second issue slot of the issue package. You are given the following sequence of instructions. Loop: lw add SW addi bne x7, 0(x10) x7, x7, x11 x7, 0(x10) x10, x10, 0x4 x10, x12, Loop #read from memory #update value #store it back to memory #update the address #Loop until the address is not equal to the content of x12 a) [10 points] Assume that the loop index for the given loop above is divisible by 3. Unroll the given loop 3 times. Use register renaming as necessary to avoid any name dependencies. Package 1 Package 2 Package 3 Package 4 Package 5 Package 6 b) [10 points] How would the unrolled loop in (a) be scheduled on the given static dual-issue pipeline? You may use the following template to show the schedule. 1st Slot (ALU or branch) 2nd Slot (load or store) c) [5 points] Calculate and report the IPC of the system after the loop unrolling, separately. How much speedup achieved after loop unrolling?
Expert Answer:
Answer rating: 100% (QA)
The question presents a code snippet that includes a loop and the task involves optimizing this loop for a dualissue pipeline in a RISCV instruction set architecture ISA processor ensuring that both i... View the full answer
Related Book For
Computer Organization and Design The Hardware Software Interface
ISBN: 978-0124077263
5th edition
Authors: David A. Patterson, John L. Hennessy
Posted Date:
Students also viewed these programming questions
-
QUIZ... Let D be a poset and let f : D D be a monotone function. (i) Give the definition of the least pre-fixed point, fix (f), of f. Show that fix (f) is a fixed point of f. [5 marks] (ii) Show that...
-
can someone solve this Modern workstations typically have memory systems that incorporate two or three levels of caching. Explain why they are designed like this. [4 marks] In order to investigate...
-
In the Akerlof example, the individuals are treated as indifferent to risk. What would you expect to see in these markets if individuals wanted to avoid risk? What if there were some risk lovers?
-
What business strategy recommendations would you make to Kaspar Rorsted, Henkel's CEO? Should the company redouble its efforts to broaden its leadership and cultural diversity? Should the company...
-
A cube 1 cm on a side is made up of one giant polyethylene molecule, having a density of 1.0 g/cm3 . (a) What is the molecular weight of this molecule? (b) Assuming an all trans conformation, what is...
-
Three parallel-plate capacitors each store the same amount of charge. Capacitor 1 has a plate area \(A\) and a plate separation \(d\). Capacitor 2 has a plate area \(2 A\) and a plate separation...
-
Information concerning a product produced by Ender Company appears here: Sales price per unit .................... $200 Variable cost per unit ................... $80 Total annual fixed manufacturing...
-
Part A A string that is 9.6 m long is tied between two posts and plucked. The string produces a wave that has a frequency of 320 Hz and travels with a speed of 192 m/s. How many full wavelengths of...
-
I need a summarization of this updated pro forma balance sheet for The Sweet Password: ASSETS Current Assets Cash Net accounts receivable Inventory Temporary investment Prepaid expenses Fixed Assets...
-
The standard coupling between railroad cars must be capable of withstanding the maximum tensile force exerted on any coupling in a given train. (a) If a locomotive is pulling ten cars and speeding...
-
Using a CPU that runs an operating system that uses EDF, try to get the CPU utilization as close to 100% as possible without failing. Try a variety of data arrival times to determine how sensitive...
-
Build a CAN bus monitoring system.
-
What interaction does a bathroom (spring) scale actually measure?
-
You throw a baseball straight up, and because of air resistance some of the ball's energy is converted to thermal energy during the whole trip. Which takes longer: the trip up or the trip down?
-
Target is considering opening mini-Target stores in airports across the United States. Target knows that 25% of the items in its store account for 70% of the store's sales and expects that this...
-
The following data are supplied for the common stocks of Nikola Corporation, Tesla, Inc. and General Motors: Nikola Corp (NKLA) Tesla Inc. (TSLA) Close Price ($) Close Price ($) 67.53 30.00 40.81...
-
Based on your answers to 3.35 and 3.36, does (3.41796875 10 -3 6.34765625 10 -3 ) 1.05625 10 2 = 3.41796875 10 -3 (6.34765625 10 -3 1.05625 10 2 )?
-
Translate the following C code to MIPS assembly code. Use a minimum number of instructions. Assume that the values of a, b, i, and j are in registers $s0, $s1, $t0, and $t1, respectively. Also,...
-
Assume that we would like to expand the MIPS register file to 128 registers and expand the instruction set to contain four times as many instructions. 1. How this would this affect the size of each...
-
Summer and winter constellations are different because (a) of the spin of Earth about its polar axis. (b) the night sky faces in opposite directions in summer and winter. (c) of the tilt of Earths...
-
The longest-lived stars are those of (a) low mass. (b) high mass. (c) intermediate mass. (d) infinite mass.
-
What is the star nearest Earth? (a) Proxima Centauri (b) Polaris (c) Mercury (d) the Sun
Study smarter with the SolutionInn App