Question: Problem 2 : Consider the following computation loop x i + 1 = a x i - b Y i which is the inner loop

Problem 2:
Consider the following computation loop xi+1=axi-bYi which is the inner loop in a numerical
algorithmic process. For numerical convergence, this loop is supposed to run for a large number of iterations.
The constants a and b are initialized in float registers.
Assume the pipeline data dependent latencies between instructions (WRITE to READ operands) are given
by the following Table. For example, LD F0, O(R1) and MULTF F0,F0,F2 have a RAW dependency with
latency (load slot)1 slot. Also, the machine uses float arithmetic units (ADDF and MULTF) which are
pipelined and embedded in the instruction pipeline. You may consider 4-stage float pipes. Furthermore,
delayed branching is available.
a) Unroll the above loop as many times as necessary to schedule it without any delays, collapsing the
loop overhead instructions.
b) Briefly show the scheduling of the entire loop iterations.
c) Comment if there are other unrollings feasible.
Problem 2 : Consider the following computation

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!