Question: Linear loop transformations are useful in reductions of cache misses and increase possible parallelization. Consider the following two loops. Which of these loops would yield
Linear loop transformations are useful in reductions of cache misses and increase possible parallelization. Consider the following two loops. Which of these loops would yield better performance? Why?
For I = 1, 1000
For J= 1, 1000
c(J) = c(J) + a(I,J)*b(J)
For J=1, 1000
For J=1, 1000
c(J) = c(J) + a(I,J)*b(J)
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
