Question: Assume that the vector reduction instruction is executed on the vector functional unit, similar to a vector add instruction. Show how the code sequence lays
Assume that the vector reduction instruction is executed on the vector functional unit, similar to a vector add instruction. Show how the code sequence lays out in convoys assuming a single instance of each vector functional unit. How many chimes will the code require? How many cycles per FLOP are needed, ignoring vector instruction issue overhead?
Step by Step Solution
3.25 Rating (151 Votes )
There are 3 Steps involved in it
18 chimes 4 results 15 FLOPS per result 1815 12 cycles per FLOP ... View full answer
Get step-by-step solutions from verified subject matter experts
