Question: Problem 4 : You experiment with a GPU to compute the following multiplication C = A B , the elements of A , B ,

Problem 4: You experiment with a GPU to compute the following multiplication C=AB, the elements of A,B, and C are all with the same bit-width. Please answer the following questions (please show detailed steps.)(20 pts + Bonus 10 pts).
A=[x1,1cdotsx1,lvdotsddotsvdotsxm,1cdotsxm,l] and B=[y1,1cdotsy1,nvdotsddotsvdotsyl,1cdotsyl,n]
a) If one thread is used to computer one element in matrix C, what types of Arithmetic Logic Unit (ALU) operators will be used to support the thread operation? (5 pts)
b) For a single-thread operation. Please point out the number of operations for each type of ALU operation. (5 pts)
c) If we compute on a SISD computer, what is the total number of operations required for each type of ALU operation to compute the entire matrix C?(10 pts)
d) Based on the observation from the answer above, when executing on GPU, assuming we have sufficient hardware resources, please write a pseudocode for computing matrix C. The code should clearly demonstrate the data movement and proper configuration of kernels and threading to optimize the parallelism. (Bonus 10 pts)
Problem 4 : You experiment with a GPU to compute

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Accounting Questions!