Question: For the SGEMM code developed above for the i7 processor, include the use of AVX intrinsics to improve the performance. In particular, try to vectorize
For the SGEMM code developed above for the i7 processor, include the use of AVX intrinsics to improve the performance. In particular, try to vectorize your code to better utilize the AVX hardware. Compare the code size and performance to the original code.
Step by Step Solution
3.39 Rating (165 Votes )
There are 3 Steps involved in it
When optimizing code the first efforts should be focused on vectorization The most fundamental way to efficiently utilize the resources in modern processors is to write code that can run in vector mod... View full answer
Get step-by-step solutions from verified subject matter experts
