Question: Newer processors such as Intels i7 Sandy Bridge include support for AVX vector/multimedia instructions. Write a dense matrix multiply function using single-precision values and compile
Newer processors such as Intel’s i7 Sandy Bridge include support for AVX vector/multimedia instructions. Write a dense matrix multiply function using single-precision values and compile it with different compilers and optimization flags. Linear algebra codes using Basic Linear Algebra Subroutine (BLAS) routines such as SGEMM include optimized versions of dense matrix multiply. Compare the code size and performance of your code to that of BLAS SGEMM. Explore what happens when using double-precision values and DGEMM.
Step by Step Solution
3.38 Rating (170 Votes )
There are 3 Steps involved in it
Solution St... View full answer
Get step-by-step solutions from verified subject matter experts
Document Format (2 attachments)
1782_61d57f25d821f_820115.pdf
180 KBs PDF File
1782_61d57f25d821f_820115.docx
120 KBs Word File
