Question: I would like to ask a question about how to wriate a funcion to transpose a 64*64 matrix with less misses. this is the function

I would like to ask a question about how to wriate a funcion to transpose a 64*64 matrix with less misses.

this is the function i'm about to write

void transpose_submit(size_t M, size_t N, double A[N][M], double B[M][N], double *tmp){}

Performance (26 pts) For each matrix size, the performance of your transpose submit function is evaluated by using LLVM-based instrumentation to extract the address trace for your function, and then using the reference simulator to replay this trace on a cache with parameters s 5, E 1, b 6). Using the reference cache simulator, each transpose function will be assigned some number of clock cycles m A cache miss is worth 100 clock cycles, while a cache hit is worth 4. Your performance score for each matrix size will scale linearly with m up to some threshold. The scores are computed as: 32 x 32 10 points if m 35,000 0 points if m 45,000 64 x 64 10 points if m 150,000,0 points if m 200,000 63 x 65 6 points if m 280,000, 0 points if m 350,000 For example, a solution for the 32 x 32 matrix with 1764 hits and 284 misses (m 1764 x 4 284 x 100 35456) would receive 9.5 of the possible 10 points You can optimize your code specifically for the three cases in the performance evaluation. In particular, it is perfectly OK for your function to explicitly check for the matrix sizes and implement separate code optimized for each case

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

After engaging with Money Quest: Choose Your Challenge, reflect on how financial literacy impacts real-life decision-making. Choose one of the following prompts to guide your discussion: Based on...

I am doing a project for my c programming class, and was hoping I could get some help. The Project: Dynamic Arrays and File I/O with Matrix Operations In this program we will create a program that...

Python and most Python libraries are free to download or use, though many users use Python through a paid service. Paid services help IT organizations manage the risks associated with the use of...

) Explain the term overloading in the context of Java constructors and methods. [2 marks] (b) Without describing the details of either, outline the relationship between the Java methods...

Please help me to solve this question Comment: I want to solve it with the method in the description below + using array .. thank you so much .. OUTPUT: For this assignment, your mission is to write...

Briefly describe ASCII and Unicode and draw attention to any relationship between them. [3 marks] (b) Briefly explain what a Reader is in the context of reading characters from data. [3 marks] A...

Prolog You are approached to compose a Prolog program to work with twofold trees. Your code shouldn't depend on any library predicates and you ought to expect that the mediator is running without...

e can be reused to make things simpler. Where sensible , copy and paste the text from above so that it matches! ( Ware the spacing though - that's always one that can trip you up . ) You are...

According to the annual report record of Royal bank of Scotland(showed via pictures just below the questions) and answer the following questions: 1.How does the bank address ethical concerns? 2.Where...

What are the biggest ah-ha! moments from Oracy Development? 6 English-Language Oracy Development Learning Outcomes After reading this chapter, you should be able to ... . Describe the basics of...

Robbins Companys cost and production data for two recent months included the following: .:. Required a. Separately calculate the rental cost per unit and the utilities cost per unit for both March...

An incident beam of photons is scattered through 100.0; the wavelength of the scattered photons is 124.65 pm. What is the wavelength of the incident photons?

It is very conflicting. It says "the before tax profits to rise by 3 5 0 , 0 0 0 per year." If we are to ignore taxes, then the annual cash inflow is 3 5 0 , 0 0 0 plus the depreciation tax shield....

It is desired to find the dissociation constant of acetic acid by conductivity measurement. The laboratory temperature is 2 0 \ deg C . The conductivity values of different concentrations of acid...

Assume that the banking system has total reserves of $100 billion. Assume also that required reserves are 10 percent of checking deposits and that banks hold no excess reserves and households hold no...

As shown in Figure 3, the overall labor-force participation rate of men declined between 1970 and 2000. At the same time, the labor-force participation rate of women increased sharply. This overall...

The Bureau of Labor Statistics announced that in February 2008, of all adult Americans, 145,993,000 were employed, 7,381,000 were unemployed, and 79,436,000 were not in the labor force. Use this...