Question: This problem is slightly different from the one I found, and I don't understand how the difference changes the answer. Please help with this problem.

Suppose we wish to write a procedure that computes the inner product of two vectors u and v. An abstract version of the function has a CPE of 14-18 with x86-64 for different types of integer and floating-point data. By doing the same sort of transformations we did to transform the abstract program combine1 into the more efficient combine4, we get the following code:

/* Inner product. Accumulate in temporary */
void inner4(vec_ptr u, vec_ptr v, data_t *dest)
{
    long i;
    long length = vec_length(u);
    data_t *udata = get_vec_start(u);
    data_t *vdata = get_vec_start(v);
    data_t sum = (data_t) 0;

    for (i = 0; i < length; i++) {
        sum = sum + udata[i] * vdata[i];
    }
    *dest = sum;
}
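The routine above relies on vec_ptr, vec_length, and get_vec_start, which the question does not define. The following is a minimal sketch for compiling and trying it out, assuming a simple length-plus-array vector layout and data_t = double. The struct layout and the new_vec and free_vec helpers are assumptions made for illustration, not something stated in the problem.

```c
#include <assert.h>
#include <stdlib.h>

typedef double data_t;   /* assumption: the problem also covers integer types */

/* Assumed vector layout: a length plus a heap-allocated array. */
typedef struct {
    long len;
    data_t *data;
} vec_rec, *vec_ptr;

/* Hypothetical helper: allocate a zero-filled vector of len elements. */
vec_ptr new_vec(long len)
{
    assert(len >= 0);
    vec_ptr v = malloc(sizeof(vec_rec));
    if (!v)
        return NULL;
    v->len = len;
    v->data = NULL;
    if (len > 0) {
        v->data = calloc((size_t) len, sizeof(data_t));
        if (!v->data) {
            free(v);
            return NULL;
        }
    }
    return v;
}

/* Hypothetical helper: release a vector created by new_vec. */
void free_vec(vec_ptr v)
{
    if (v) {
        free(v->data);
        free(v);
    }
}

long vec_length(vec_ptr v) { return v->len; }
data_t *get_vec_start(vec_ptr v) { return v->data; }

/* Inner product. Accumulate in temporary */
void inner4(vec_ptr u, vec_ptr v, data_t *dest)
{
    long i;
    long length = vec_length(u);
    data_t *udata = get_vec_start(u);
    data_t *vdata = get_vec_start(v);
    data_t sum = (data_t) 0;

    for (i = 0; i < length; i++) {
        sum = sum + udata[i] * vdata[i];
    }
    *dest = sum;
}
```

To try it, create two vectors of equal length with new_vec, fill them through get_vec_start, and call inner4 with the address of a data_t result. Note that inner4 takes its length from u only, so v must be at least as long as u.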

Our measurements show that this function has CPEs of 1.50 for integer data and 3.00 for floating-point data. For data type double, the x86-64 assembly code for the inner loop is as follows:

Inner loop of inner4. data_t = double, OP = *
udata in %rbp, vdata in %rax, sum in %xmm0
i in %rcx, limit in %rbx

.L15:                                    Loop:
  vmovsd  0(%rbp,%rcx,8), %xmm1          get udata[i]
  vmulsd  (%rax,%rcx,8), %xmm1, %xmm1    multiply by vdata[i]
  vaddsd  %xmm1, %xmm0, %xmm0            add to sum
  addq    $1, %rcx                       increment i
  cmpq    %rbx, %rcx                     compare i to limit
  jne     .L15                           if !=, goto loop

Figure 5.12

                  Integer                      Floating point
Operation         Latency  Issue  Capacity     Latency  Issue  Capacity
Addition          1        1      4            3        1      1
Multiplication    3        1      1            5        1      2
Division          3-30     3-30   1            3-15     3-15   1

Assume that the functional units have the characteristics listed in Figure 5.12.

A. Diagram how this instruction sequence would be decoded into operations and show how the data dependencies between them would create a critical path of operations, in the style of Figures 5.13 and 5.14 (data-flow graph).

B. For data type double, what lower bound on the CPE is determined by the critical path?

C. Assuming similar instruction sequences for the integer code as well, what lower bound on the CPE is determined by the critical path for integer data?

D. Explain how the two floating-point versions can have CPEs of 3.00, even though the multiplication operation requires either 5 clock cycles
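One way to approach parts B through D is to ask which operation in the loop actually depends on the previous iteration. The sketch below is an illustration under stated assumptions, not the textbook's official solution. It assumes the multiplier is fully pipelined (issue time 1, as in Figure 5.12), so that the only loop-carried dependency on sum is the addition. The function name critical_path_cpe and the enum constants are hypothetical names introduced here.

```c
#include <assert.h>

/* Latencies in clock cycles, copied from Figure 5.12. */
enum {
    INT_ADD_LATENCY = 1,
    INT_MUL_LATENCY = 3,
    FP_ADD_LATENCY  = 3,
    FP_MUL_LATENCY  = 5
};

/*
 * Lower bound on CPE set by the loop-carried dependency chain of
 *     sum = sum + udata[i] * vdata[i];
 *
 * Only the add reads the previous value of sum. The multiply uses just
 * the two loaded elements, so (assuming a pipelined multiplier that can
 * start a new operation every cycle) successive multiplies overlap and
 * their latency does not accumulate from one iteration to the next.
 */
int critical_path_cpe(int add_latency, int mul_latency)
{
    assert(add_latency > 0 && mul_latency > 0);
    (void) mul_latency;   /* deliberately unused: not on the carried chain */
    return add_latency;
}
```

Under these assumptions, critical_path_cpe(FP_ADD_LATENCY, FP_MUL_LATENCY) gives 3 for double, which matches the measured CPE of 3.00, and critical_path_cpe(INT_ADD_LATENCY, INT_MUL_LATENCY) gives 1 for integer data, which is below the measured 1.50 and therefore only a lower bound. The same reasoning suggests an answer to part D: the 5-cycle multiply sits off the critical path, so it adds latency to each individual element but does not limit throughput.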
