Question: Assume you have the following codevoid inner 4 ( vec _ ptr u , vec _ ptr v , data _ t * dest )

Assume you have the following codevoid inner

4 (

vec

_

ptr u

,

vec

_

ptr v

,

data

_

*

dest

)

{

int length

=

vec

_

length

(

)

;data

_

*

vdata

=

get

_

vec

_

start

(

)

;for

(

= 0

; i

length; i

+ +) {}

*

dest

=

sum;

}

and you modify the code to use

4 -

way loop unrolling and four parallel accumulators. Measurements for this function with the x

86 -

64

architecture shows it achieves a CPE of

2.0

for all types of data.

Assuming the model of the Intel i

7

architecture shown in class

(

one branch unit, two arithmetic units, one load and one store unit

),

the performance of this loop with any arithmetic operation can not get below

2.0

CPE because of

When the same

4 4

code is compiled for the IA

32

architecture, it achieves a CPE of

2.75,

worse than the CPE of

2.25

achieved

with just four

-

way unrolling. The mostly likely reason this occurs is because

Assume you have the following codevoid inner4(vec_ptr u, vec_ptr v, data_t

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Q3) You are given a data structure to store vectors as follows typedef int data t; typedef struct { long len; data t *data; vec_rec, *vec ptr; /*Return length of vector/ long vec_length(vec_ptr v) f...

answer the question clearly You are building a flight-control system for which a convincing safety case must be made. Would you assign the tasks of safety requirements engineering, test case...

Can you please do both A and B parts quick and right solutions Important Notice: If your submitted code is not working properly, i.e. throws error or fails in all test cases, your submission will be...

Can you please be quick and do both parts right and readable solutions Bonus part is not necessarily can you just do it the A and B your tank regeirmente caffially. Cinand lierk" Pheselrainis...

Can you please be quick and do part A and B only and report also Project Definition: In this project, you will implement common data structures such as vector and stack to EREF. Please, read each...

Need help with some accounting, document attached. 1. Activity-based costing requires four steps. Requirement R1. Rank the following steps in the order in which they would be completed. Number the...

PLEASE COMPLETE NO LATER THAN 10/14 @3:30PM Each question(1,2,& 3) must be a minimum of 200 words. Please EXPLAIN answers in FULL detail and make answers knowledgeable based off the attached reading,...

See if you can determine what APR you are charging a consumer loan customer if you grant the customer a loan for five years payable in monthly installments, and the customer must pay a finance charge...

You work for a consulting firm and have been given the assignment of deciding whether a particular company president is overpaid both in absolute terms and relative to presidents of comparable...

A theory that devermines the namber of market makees necersacy tor an efficeet market Need help? Review theie concept intrecg:

P-1) (100 Pts.) A chemical manufacturing company (CMC) has a contract for the procurement of the neccssaly chemicals from four suppliers. The chemicals purchased from Supplier A are priced at $20...

1. In what ways has flexible working revolutionised employment?

4. To what extent do you agree with some critics who have claimed that Richard Bransons statements on time off work for his employees is a publicity stunt?

2. What are the benefits and dis-benefits of flexible working to employers and employees?