Lay out the following code sequence in convoys and compute the number of cycles the sequence takes, considering one load/store unit, one FP multiplier, one FP adder, and a vector register length of 64 elements


Question:

a) Lay out the following code sequence in convoys and compute the number of cycles the sequence takes, considering one load/store unit, one FP multiplier, one FP adder, and a vector register length of 64 elements.

    vld      v1, 100(x5)     // double precision vector load
    vld      v2, 200(x6)
    vadd.vv  v3, v1, v2      // double precision vector add (vector, vector)
    vlsd     v4, 0(x7), x4   // double precision vector load with stride
    vmul.vv  v5, v4, x5      // double precision vector mul (vector, vector)
    vsub.vx  v5, v5, v3      // double precision vector sub (vector, scalar)
    vsub.vv  v4, v4, v2
    vsd      v5, 200(x7)     // double precision vector store
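One way to explore the convoy layout is to group the instructions greedily in program order. The sketch below is not the expected exam answer. It assumes that chaining is available, so only structural hazards (two instructions needing the same functional unit) start a new convoy, and that start-up overhead is ignored, so each convoy costs one chime of about VL / lanes cycles. The unit names and the `build_convoys` and `estimated_cycles` helpers are hypothetical.

```python
# Hypothetical sketch: greedy convoy formation under structural hazards only.
# Assumptions (not stated in the question): chaining is allowed, start-up
# latency is ignored, and each convoy takes one chime of VL / lanes cycles.

# (instruction, functional unit it occupies), in program order
PROGRAM = [
    ("vld v1", "load_store"),
    ("vld v2", "load_store"),
    ("vadd.vv v3", "fp_add"),
    ("vlsd v4", "load_store"),
    ("vmul.vv v5", "fp_mul"),
    ("vsub.vx v5", "fp_add"),
    ("vsub.vv v4", "fp_add"),
    ("vsd v5", "load_store"),
]


def build_convoys(program):
    """Start a new convoy whenever the next instruction needs a busy unit."""
    convoys, current, busy = [], [], set()
    for name, unit in program:
        if unit in busy:
            convoys.append(current)
            current, busy = [], set()
        current.append(name)
        busy.add(unit)
    if current:
        convoys.append(current)
    return convoys


def estimated_cycles(program, vector_length=64, lanes=1):
    """One chime per convoy, each chime lasting VL / lanes cycles."""
    return len(build_convoys(program)) * (vector_length // lanes)


if __name__ == "__main__":
    for number, convoy in enumerate(build_convoys(PROGRAM), start=1):
        print(f"convoy {number}: {', '.join(convoy)}")
    print("estimated cycles:", estimated_cycles(PROGRAM))
```

Under these assumptions the greedy pass yields four convoys, or about 4 × 64 = 256 cycles. A different assumption about chaining or start-up time changes the count.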

b) For the code sequence of part a), now consider that there are two lanes. Lay out the same sequence in convoys and compute the cycles per FLOP.
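The cycles-per-FLOP arithmetic can be sketched as follows. This is a rough model under stated assumptions: each convoy takes VL / lanes cycles with start-up overhead ignored, and only the arithmetic instructions (the add, the multiply, and the two subtracts) count as floating-point operations. The convoy count is passed in as a parameter, and the value 4 used in the example is an illustration, not a claim about the correct layout. The function name is hypothetical.

```python
def cycles_per_flop(num_convoys, fp_instructions, vector_length=64, lanes=1):
    """Estimate cycles per FLOP, ignoring start-up overhead (an assumption)."""
    cycles_per_convoy = -(-vector_length // lanes)  # ceiling division
    total_cycles = num_convoys * cycles_per_convoy
    total_flops = fp_instructions * vector_length  # one FLOP per element
    return total_cycles / total_flops


if __name__ == "__main__":
    for lanes in (1, 2):
        # 4 convoys and 4 FP instructions are example inputs only
        print(lanes, "lane(s):", cycles_per_flop(4, 4, lanes=lanes))
```

Doubling the number of lanes halves the cycles per convoy in this model, so the cycles per FLOP halve as well.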


c) What features are available in vector processors to support the following:

   i. Conditional execution
   ii. Loading the non-zero elements of a sparse matrix
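As background, conditional execution is commonly handled with a vector mask (predicate) register, and sparse data with gather/scatter accesses driven by an index vector. The plain-Python sketch below only emulates those semantics element by element. The function names are hypothetical and do not correspond to any real instruction set.

```python
def masked_divide(a, b):
    """Emulate `if b[i] != 0: a[i] = a[i] / b[i]` using a vector mask."""
    mask = [bi != 0 for bi in b]  # a vector compare sets the mask bits
    # Elements whose mask bit is off keep their old value.
    return [ai / bi if m else ai for ai, bi, m in zip(a, b, mask)]


def gather(memory, indices):
    """Emulate an indexed (gather) load: result[i] = memory[indices[i]]."""
    return [memory[i] for i in indices]


def scatter(memory, indices, values):
    """Emulate an indexed (scatter) store into a copy of memory."""
    out = list(memory)
    for i, v in zip(indices, values):
        out[i] = v
    return out


if __name__ == "__main__":
    dense = [0.0, 3.0, 0.0, 0.0, 5.0, 0.0]
    nonzero_idx = [i for i, x in enumerate(dense) if x != 0.0]
    print(gather(dense, nonzero_idx))
```

In hardware the index vector would be held in a vector register and the loop over elements would be a single gather or scatter instruction.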

d) What is chaining in the context of vector architecture?
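To make the effect of chaining concrete, the sketch below compares a dependent multiply-then-add pair with and without chaining. The start-up latencies (7 cycles for the multiplier, 6 for the adder) are illustrative assumptions, not values given in the question, and the function names are hypothetical.

```python
def unchained_cycles(startups, vector_length):
    """Each dependent instruction waits for the previous one to finish fully."""
    return sum(startup + vector_length for startup in startups)


def chained_cycles(startups, vector_length):
    """Results are forwarded element by element, so the passes overlap."""
    return sum(startups) + vector_length


if __name__ == "__main__":
    startups = [7, 6]  # assumed multiplier and adder start-up latencies
    print("unchained:", unchained_cycles(startups, 64))
    print("chained:  ", chained_cycles(startups, 64))
```

With these assumed latencies and 64 elements, the unchained pair takes 141 cycles and the chained pair 77.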

e) What is meant by strided access while loading a vector from memory?

[2] [1] [1]
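Strided access means that consecutive vector elements come from memory locations separated by a fixed distance (the stride), as when reading one column of a row-major matrix. A small emulation follows, with hypothetical names and the stride expressed in elements rather than bytes:

```python
def strided_load(memory, base, stride, vector_length):
    """Emulate a strided vector load: element i comes from base + i * stride."""
    return [memory[base + i * stride] for i in range(vector_length)]


if __name__ == "__main__":
    rows, cols = 3, 4
    matrix = list(range(rows * cols))  # row-major: 0..11
    # Column 1 of a row-major matrix is a stride-`cols` access.
    print(strided_load(matrix, base=1, stride=cols, vector_length=rows))
```

A unit-stride load is the special case `stride=1`, where the elements are adjacent in memory.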

Write the MIPS code after unrolling the following MIPS loop twice, and explain the benefits of loop unrolling by calculating the CPI for the original code and for the twice-unrolled code. No rescheduling is required. You must show the steps of calculating the CPI; a final answer without steps will not receive credit. (8 points)

    Loop: lw   R2, 0(R1)
          add  R2, R2, R3
          sw   R2, 0(R1)
          addi R1, R1, -4
          bne  R1, R5, Loop
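The CPI comparison can be sketched as below. The question does not state the pipeline, so this assumes a classic five-stage pipeline with full forwarding, a one-cycle stall when an instruction uses the result of the immediately preceding `lw`, and a one-cycle branch penalty per iteration. The unrolled body is one possible unrolling (offsets 0 and -4, pointer step -8, no register renaming, no rescheduling). All of these are assumptions, and the helper names are hypothetical.

```python
# Each entry: (opcode, destination register or None, source registers)
ORIGINAL = [
    ("lw", "R2", ["R1"]),
    ("add", "R2", ["R2", "R3"]),
    ("sw", None, ["R2", "R1"]),
    ("addi", "R1", ["R1"]),
    ("bne", None, ["R1", "R5"]),
]

# Assumed unrolling by two: the second copy uses offset -4, addi steps by -8.
UNROLLED = [
    ("lw", "R2", ["R1"]),
    ("add", "R2", ["R2", "R3"]),
    ("sw", None, ["R2", "R1"]),
    ("lw", "R2", ["R1"]),
    ("add", "R2", ["R2", "R3"]),
    ("sw", None, ["R2", "R1"]),
    ("addi", "R1", ["R1"]),
    ("bne", None, ["R1", "R5"]),
]


def cycles_per_iteration(body, branch_penalty=1):
    """Instructions + load-use stalls + branch penalty (assumed model)."""
    stalls = 0
    for prev, cur in zip(body, body[1:]):
        if prev[0] == "lw" and prev[1] in cur[2]:
            stalls += 1  # result of lw is needed by the very next instruction
    return len(body) + stalls + branch_penalty


def cpi(body):
    return cycles_per_iteration(body) / len(body)


if __name__ == "__main__":
    print("original CPI:", cpi(ORIGINAL))
    print("unrolled CPI:", cpi(UNROLLED))
    print("cycles per element:", cycles_per_iteration(ORIGINAL),
          "vs", cycles_per_iteration(UNROLLED) / 2)
```

In this model the original loop takes 7 cycles for 5 instructions and the unrolled loop 11 cycles for 8 instructions, so the benefit shows up mainly as fewer cycles per element processed, because the `addi` and `bne` overhead is paid once for two elements.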
