Question: Latency and Throughput Bounds A superscalar with four function units can perform integer multiplication and floating point multiplication with computation times (measured in clock cycles)

Latency and Throughput Bounds A superscalar with four function units can perform integer multiplication and floating point multiplication with computation times (measured in clock cycles) as follows:

Latency Issue Capacity

mul. 3 1 2

fmul 5 5 2

For the purposes of the calculations, assume that the cycle time is 250 ps, i.e., an equivalent scalar machine would run at a peak rate of 4 GIPS.

(a) [2 marks] In the case of mul, calculations can be issued every clock cycle (I = 1), but they take 3 clock cycles to complete (L = 3). Does it make sense that the issue time is strictly less than the latency? Why or why not? Would it make sense if the issue time was strictly greater than the latency? Why or why not?

(b) [2 marks] Compute the latency bound and throughput bound for mul and fmul. Express your answers in cycles per instruction (CPI) and GIPS.

(c) [1 mark] Consider a program that does nothing but a long sequence of multiplication instructions. If there are no data dependencies among the multiplications, i.e., they could be run in any order with maximum parallelism, how quickly could multiplications be performed? Express your answer using GIPS. Give one answer for integer mul and one for floating-point fmul.

(d) [1 mark] If the program instead contained one long critical path, i.e., one linear data dependent sequence of multiplications like:

x <- x * a

x <- x * b

x <- x * c

x <- x * d

x <- x * e

x <- x * f ...

how quickly could multiplications be performed? Again, give two answers: one for mul, and one for fmul.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

2. For the following questions please use the following latencies and issues data for operations Integer Double Precision Operation Addition Multiplication Latency Issue Capacity Latency Issue...

On modern processors, why does the floating-point add operation take longer than the integer add operation (assume the same number of bytes in each data type)? As an example, see figure 5.12 on page...

Jupiter Notebook We have covered some of the limitations of single layer neural networks in class, but they are still powerful learning systems that provide a good way to begin learning about how to...

In this question you will be asked to reflect on a project you have been involved in or observed, in which a design evolved, or could have evolved, through applying a theory of user behaviour. You...

Consider the trigonometric series a0 2 + X r=1 (ar cos rx + br sin rx) where a0, a1, a2, . . . and b1, b2, . . . are constants and suppose that f(x) is a periodic function of x with period 2. (a)...

In a Hopfield neural network configured as an associative memory, with all of its weights trained and fixed, what three possible behaviours may occur over time in configuration space as the net...

This is problem is slightly different from what i found and I dont understand how it changes. Please help with this problem Suppose we wish to write a procedure that computes the inner product of two...

can someone solve this Modern workstations typically have memory systems that incorporate two or three levels of caching. Explain why they are designed like this. [4 marks] In order to investigate...

Portray in words what transforms you would have to make to your execution to some degree (a) to accomplish this and remark on the benefits and detriments of this thought.You are approached to compose...

The Sale of Goods Act in each province implies certain terms into contracts of sale relating to the fitness and quality of the product. Some Canadian jurisdictions make these provisions mandatory in...

7 Let W be the subspace of R spanned by the vectors Find the projection matrix P that projects vectors in R* onto 7 W. 17 7.

If a businessperson were traveling to a foreign country to do business, how would he or she promote a positive business relationship?

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

In the Data Source View in Visual Studio, what option is available to view data in any Source View Table? What are the primary uses this capability?

What Microsoft Analysis Services Extension for Visual Studio 2017 needs to be installed before beginning work on a Multidimensional OLAP Cube Project? How can the installation be verified?

Why would the FedScope Employment database be more representative of the General Population in terms of Salary Data than the CPS studies?