Question: PLEASE ONLY ANSWER NUMBER 2!

#1) A particular program expressed in a particular ISA executes 200 ALU instructions, 10 Loads, 16 Stores, and 4 Branches. A simple, non-pipelined implementation of that ISA takes 8 CPI for each ALU instruction, 20 CPI for each Load, 10 CPI for each Store, and 10 CPI for each Branch. The original clock frequency is 2 GHz. How many clock cycles would the program take to execute? How many microseconds would the program take to execute?

Type      Count   CPI
ALU       200     8
Load      10      20
Store     16      10
Branch    4       10

Cycles = (200*8) + (10*20) + (16*10) + (4*10) = 1600 + 200 + 160 + 40 = 2000

T = 1/f = 0.5 ns

Time = 2000 * 0.5 ns = 1000 ns = 1 us
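The arithmetic for question 1 can be double-checked with a short script. This is only a sketch: the names `mix`, `period_ns`, and `total_cycles` are made up for illustration, and the counts and CPI values are copied from the problem statement.

```python
# Instruction mix from the problem statement: type -> (count, CPI).
mix = {
    "ALU": (200, 8),
    "Load": (10, 20),
    "Store": (16, 10),
    "Branch": (4, 10),
}

clock_ghz = 2                # original clock frequency, in GHz
period_ns = 1 / clock_ghz    # clock period in ns (0.5 ns at 2 GHz)


def total_cycles(mix):
    """Sum count * CPI over every instruction type."""
    return sum(count * cpi for count, cpi in mix.values())


cycles = total_cycles(mix)
time_ns = cycles * period_ns
time_us = time_ns / 1000

print(f"cycles = {cycles}, time = {time_ns} ns = {time_us} us")
```

It should report the same 2000 cycles and 1 us as the hand calculation.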

#2) Given the circumstances described in question 1 above, which of the following changes by itself would yield at least 2X speedup?

a) A clever compiler is able to eliminate all the Branch instructions

b) An improved ALU design reduces ALU instruction CPI from 8 to 2

c) Rewriting the program reduces the number of ALU instructions to 100

d) Adding a cache reduces Load CPI from 20 to 5 and Store CPI from 10 to 5

e) New VLSI fabrication technology halves the clock period, but doesn't change memory speed so Load takes 40 CPI
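One way to check each option is to recompute the execution time with only that change applied and compare it to the baseline, taking speedup = old time / new time. The sketch below does this under a few assumptions: every option is applied by itself to the question 1 numbers, and for (e) only the Load CPI changes (to 40) while the other CPIs stay the same and the clock period drops from 0.5 ns to 0.25 ns. The helper names (`exec_time_ns`, `with_changes`) are made up for illustration.

```python
# Baseline from question 1: type -> (count, CPI), with a 0.5 ns clock period.
BASE_MIX = {
    "ALU": (200, 8),
    "Load": (10, 20),
    "Store": (16, 10),
    "Branch": (4, 10),
}
BASE_PERIOD_NS = 0.5


def exec_time_ns(mix, period_ns):
    """Execution time = total cycles * clock period."""
    cycles = sum(count * cpi for count, cpi in mix.values())
    return cycles * period_ns


def with_changes(**changes):
    """Return a copy of the baseline mix with some entries replaced."""
    mix = dict(BASE_MIX)
    mix.update(changes)
    return mix


# Each option: (modified mix, clock period in ns).
options = {
    "a": (with_changes(Branch=(0, 10)), BASE_PERIOD_NS),       # no Branches
    "b": (with_changes(ALU=(200, 2)), BASE_PERIOD_NS),         # ALU CPI 8 -> 2
    "c": (with_changes(ALU=(100, 8)), BASE_PERIOD_NS),         # 100 ALU instrs
    "d": (with_changes(Load=(10, 5), Store=(16, 5)), BASE_PERIOD_NS),
    "e": (with_changes(Load=(10, 40)), BASE_PERIOD_NS / 2),    # half period
}

base_time = exec_time_ns(BASE_MIX, BASE_PERIOD_NS)
speedups = {
    name: base_time / exec_time_ns(mix, period)
    for name, (mix, period) in options.items()
}

for name, s in speedups.items():
    print(f"{name}) speedup = {s:.2f}x, at least 2X: {s >= 2}")
```

Under these assumptions the speedups come out to roughly 1.02x for (a), 2.50x for (b), 1.67x for (c), 1.13x for (d), and 1.82x for (e), so only (b) reaches 2X. Option (e) falls short because the Load cycles double (2200 cycles total) even though each cycle is half as long.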
