Question:
1) How many simultaneous and how many concurrent instruction streams can we support in each of the four schemes?
2) What is the size, in bits, of the registers in the execution contexts of each chip?
3) Which of chips b, c, and d offers the maximum memory latency reduction, and which the minimum?
4) How many pieces of independent work are needed to run each chip with maximum latency-hiding ability?
5) Between chips c and d, which offers the greater latency-hiding ability? Why?
6) For which chip do you believe the compiler does the heaviest work? Why?
7) For which chip do you believe the compiler does the lightest work? Why?
8) Under which circumstances (the nature and characteristics of the programs run on each) does chip b perform better than chip d, and vice versa?
9) Assume you know you are running a program that has high temporal locality. Which is preferable in this case: a large cache with few or no hardware threads, a small or no cache with many hardware threads, or a large cache with many hardware threads?
