Question: 5 . 2 6 Chip multiprocessors ( CMPs ) have multiple cores and their caches on a single chip. CMP on - chip L 2
Chip multiprocessors CMPs have multiple cores and their caches on a single chip. CMP onchip L cache design has interesting tradeoffs. The following table shows the miss rates and hit latencies for two benchmarks with private vs shared L cache designs. Assume L cache has a miss rate and a cycle access time. Private Shared Benchmark A missesperinstruction Benchmark B missesperinstruction Assume the following hit latencies: Private Cache Shared Cache Memory
Which cache design is beer for each of these benchmarks? Use data to support your conclusion.
Offchip bandwidth becomes the boleneck as the number of CMP cores increases. How does this boleneck affect private and shared cache systems differently? Choose the best design if the latency of the first offchip link doubles.
Discuss the pros and cons of shared vs private L caches for both singlethreaded, multithreaded, and multiprogrammed workloads, and reconsider them if having onchip L caches.
Would a nonblocking L cache produce more improvement on a CMP with a shared L cache or with a private L cache? Why?
Assume new generations of processors double the number of cores every months. To maintain the same level of percore performance, how much more offchip memory bandwidth is needed for a processor released in three years?
Consider the entire memory hierarchy. What kinds of optimizations can improve the number of concurrent misses?
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
