Question: The performance of a snooping cache-coherent multiprocessor depends on many detailed implementation issues that determine how quickly a cache responds with data in an exclusive

The performance of a snooping cache-coherent multiprocessor depends on many detailed implementation issues that determine how quickly a cache responds with data in an exclusive or M state block. In some implementations, a CPU read miss to a cache block that is exclusive in another processor's cache is faster than a miss to a block in memory. This is because caches are smaller, and thus faster, than main memory. Conversely, in some implementations, misses satisfied by memory are faster than those satisfied by caches. This is because caches are generally optimized for "front side" or CPU references, rather than "back side" or snooping accesses.
For the multiprocessor illustrated in Figure 4.37, consider the execution of a sequence of operations on a single CPU where
€¢ CPU read and write hits generate no stall cycles.
€¢ CPU read and write misses generate Nmemory and Ncache stall cycles if satisfied by memory and cache, respectively.
€¢ CPU write hits that generate an invalidate incur Ninvalidate stall cycles.
€¢ A writeback of a block, either due to a conflict or another processor's request to an exclusive block, incurs an additional Nwriteback stall cycles.
Consider two implementations with different performance characteristics summarized in Figure 4.38.
Consider the following sequence of operations assuming the initial cache state in Figure 4.37. For simplicity, assume that the second operation begins after the first completes (even though they are on different processors):
P1: read 110
P15: read 110
For Implementation 1, the first read generates 80 stall cycles because the read is satisfied by P0's cache. P1 stalls for 70 cycles while it waits for the block, and P0 stalls for 10 cycles while it writes the block back to memory in response to P1's request. Thus the second read by P15 generates 100 stall cycles because its miss is satisfied by memory. Thus this sequence generates a total of 180 stall cycles.
For the following sequences of operations, how many stall cycles are generated by each implementation?
a. P0: read 120
P0: read 128
P0: read 130
b. P0: read 100
P0: write 108 P0: write 130 c. P1: read 120
P1: read 128
P1: read 130
d. P1: read 100
P1: write 108 P1: write 130

Figure 4.38 Snooping coherence latencies.

Parameter Implementation 1 100 70 15 10 Implementation 2 cache invalidate writcback 100 130 15 10

Step by Step Solution

★★★★★

3.49 Rating (175 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock

a P0 read 120 Read miss satisfied by memory P0 read 128 Read mis... View full answer

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Document Format (1 attachment)

903-C-S-S-A-D (3196).docx

120 KBs Word File

Students Have Also Explored These Related Systems Analysis And Design Questions!

Many snooping coherence protocols have additional states, state transitions, or bus transactions to reduce the overhead of maintaining cache coherency. In Implementation 1 of Exercise 4.2, misses are...

The switched interconnect increases the performance of a snooping cache-coherent multiprocessor by allowing multiple requests to be overlapped. Because the controllers and the networks are pipelined,...

For the multiprocessor illustrated in Figure 4.42 implementing the protocol described in Figure 4.43 and Figure 4.44, assume the following latencies: ¢ CPU read and write hits generate no stall...

[20/20/20/20] The performance of a snooping cache-coherent multiprocessor depends on many detailed implementation issues that determine how quickly a cache responds with data in an exclusive or...

5.2: The performance of a snooping cache-coherent multiprocessor depends on many detailed implementation issues that determine how quickly a cache responds with data in an exclusive or M state block....

Use the formulae bellow to find ROCE for HOME DEPOT using there 10-k. ROCE = RNOA + LEVERAGE * (RNOA - NET BORROWING COST) 10-K bellow:- Table of Contents UNITED STATES SECURITIES AND EXCHANGE...

A Tale of Two Hospitals A Tale of Two Hospitals How an Electronic Health Records (EHR) Implementation Can Be A Strategic Advantage 1 A Tale of Two Hospitals Academic Abstract The Tale of Two...

The next three question use data from the BMC 3/31/2013 10-K, which can be viewed by clicking here. You are doing a Comparable Companies analysis for BMC. After the Company issued its 10-K, the...

"The light that burns twice as bright burns half as long..." The origin of the above quote (with "flame" or "candle" sometimes substituted for "light") is unclear. It is often attributed to either...

Dani Corporation has 7 million shares.of.common stock outstanding. The current share price is $79, and the book value per share is $10. The company also has two bond issues outstanding The first bond...

What is the starting point when devoloping the Master Budget? A . Salos budget B . Production budget C . Operating expense budgot D . Financial budgets

How does the shape of the t distribution compare to a normal distribution? a. The t distribution is flatter and more spread out, especially when n is small. b. The t distribution is flatter and more...

The results of running Skippy are shown for a mock disk (Disk Alpha) in Figure 6.25. a. What is the minimal transfer time? b. What is the rotational latency? c. What is the head switch time? Figure...

Assume that reconstruction of the RAID 4 array begins at time t. a. What read and write operations are required to perform the reconstruction? b. For offline reconstruction, when will the...

In this exercise, we will investigate the mean time until data loss (MTDL). In RAID 4, data is lost only if a second disk fails before the first failed disk is repaired. a. What is the likelihood of...

show step by step please. (Security market line) James Fromholtz is considering whether to invest in a newly formed investment fund. The fund's investment objective is to acquire home mortgage...

Course Project 1 - Brainstorming They will provide the roadmap and specific examples for how to complete your work. The Course Project consists of a Needs Assessment, an Action Plan based on what you...

help ASAP please!! sustainable growth As a firm grows, it must support increases in revenue with new investments in assets. The self-supporting growth model helos a firm asseks how apidly it can...