Question:

Assume both benchmarks have a base CPI of 1 (ideal L2 cache). If having a non-blocking cache improves the average number of concurrent L2 misses from 1 to 2, how much performance improvement does this provide over a shared L2 cache? How much improvement can be achieved over a private L2 cache?

Both Barcelona and Nehalem are chip multiprocessors (CMPs), having multiple cores and their caches on a single chip. CMP on-chip L2 cache design has interesting trade-offs. The following table shows the miss rates and hit latencies for two benchmarks with private vs. shared L2 cache designs. Assume L1 cache misses once every 32 instructions.

           Benchmark A misses-per-instruction   Benchmark B misses-per-instruction
Private    0.30%                                0.06%
Shared     0.12%                                0.03%

The next table shows hit latencies.

                 a.     b.
Private Cache    5      10
Shared Cache     20     50
Memory           180    120
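One way to set up the arithmetic for the non-blocking question is sketched below. It is only a sketch under stated assumptions, not the textbook's official solution: it assumes the latencies are in cycles, that the base CPI of 1 already covers every access that hits in L2 (one reading of "ideal L2 cache"), that each L2 miss stalls the core for the full memory latency when the cache is blocking, and that averaging 2 concurrent misses halves the exposed stall per miss. The names `L2_MPI`, `MEMORY_LATENCY`, `cpi`, and `speedup` are illustrative only.

```python
# Sketch only: a simple stall model for the non-blocking cache question.
# Assumptions (not stated in the exercise text):
#   * latencies are in cycles
#   * base CPI of 1 already includes all L2 hits ("ideal L2")
#   * a blocking cache exposes the full memory latency on every L2 miss
#   * N concurrent misses divide the exposed stall per miss by N

BASE_CPI = 1.0

# L2 misses per instruction, taken from the first table.
L2_MPI = {
    "A": {"private": 0.0030, "shared": 0.0012},
    "B": {"private": 0.0006, "shared": 0.0003},
}

# Memory latency, taken from the "Memory" row of the second table.
MEMORY_LATENCY = {"a": 180, "b": 120}


def cpi(mpi, memory_latency, concurrent_misses=1):
    """CPI with L2-miss stalls overlapped across concurrent misses."""
    return BASE_CPI + mpi * memory_latency / concurrent_misses


def speedup(benchmark, design, variant):
    """Speedup from going from 1 to 2 concurrent L2 misses."""
    mpi = L2_MPI[benchmark][design]
    latency = MEMORY_LATENCY[variant]
    return cpi(mpi, latency, 1) / cpi(mpi, latency, 2)


if __name__ == "__main__":
    for variant in ("a", "b"):
        for benchmark in ("A", "B"):
            for design in ("shared", "private"):
                s = speedup(benchmark, design, variant)
                print(f"variant {variant}, benchmark {benchmark}, "
                      f"{design} L2: speedup {s:.3f}")
```

Under this model the private design gains more from non-blocking behavior than the shared design, because its higher miss rate leaves more memory stall time to overlap. A different reading of "ideal L2 cache" (for example, charging the L2 hit latency on every L1 miss) would change the absolute CPI values.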


Step by Step Solution


Benchmark A
CPI with Shared L2: 1 0384 20 868
CPI with NonBlocking L2: 1 0384 2 20 1536
Performance Im...

