Question: (5) [22 pts] Consider two processors with different instruction set architecture, P1 and P2. Processor P1 runs on a clock rate of 2.5 GHz and
(5) [22 pts] Consider two processors with different instruction set architecture, P1 and P2. Processor P1 runs on a clock rate of 2.5 GHz and P2 runs on 3.0 GHz. There are four classes of instructions: arith logic, load, store, and control, whose CPIs and frequencies of each processor are given in the following table Instruction class Execution Frequency Execution Frequency CPIs of P1 CPIs of P2 (base CPI with (base CPI with ideal memory) ideal memory) arith/logic 0.9 40% 1.0 50% load 1.2 25% 1.4 20% store 1.5 15% 1.8 10% control 1.1 20% 1.3 20% Suppose these two CPUs use separate level one (LI) caches for instructions and data (Harvard memory architecture) with different miss rates for instruction and data access: - A cache hit incurs no stall cycles. - Assume a cache miss rate of 0.6% for instruction fetch and a cache data miss rate of 5% in Li cache. - The second level of cache could be accessed in 6 clock cycles. - Assume a cache miss rate of 4% for instruction fetch and data access in L2 cache. - While a L2 cache miss incurs 200 stall cycles for memory reads and 300 stall cycles for memory writes respectively. (a) [16 pts) For these two processors, find the AMAT and CPI using this cache. Supposed the same number of instructions is executed by these two processors when running a benchmark, which processor is faster? () [6 pts] It is about Amdahl's Law. Suppose that we can improve the Arith/Logic instructions performance of machine by a factor of 20 (the same Arith/Logic instructions run 20 times faster on this new machine) because we use a new implementation of ALU. Calculate the speedup for these two processors respectively. (5) [22 pts] Consider two processors with different instruction set architecture, P1 and P2. Processor P1 runs on a clock rate of 2.5 GHz and P2 runs on 3.0 GHz. There are four classes of instructions: arith logic, load, store, and control, whose CPIs and frequencies of each processor are given in the following table Instruction class Execution Frequency Execution Frequency CPIs of P1 CPIs of P2 (base CPI with (base CPI with ideal memory) ideal memory) arith/logic 0.9 40% 1.0 50% load 1.2 25% 1.4 20% store 1.5 15% 1.8 10% control 1.1 20% 1.3 20% Suppose these two CPUs use separate level one (LI) caches for instructions and data (Harvard memory architecture) with different miss rates for instruction and data access: - A cache hit incurs no stall cycles. - Assume a cache miss rate of 0.6% for instruction fetch and a cache data miss rate of 5% in Li cache. - The second level of cache could be accessed in 6 clock cycles. - Assume a cache miss rate of 4% for instruction fetch and data access in L2 cache. - While a L2 cache miss incurs 200 stall cycles for memory reads and 300 stall cycles for memory writes respectively. (a) [16 pts) For these two processors, find the AMAT and CPI using this cache. Supposed the same number of instructions is executed by these two processors when running a benchmark, which processor is faster? () [6 pts] It is about Amdahl's Law. Suppose that we can improve the Arith/Logic instructions performance of machine by a factor of 20 (the same Arith/Logic instructions run 20 times faster on this new machine) because we use a new implementation of ALU. Calculate the speedup for these two processors respectively
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
