Question: Consider 1 - D convolution, with 7 - element input { I 0 , I 1 , I 2 , I 3 , I 4
Consider D convolution, with element input element
weight and element output The cache global
buffer can accommodate input elements, weights and output elements. Each global
buffer operates as a depth FIFO. Initial output values are all zero and need not to be read
from the DRAM and all final output results need to be written back to the DRAM. There
is a single Processing Element. Please estimate the numbers of DRAM accesses for the
following scenarios and show intermediate results.
a Weightstationary design points
b Outputstationary design points
c Weightstationary while the output is partitioned into two tiles of equal size
points
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
