Question: cuda program to implement convolution operation I have implemented the code / / CUDA kernel for convolution / / CUDA kernel for convolution global _

cuda program to implement convolution operation I have implemented the code // CUDA kernel for convolution
// CUDA kernel for convolution
global__ void convolutionKernel(long int* inputMatrix, long int* outputMatrix, long int* filter, int m, int n,int k){
int row = blockIdx.y*blockDim.y+ threadIdx.y;
int col = blockIdx.x*vlockDim.x+ threadIdx.x;
???shared??long int tileTILESIZE +FILTERSIZE -1][TILESIZE+FILTERSIZE -1];
int tileRow = threadIdx.y;
int tilecol = threadIdx.x;
int inputRow = row - FILTER_SIZE ?2;
int inputCol = col - FILTER_SIZE ?2;
If (inputRow 0 && inputRow <m && inputCol 0 && inputCol <n

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!