Question: Grayscale Host: Allocate device memory. Copy host memory (the bitmap pixel data) to device. Create a width-by-height grid of 1-by-1 blocks Each block corresponds to

Grayscale

Host:

Allocate device memory.

Copy host memory (the bitmap pixel data) to device.

Create a width-by-height grid of 1-by-1 blocks

Each block corresponds to an individual pixel, whose coordinates are given as blockIdx.x + blockIdx.y * gridDim.x. (Remember that access to global memory is only in the form of 1-D arrays.) Invoke a CUDA kernel which you will write. Insert this kernel code prior to imgProc().

Copy results from device to host.

Deallocate device memory.

(1) How many floating operations are being performed in your color conversion kernel? EXPLAIN.

(2) How many global memory reads are being performed by your kernel? EXPLAIN.

(3) How many global memory writes are being performed by your kernel? EXPLAIN.

(4) Describe what possible optimizations can be implemented to your kernel to achieve a performance speedup.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Optimize the reduction algorithm in kernel.cu so that the max value of the data is obtained. (other related files are provided) kernel.cu #define BLOCK_SIZE 512 #define SIMPLE __global__ void...

Edit kernel.cu o complete the functionality of histogram use a block size of 512. here are three modes of operation for the application. Check main() for a description of the modes (repeated below)....

#include #include #include #include "./ppmFile.c" #define MAX_VALUE 1000 #define MAX_WIDTH 1680 #define MAX_HEIGHT 1050 struct Point{ float r,g,b; float x,y; }; __constant__ Point p[MAX_VALUE];...

Question No 1 [CLO-1, C3] [17] 11. 111 1V. V. a. Carry out the following operations to accelerate the application execution by offloading the Jacobi kernel to GPGPU device i. Declare and allocate...

Use Google Colab and write a CUDA program to multiply two vectors (e.g., a, b) element wise, and store them to another vector (e.g., c). Each vector should have 2000000 elements, and initialize...

Objective The objective of this problem is to implement a tiled image convolution using both shared and constant memory. We will have a constant 5 x 5 convolution mask, but will have arbitrarily...

I am requesting the following code in CUDA with C++. Instructions are first. Your program must be a CUDA program. When executed, it should launch a grid of 256 blocks, with 256 threads in each block....

No command argument for filter radius is necessary in this assignment, as we utilize the constant average filter of size 5 5 where the filter radius is 2 . Within " convolution.cu " , implement one...

Task Description: In this assignment, you are tasked with developing a complete CUDA C C + + program for an image blur application, also known as image smoothing that we learned in Module 3...

computer science your input image is named "santa - grayscale.jpg " , the execution command would be " / / convolution santa - grayscale.jpg " No command argument for filter radius is necessary in...

How does a government determine which governmental funds are major funds? How does a government decide which proprietary funds are major funds?

In what situations can collisions occur in all three networks? Distinguish between collisions on PHY and MAC layer. How do the three wireless networks try to solve the collisions or minimise the...

11. When income is equal to consumption, saving is . ( LO5 , 6 ) a) negative b) zero c) positive d) impossible to calculate because there is insufficient information

The following information is taken from Michelle Corporation at 31 December 2019, the end of Michelles fiscal year: Account Amount___ Sales revenue 1,500,000 Service revenue 180,000 Interest revenue...

5. Get feedback from employees on the initial design. Use focus groups, town hall meetings, suggestion systems, or email polls. Share all the details so that employees can effectively evaluate the...

4. Design the program by using a crossfunctional team that includes HR, line managers, and senior managers. This way, input from many different areas of the organizations is provided.

2. Require employees to log in. Required use of company passwords and sign-on credentials limits access to this internal document to those who have a legitimate right to read it. The handbook is not...