Question: Using Google Colab, write a CUDA program that multiplies two vectors (e.g., a, b) element-wise and stores the result in another vector (e.g., c). Each vector should have 2,000,000 elements; initialize every element of the first vector to 4 and of the second vector to 9. Print the last element of the product vector for verification (it should be 36).
This is the guideline code:
#include <stdio.h>
#include <stdlib.h>
#include <math.h>
#include <assert.h>
#include <cuda_runtime.h>

#define N 2000000
#define MAX_ERR 1e-6

/* Abort with a readable message if any CUDA runtime call fails.
 * Kernel launches report errors asynchronously, so we also call
 * cudaGetLastError() right after the launch. */
#define CUDA_CHECK(call)                                                    \
    do {                                                                    \
        cudaError_t err_ = (call);                                          \
        if (err_ != cudaSuccess) {                                          \
            fprintf(stderr, "CUDA error %s:%d: %s\n", __FILE__, __LINE__,   \
                    cudaGetErrorString(err_));                              \
            exit(EXIT_FAILURE);                                             \
        }                                                                   \
    } while (0)

/* Element-wise product: out[i] = a[i] * b[i] for 0 <= i < n.
 * Grid-stride loop so the kernel is correct for ANY grid/block
 * configuration (including a single-thread debug launch) and for
 * n larger than one grid can cover in a single pass. */
__global__ void vector_mult(float *out, const float *a, const float *b, int n) {
    int stride = gridDim.x * blockDim.x;
    for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n; i += stride) {
        out[i] = a[i] * b[i];
    }
}

int main(void) {
    float *a, *b, *out;          /* host buffers  */
    float *d_a, *d_b, *d_out;    /* device buffers */
    const size_t bytes = sizeof(float) * N;

    /* Allocate host memory */
    a   = (float *)malloc(bytes);
    b   = (float *)malloc(bytes);
    out = (float *)malloc(bytes);
    if (a == NULL || b == NULL || out == NULL) {
        fprintf(stderr, "host allocation failed\n");
        return EXIT_FAILURE;
    }

    /* Initialize host arrays: a[i] = 4, b[i] = 9, so every product is 36 */
    for (int i = 0; i < N; i++) {
        a[i] = 4.0f;
        b[i] = 9.0f;
    }

    /* Allocate device memory */
    CUDA_CHECK(cudaMalloc((void **)&d_a,   bytes));
    CUDA_CHECK(cudaMalloc((void **)&d_b,   bytes));
    CUDA_CHECK(cudaMalloc((void **)&d_out, bytes));

    /* Transfer input data from host to device */
    CUDA_CHECK(cudaMemcpy(d_a, a, bytes, cudaMemcpyHostToDevice));
    CUDA_CHECK(cudaMemcpy(d_b, b, bytes, cudaMemcpyHostToDevice));

    /* Launch the kernel: 256 threads/block, ceil-div grid to cover N.
     * (The grid-stride loop would also tolerate a smaller grid.) */
    const int threadsPerBlock = 256;
    const int blocksPerGrid = (N + threadsPerBlock - 1) / threadsPerBlock;
    vector_mult<<<blocksPerGrid, threadsPerBlock>>>(d_out, d_a, d_b, N);
    CUDA_CHECK(cudaGetLastError());       /* catch bad launch config */

    /* Copy the result back; cudaMemcpy blocks until the kernel finishes */
    CUDA_CHECK(cudaMemcpy(out, d_out, bytes, cudaMemcpyDeviceToHost));

    /* Verification: every element must equal a[i]*b[i] (= 36) */
    for (int i = 0; i < N; i++) {
        assert(fabs(out[i] - a[i] * b[i]) < MAX_ERR);
    }
    printf("out[%d] = %f\n", N - 1, out[N - 1]);
    printf("PASSED\n");

    /* Deallocate device memory */
    CUDA_CHECK(cudaFree(d_a));
    CUDA_CHECK(cudaFree(d_b));
    CUDA_CHECK(cudaFree(d_out));

    /* Deallocate host memory */
    free(a);
    free(b);
    free(out);
    return 0;
}
Step by Step Solution
There are 3 steps involved in the solution.
Get step-by-step solutions from verified subject matter experts
