Question: Consider the problem of multiplying a dense n x n matrix A with an n x 1 vector B to generate an n x 1

Consider the problem of multiplying a dense n x n matrix A with an n x

1

vector B to generate an n x

1

vector C

.

The ith element, C

[

],

corresponds to the dot

-

product of the ith row of A and the input vector B

,

as illustrated in the following Figure

1 .

Part

1

: Describe how you partition the computation tasks, organize threads, and map threads to the tasks.

Part

2

: Write a matrix

-

vector multiplication CUDA kernel matrixVectorMulKernel by completing the following code:

_

global

_

void matrixVectorMulKernel

(

float

*

,

float

*

,

float

*

,

int vectorLen

) {

Part

3

: Write a host function matrix VectorMul that can be called in the main function with four parameters: pointer to the input matrix, pointer to the input vector, pointer to the output vector, and the number of elements in each dimension. This function should include statements for memory allocation, data transfer, thread organization, kernel function call and free memory. Complete the following code:

void matrixVectorMul

(

float

*

_

,

float

*

_

,

float

*

_

,

int vectorLen

) {

Part

4

: If matrix

-

vector multiplication is implemented on a distributed memory system using multiple CPUs instead of GPUs and CUDA, which collective communication operations

(

.

.,

one

-

-

all broadcast, all

-

-

all broadcast, all

-

-

one reduction, all

-

-

all reduction, scatter, gather

)

can be utilized to enhance performance? Describe how these operations can be applied effectively

Consider the problem of multiplying a dense n x n

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Enlightened Eats in Anchorage, Alaska, has six employees who are paid semimonthly. Calculate the net pay from the information provided below for the November 15 pay date. Assume that all wages are...

Dense matri x -vector multiplication . Consider th e multiplication of a dense n x n matrix A with a vector b to yield anther vector y . The ith element y [l] of the product vector is the dot-product...

2. (40 points) (Assume that an integer is a word, and it takes 4 bytes) Consider a memory system with a single cycle cache and 100 cycle latency DRAM with the processor oper- ating at 1 GHz. The...

Please Code in Matlab Let A be an n x n matrix and b a column vector with n entries. There are a number of ways to solve the linear system Ac = b. Here are some: I = A-1 (if A is invertible). The...

Please Code in Matlab Let A be an n x n matrix and b a column vector with n entries. There are a number of ways to solve the linear system Arc = b. Here are some: r = A-1 (if A is invertible). The...

K-means clustering K-means clustering is a very well-known method of clustering unlabeled data. The simplicity of the process made it popular to data analysts. The task is to form clusters of similar...

Multiple linear regression solves the minimisation problem min (y-XB)(y = X) = - ,...., where the vectors = (o ... Bp)", n min (Bo Xij;), Bo.B1Bp i=1 - T j=1 , y = (y ... Yn)" and X is an n (p+1)...

Please solve using Matlab Problem 1: Poisson's Equation Consider the linear system Anp-p, where An is an n x n matrix with 2's on the main diagonal, -1's directly above and below the main diagonal...

T 8. Suppose A(t) is an n x n matrix, and consider the first-order linear vector differential equation on (to, t1] dx = A(t)x + f(t) dt where x(t), f(t) ER". Show that any scalar nth order linear...

Can anybody help me to solve these three questions (Exercises1-3)in MATLAB device, I also posted some basic MATLAB information in last two photos! Thank you very much! EXERCISES Instructions: For the...

Please complete c and d. Thanks The banded, upper triangular matrix A ERnXxn can be written as a2 b2 c2 3 an-2 n-2 Cn-2 where (ai ,a2, . . . , an-2, an-1, an) e Rn b- (bi, b2,..., bn-2, bn-1) ER" c=...

Repeat Example 8.7 using the implicit method of Eq. (8.118). Take t = 0.2 s and y = 0.01 m, which ensures that an explicit model would diverge. Compare your accuracy with Example 8.7.

Question 4: [5 Marks] Construct the following example and comment why such examples exists also put all details of the example: a. A metric that cannot be derived from norm. b. A norm which cannot be...

SRGAP 2 duplication in humans is believed to contribute primarily to: None of these Auditiviprocessing Synaptic plasticity and neocortex expansion Hownone repulation Fine motor skills

TRANSACTION PROBLEM #2 Transaction Analysis, Trial Balance, and Financial Statements On December 1, a group of in- dividuals formed a corporation to establish the Beeper, a neighborhood weekly...