Question: Problem 3 : You experiment with a GPU to compute the following multiplication C = A x B , where m = n = l

Problem

3

: You experiment with a GPU to compute the following multiplication C

=

A x B

,

where

m = n = l = 102

the elements of

A, B,

and

C

are all with the same bit

-

width of

4

Bytes. When implementing on GPU, each thread is in charge of computing for one element in C

.

Please answer the following questions

(

please show detailed steps.

) (25

pts

) .

A = [\begin{matrix} x_{1, 1} & c d o t s & x_{1, l} \\ v d o t s & d d o t s & v d o t s \\ x_{m, 1} & c d o t s & x_{m, l} \end{matrix}]

and

B = [\begin{matrix} y_{1, 1} & c d o t s & y_{1, n} \\ v d o t s & d d o t s & v d o t s \\ y_{l, 1} & c d o t s & y_{l, n} \end{matrix}]

)

From the hardware standpoint, the GPU has

4

Streaming Multiprocessors

(

SMs

),

each with exclusive L

1

and L

2

caches, and

8

warp schedulers managing

4

warps per scheduler. Each warp consists of

32

cores. From the software perspective, threads within the same block can share data directly, while threads across different blocks cannot. What is the maximum block size that matches the GPU hardware architecture?

(5

pts

) 8 4 32 = 1024

)

Following the result from a

),

how many blocks should we have for computing the entire matrix C

?

Please use dim

3

we have learned to initialize the threading.

(5

pts

)

dim

3

block

(32, 32)

dim

3

thread

(32, 32)

)

After applying tiling, with the tile size of

32 32,

houmany tiles can cover the entire matrix? Given the block size calculated in a

),

how many titles can cover thejentire block?

(p t s 1024 \frac{1024}{256} = 4096, \frac{1024}{256} = 4

)

Following the block and tile sizes in c

),

what is the proper size of shared memory that we need to request

(

Hint: threads within the same block can share data directly, while threads across different blocks cannot

) ?

Please explain your answer.

(10

pts

)

Just c & d

Problem 3 : You experiment with a GPU to compute

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Accounting Questions!

Problem 4 : You experiment with a GPU to compute the following multiplication C = A B , the elements of A , B , and C are all with the same bit - width. Please answer the following questions ( please...

chapter 5 INTRODUCTION TO MATRIX ALGEBRA GOALS The purpose of this chapter is to introduce you to matrix algebra, which has many applications. You are already familiar with several algebras:...

Problem 3 The performance of a GPU is often measured in the number of polygons per second being rendered. Consider a GPU with 2 processing cores, each capable of rendering a polygon in parallel. A...

2. Design a between subject experiment to test whether bees can distinguish red from green. 3. Design a between subject experiment to test whether human newborns can tell the difference between male...

Problem 1 (10 marks) Three years ago, you purchased a bond for $974.69. The bond had three years to maturity, a coupon rate of 8% paid annually, and a face value of $1,000. Each year you reinvested...

Problem 1 Order the following list of functions by the big-Oh notation. Group together (for example, by underlining) those functions that are big-Theta of one another. 4\" n3 n2logn 4109" V logn 22\"...

Homework help C .2 Ratio Analysis D E G Points: 3 items @ 4 points = 12 points Directions: Compute the following liquidity ratios based on the information provided below. Inventory Turnover Sales...

Problem 14-22A (Algo) Ratio analysis LO 14-3, 14-4, 14-5 Perez Companys income statement information follows: Year 3 Year 2 Net sales $ 411,000 $ 269,000 Income before interest and taxes 118,000...

SPRING 2020 MATH 131 : CALCULUS I Lab 2: Limits and Continuity Instructions: Work together with your group to understand the following ideas and solve the following problems. If you do not finish the...

Alsup Consulting sometimes performs services for which it receives payment at the conclusion of the engagement, up to six months after services commence. Alsup recognizes service revenue for...

Some experts believe that 20% of all freshwater fish in the United States have such high levels of mercury that they are dangerous to eat. Suppose a fish market has 250 fish tested, and 60 of them...

Write the inequality shown by this number line. Use the letter x. 05 5 10 15 20 25

Pharoah Company reported the following amounts for 2022: Raw materials purchased $95,200 Beginning raw materials inventory 5,824 Ending raw materials inventory 5,040 Beginning finished goods...