Question: Convert your code from Exercise 4.6 into PTX code. How many instructions are needed for the kernel? Exercise 4.6 With CUDA we can use coarse-grain

Convert your code from Exercise 4.6 into PTX code. How many instructions are needed for the kernel?

Exercise 4.6

With CUDA we can use coarse-grain parallelism at the block level to compute the conditional likelihood of multiple nodes in parallel. Assume that we want to compute the conditional likelihood from the bottom of the tree up. Assume seq_length = = 500 for all notes and that the group of tables for each of the 12 leaf nodes is stored in consecutive memory locations in the order of node number (e.g., the mth element of clP on node n is at clP [n*4*seq_length+m*4]). Assume that we want to compute the conditional likelihood for nodes 12–17, as shown in Figure 4.35. Change the method by which you compute the array indices in your answer from Exercise 4.5 to include the block number.

12 18 13 21 2 3 Figure 4.35 Sample tree. 14 5

Exercise 4.5

Now assume we want to implement the MrBayes kernel on a GPU using a single thread block. Rewrite the C code of the kernel using CUDA.

Assume that pointers to the conditional likelihood and transition probability tables are specified as parameters to the kernel. Invoke one thread for each iteration of the loop. Load any reused values into shared memory before performing operations on it.

12 18 13 21 2 3 Figure 4.35 Sample tree. 14 5 19 6 15 22 16 20 17 10 11

Step by Step Solution

★★★★★

3.37 Rating (147 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock

The question provided appears to be part of a larger context where specific CUDA code was developed probably in a book or courses exercises As you hav... View full answer

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Computer Architecture Questions!

With CUDA we can use coarse-grain parallelism at the block level to compute the conditional likelihood of multiple nodes in parallel. Assume that we want to compute the conditional likelihood from...

The Company XYZ has 1173 blocks of building for its business operation, where each block has 7 floors. The distance between each floor is 7 meters. ] (ii) Give a function run2diff which can be...

(a) Use the following text to derive distributions for rat and chased. Use a five-word window, including open- and closed- class words, ignore case, punctuation and sentence boundaries and weight...

Case study: Remedy Physiotherapy. I need help in drafting a marketing plan Names of people and businesses are disguised. Some aspects of the local area and the physiotherapy industry are simplified...

can someone solve this Modern workstations typically have memory systems that incorporate two or three levels of caching. Explain why they are designed like this. [4 marks] In order to investigate...

CIS 548 Enterprise Resource Planning. SAP Exercise Chapter 8 08-2 Materials Planning: Materials Planning Process Save Chapter 04: Procurement Process Exercise 04-02: Basic Procurement Process Single...

Please answer me page 51 to page 56 on the attachment. is a multiple choice questions. Thank you FAC1502/101/3/2016 Tutorial letter 101/3/2016 Financial accounting concepts, principles and procedures...

(1) Write an equation (using the actual numbers on the income statement) that shows how operating income is computed by the Coca Cola Co. (2) What item(s) are included as part of other...

In an exciting scene1 from the 1980 space-opera film Flash Gordon, two protagonists take turns in putting their arms into different holes of a large tree stump, where the wood beast lives, in order...

There are 8 employees on The Game Shop's sales team. Last month, they sold a total of g games. One of the sales team members. Chris, sold 17 fewer games than what the team averaged per employee How...

Why do you think that Roberto Goizueta switched from a strategy that emphasized localization towards one that emphasized global standardization? What were the benefits of such a strategy?

Repeat Activity 34 for utilitarian and valueexpressive appeals.

Write a Little Man program that prints out the sums of the odd values from 1 to 39. The output will consist of 1, 1 + 3, 1 + 3 + 5, 1 + 3 + 5 + 7 . . . . No input is required. As an aside, do you...

Write a Little Man program that prints out the odd numbers from 1 to 99. No input is required.

Write a Little Man program that adds a column of input values and produces the sum as output. The first input value will contain the number of values that follow as input to be added.

Keesha Compary borrows $200,000 cash on November 1 of the current year by signing a 90 -day, 9%,$200,000 note. (Click on the Chart of Accounts Tab below.)

JAVA CODE: Need code for the execute method: Variables for the bank model: EList = OList of Event objects (time, type) (ordered time (primary) and type (secondary)) Qcust = Queue2 of Customer objects...

What is the energy loss over a gradual contraction from a DN100 Schedule 40 steel pipe to a DN50 Schedule 40 steel pipe with a cone angle of 50, if the flow rate was 182.112L/min