Question: ANSWER WITH MATLAB CODE Write a function getcodonusage(s) that takes a DNA strand s as input, calls the findorf(s) function to find the ORF, and

ANSWER WITH MATLAB CODE

Write a function getcodonusage(s) that takes a DNA strand s as input, calls the findorf(s) function to find the ORF, and returns the codon usage of the sequence as both a vector (each element corresponding to the count of a codon, where codons are in alphabetical order) and as a struct with each codon as a field, and its count as the number of occurence of that codon. The returned structure should contain all possible codons, with codons not occurring in the sequence having a count of 0. Do not use functions available in the Bioinformatics toolbox. Hint: You may find getallcodons.m function useful. If your solution calls other files (such as findorf and getallcodons), you can add these functions to the end of your getcodonusage.m file, so your getcodonusage file is self-contained.

Here is an outline (pseudocode) for this function that you can follow. This is of course, not exactly the Matlab code.

Let v be the codon count vector. Let r be the codon count struct. allcodons=getallcodons(); Initialize v to be a vector of zeros (you will have as many elements as allcodons). Foreach allcodons as cod: Initialize r.(cod) to zero. orf=findorf(s); Foreach orf as cod: Find the index of cod in allcodons using strcmp(). Increment the corresponding entry in v by one. Increment r.(cod) by one.

The following are codes for findorf.m and getallcodons.m, please use them in your answer!:

function cods = getallcodons

for i='ACGT'

for j='ACGT'

for k = 'ACGT'

cods(end+1) = [i j k];

end

function o = findorf(s)

s='ATTAATGCATTTTTAGGAATA';

starts = strfind(s,'ATG');

stops = [strfind(s,'TAA') strfind(s,'TAG') strfind(s,'TGA')];

if isempty(starts); starts=1;

orfs = zeros(0,2);

for mystart=starts

I = (stops > mystart) & (mod(stops - mystart, 3)==0);

mystops = stops(I);

if isempty(mystops); mystops= numel(s); end

mystop = mystops(1);

orfs(end+1,:) = [mystart mystop];

end

lengths = orfs(:,2) - orfs(:,1);

[maxlen, I] = max(lengths);

startstop = orfs(I,:);

o = ();

for i =startstop(1) : 3 : startstop(2)

o[end+1] = s(i:i+2);

end

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

ANSWER WITH MATLAB CODE Write a function getcodonusage(s) that takes a DNA strand s as input, calls the findorf(s) function to find the ORF, and returns the codon usage of the sequence as both a...

Answer in python 2.7 please . I am really struggling with this and i really would love your help. Its due today and i am especially stuck on question 3 which builds on from the other codes like...

please try to answer parts 3,7&8 especially using python 2.7. my code is not working and this is due today. verify using working test codes. Basically I am stuck with the longestORF non reading (part...

please help with this code, I am confused. thank you! UnknownSeqs Layout References Mailings Review View Help Search A A Aa P A.D.A. S S 21 st. - - AaBbCcDc AaBbCcDc AaBb C AaBbcc 1 Normal No Spac......

BME 201 HOMEWORK 5 - How to fix the code for DNA Replication, Transcription, and translation. InsulinDNAseq is included. Please help! 0 10f7 BME 201 Homework 5 - Molecular Biology: Transcription and...

Finding genes in DNA is a fundamental problem in biology. After all, only a few percent of human DNA actually contains protein-coding genes. Sifting through the more than 3 billion base pairs to find...

C++ Your assignment is to build an application that can read a file of nucleotides into a linked list. 1. The file contains a sequence of nucleotides (G, C, A, T). 2. The file will always contain a...

HASKEL ASSIGNMENT -- * Each part has a main function for testing that part. -- * Make additional helper functions wherever you need them. -- * Replace 'undefined' with your own code when you are...

Problem 2 (25 points): Open reading frame finder. An Opending Reading Frame (ORF) is a continuous stretch of codons (nu- cleotide triplets) that contain a start codon (i.e., ATG) at the beginning and...

Computer Organization and Networks Practicals 2021/22 October 9, 2021 Computer Organization and Networks Practicals 2021/22 b68495714b Contents Contents 0 Introduction 3 0.1 Registration . . . . . ....

Consider the function f(x) = 12x5 + 75x - 120x + 4. f(x) has inflection points at (reading from left to right) x = D, E, and F where D is and E is and F is

From the given data, compute the following: Record the amounts withheld for group and health insurance and calculate the net pay for each employee. Record the amount to be withheld for health...

Quertion 2 ofs Excluding stocks traded in the United States, a stock that is traded in a country other than the inwing coepary's boes country is callod a q , a . Preferred stock b . Global classified...

Wolanin Pharmacy, part of a large chain of pharmacies, fills a variety of prescriptions for customers. The complexity of prescriptions filled by Wolanin varies widely; pharmacists can spend between...

3. To identify which trainees benefit most or least from the program.

1. To identify the programs strengths and weaknesses. This includes determining if the program is meeting the learning objectives, if the quality of the learning environment is satisfactory, and if...

4. Cost justification for training is based on numerical indicators. (Here the company has a strong orientation toward evaluation.)