Question: in C language. Genes are substrings of DNA which code for proteins and carry the heritable information from our parents. Genes start with the sequence



in C language.
Genes are substrings of DNA which code for proteins and carry the heritable information from our parents. Genes start with the sequence of three letters ATG, called the start codon, and end with one of the three sequences TCA, TAA, or TAG, called stop codons. The stretch of sequence between the start codon and any of the stop codons is a potential gene. Each codon codes for an amino acid represented by a letter of the alphabet. There is a total of 19 amino acids. Strung together, amino acids from proteins. A substring of a DNA sequence is a translatable sequence if: it has a length that is multiple of three, it starts with a start codon and ends with a stop codon it can be translated into an amino acid sequence For example, DNA sequence AATTAAGATGCCCCTCTAAAAT contains such a translatable sequence, starting at the 8th position and of 1ength 12 (ATGGGGCTCTAA), thus consisting of 4 codons. This sequence can be translated using a codon table into the ength three amino acid sequence MGL. Note that the start codon codes for amino acid M while the stop codons don t code for any amino acids le the stop codons don' code On the other hand, DNA sequence AATCAATCTACT is not a translatable sequence. Write program dna_translate. c that takes two command line arguments: an input file name, containing DNA sequences, and an output file name, in which you will store the translated, protein sequences. For each sequence, the program should identify the longest possible translatable sub-sequence, if one exists, and translate it into a protein using a codon table given in the file codeoflife.txt. See example below
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
