Question: I need help making the java code for this lab. Pasted below is the readme/directions for the lab. The project name of this lab is

I need help making the java code for this lab. Pasted below is the readme/directions for the lab.

The project name of this lab is Codon.

Purpose of this lab

This lab is to introduce you to one of the components of genetic testing. In our cells we contain both DNA and RNA which are used to create (or encode) proteins which do a lot of the work in our bodies.

The information contained within RNA is made up of a series of chemicals called nucleotides and those chemicals are abbreviated to four letters A (adenine, C (cytosine), U(uracil), and G (guanine). When someone has their RNA sequenced then you get a sequence of these chemicals and it looks like this:

GUGUCGCGUGGGCAUUCGUAA

However, the nucleotides bunch up into groups of three called codons (Links to an external site.) and it is the codon analysis that gives us insight into the genetic structure. Since they are lumped into groups of three you can actually have three possible sequences (called frames) from a given stream of nucleotides.

If you start with the first nucleotide and group by threes you get

GUG UCG CGU GGG CAU UCG UAA

If you start with the second one (by skipping the first one) you get

[skip G] UGU CGC GUG GGC AUU CGU [skip AA]

and if you start with the third one (by skipping the first two) you get

[skip GU] GUC GCG UGG GCA UUC GUA [skip A]

Each one of these constitutes a sequence that must be examined to find a sequence that creates a protein (called a protein sequence). When looking at the codons there are special codons which indicate the start of a protein sequence and those codons, called start codons, are "AUG", "GUG", and "UUG". If we look at our sequences above then only the first group of codons actually contains a valid start codon; the other two do not. A sequence is terminated by one of the stop codons which are "UAA", "UAG", and "UGA". Note that there is only one start codon and one stop codon for a protein sequence. All of the codons in between make up the protein that will actually be created by the protein sequence.

Design of the Software

The UML diagram below features the objects that you will create.

I need help making the java code for this lab. Pasted below

The Codon sequence is simply the nucletides grouped by threes and kept in the list called proteinSequences. This is created by adding one nucleotide at a time to our codon sequence using the method addNucleotide. You will have to take care of the grouping by three in your code. At any point you can figure out all of the protein sequences present by calling the findAllProteinSequences method which returns the number of protein sequences found. You can then extract any one of the ProteinSequences by giving an index. If the index is within the number of protein sequences found then the ProteinSequence object is returned. If the index is outside of the number of protein sequences found then a null is returned. The ProteinSequence object can be queried to find the starting codon and ending codon.

Note: Only your Program class should deal with reading the text file. It will at least do the printing of the number of sequences and the start and stop, but will delegate printing the list of sequences to the print method of the Codon class.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!