Question: Please help me solve this in the Java programming language.

The similarity between DNA sequences is important for biological research. In this case, you are asked to find the similarities between the DNA sequences given as input and to store them in a file.

As you know, DNA sequences are composed of 4 nucleic acids. These are A (Adenine), C (Cytosine), G (Guanine) and T (Thymine). They form the codes of protein synthesis, one of the most important processes in living things.
Write a class that reads the DNA sequences of various living things from a file (DNA.txt) in the following format and, for each living thing, writes to a file (similarity.txt) the 10 other living things whose DNA sequences are most similar to it.

On each line of the DNA.txt file;

After reading the DNA sequences, calculate the similarity between them with a simple calculation based on the Hamming distance. (In nature, DNA sequences have different lengths. For this question, assume they are all the same length.)

D1 = ATGAGCTACG
D2 = ATCGCCTACA

Hamming distance between the two sequences (a position counts as a match when both sequences hold the same nucleic acid at the same index):

H(D1, D2) = N / L = 6 / 10 = 0.6

Where N is the number of matching nucleic acids, and L is the length of the sequence.
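The calculation above can be sketched in Java as follows. This is only a minimal illustration, not a reference solution: the class name `DnaSimilarity` and the method name `similarity` are made up for this example, and the method assumes both sequences have the same length, as the question states. Note that, as defined here, the value counts matching positions, so a higher value means the sequences are more similar.

```java
public class DnaSimilarity {

    /**
     * Returns N / L, where N is the number of positions at which both
     * sequences hold the same nucleic acid and L is the sequence length.
     * Assumes equal-length, non-empty sequences (as the question states).
     */
    public static double similarity(String d1, String d2) {
        if (d1.length() != d2.length() || d1.isEmpty()) {
            throw new IllegalArgumentException("Sequences must be non-empty and of equal length");
        }
        int matches = 0;
        for (int i = 0; i < d1.length(); i++) {
            if (d1.charAt(i) == d2.charAt(i)) {
                matches++;
            }
        }
        return (double) matches / d1.length();
    }

    public static void main(String[] args) {
        // The two example sequences from the question: 6 matches out of 10.
        System.out.println(similarity("ATGAGCTACG", "ATCGCCTACA")); // prints 0.6
    }
}
```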

You have to write this similarity (S(D1, D2)) as follows.

The format of each line of the similarity.txt file should be as follows: S(D1, D2)
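One possible way to put the pieces together is sketched below. Several things here are assumptions, because the question does not spell them out: the sketch assumes each line of DNA.txt is a name and a sequence separated by whitespace, and the output line layout `S(name1, name2) = value` is only illustrative. The class name `SimilarityRanker` and its method names are hypothetical.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SimilarityRanker {

    // Fraction of positions holding the same nucleic acid (N / L from the question).
    public static double similarity(String d1, String d2) {
        int matches = 0;
        for (int i = 0; i < d1.length(); i++) {
            if (d1.charAt(i) == d2.charAt(i)) matches++;
        }
        return (double) matches / d1.length();
    }

    // ASSUMPTION: each line is "<name> <sequence>"; the real DNA.txt format is not given.
    // Lines that do not have exactly two fields (for example blank lines) are skipped.
    public static Map<String, String> parse(List<String> lines) {
        Map<String, String> sequences = new LinkedHashMap<>();
        for (String line : lines) {
            String[] parts = line.trim().split("\\s+");
            if (parts.length == 2) sequences.put(parts[0], parts[1]);
        }
        return sequences;
    }

    // Names of the k other entries most similar to `name`, best match first.
    public static List<String> mostSimilar(Map<String, String> sequences, String name, int k) {
        String target = sequences.get(name);
        List<String> others = new ArrayList<>(sequences.keySet());
        others.remove(name);
        others.sort(Comparator.comparingDouble(
                (String other) -> similarity(target, sequences.get(other))).reversed());
        return new ArrayList<>(others.subList(0, Math.min(k, others.size())));
    }

    public static void main(String[] args) throws IOException {
        Path input = Paths.get("DNA.txt");
        if (!Files.exists(input)) {
            System.err.println("DNA.txt not found");
            return;
        }
        Map<String, String> sequences = parse(Files.readAllLines(input));
        List<String> out = new ArrayList<>();
        for (String name : sequences.keySet()) {
            for (String other : mostSimilar(sequences, name, 10)) {
                double s = similarity(sequences.get(name), sequences.get(other));
                // ASSUMPTION: this output layout is illustrative only.
                out.add("S(" + name + ", " + other + ") = " + s);
            }
        }
        Files.write(Paths.get("similarity.txt"), out);
    }
}
```

Sorting every other sequence for each entry is quadratic in the number of entries, which should be acceptable for a small input file; a bounded priority queue would be an alternative if the file were large.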
