Question: Preferably in Java In this assignment, you are going the implement the dynamic programming algorithm to compare two strings. In particular, you need to implement

Preferably in Java  Preferably in Java In this assignment, you are going the implement

In this assignment, you are going the implement the dynamic programming algorithm to compare two strings. In particular, you need to implement the follow two algorithms as taught in class: Longest common subsequence (LCS) Edit distance. Recall here you are given two strings A and B. You want to transform A to B using substitutions, insertions and deletions such that the number of such operations is minimized. Here, these edits operate on single positions in the strings. You are allowed to use any programming language you want to use. Here are more detailed information on the provided data. After de-compression, you should see these types of files. Description of data. There arc files with names like listSeqs-crrorxxx-lxxx.txt, which are generated with different error rates. Each file contains 10 sequences (each line for a sequence). These ten sequences are generated from a single consensus sequence (its length is specified in the filename) by adding some modifications (errors including changing of the characters, insertion of a new character or deleting a character in the consensus). I generated two types of sequences: those with low errors (i.e. these sequences match more the consensus sequences and should have smaller edit distance) and those with high errors (those sequences tend to look more different and thus should haw larger edit distance). Implementation Your program needs to both find the length of LCS and edit distance and also the LCS itself and the optimal edit operations (i.e inserting proper spaces to make the two strings match in as many places as you can). Recall in the class I described how to use trace back for this kind of tasks from the dynamic programming table. Analysis Run your program to find the edit distance of each pair of the 10 sequences stored in a single file (i.e. you only need to compare sequences that are contained in the same file). What to submit? Submit a short report containing the following. Settings Write down the language you use, the machine (its CPU frequency and memory size) you use for testing your program. Results Present the running time analysis on various sizes and types of sequences. How well does your program scale with the input size? Show the found the LCS and the optimal edit operations at least three pairs of sequences (which you consider to be representative). Also write down any observations you get by running your code on the test data. Conclusion Tell me what you learned by doing this assignment. Code Attach the source code of your implementation. If it, is short enough, you may simply include your code as part of your report

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!