Question:

Instructions

In this task you will implement a single-source-file program called DNA Analyser, which will perform text operations on a String representing a DNA sequence (details are given later). You do not need to remember high school biology for this task. All relevant information is included here.

In this task it is left to you to decide how the program should be broken down into methods (as a rough guide, a completed solution will have between 5 and 7 relatively small methods, including main). We will accept a broad range of solutions, not merely the sample solution we created when designing the task.

A description of the program and its functionality is given next, followed by a number of suggestions for how to implement some of the trickier functionality.

Background information about DNA

- A DNA sequence consists of a restricted alphabet of 'A', 'T', 'G' and 'C', which are known as bases. Your program will read a single piece of text representing such a sequence.
- Because DNA sequencing technology is imperfect, the sequence given to your program may include invalid characters (which your program will replace with a special character meaning 'unknown').
- DNA strands are actually made up of two complementary sequences, with strict rules about which bases appear with which other bases (A only ever pairs with T, G only ever pairs with C). Your program will be able to generate the complementary sequence to the one it is working with (more details are given later).
- A common operation on a DNA sequence is transcription, where a part of the sequence is converted into messenger RNA (mRNA), which is an intermediate step to translating the code into proteins (this second step is beyond your program). RNA, which is an older form of genetic material than DNA, uses the same alphabet as DNA except that T is replaced by U.

DNA Analyser

The program should implement the following high-level algorithm:

Program: DNA Analyser
Steps:
1 Display the program's name
2 Prompt the user to enter a DNA sequence (any non-whitespace characters are allowed)
3 Convert that String to upper case and replace all non-DNA characters with ?
4 Do
  4-1 Display the sequence with a suitable prefix (like 'Sequence: ')
  4-2 Display a menu of operations for working with that DNA sequence
5 While the user has not selected quit

After step 3, only the 'cleaned' sequence of characters will be used. For example, if the user typed 'GaxtT/ACa' then it will be displayed later as 'GA?TT?ACA'.

The available operations are:

- Display the sequence with its complement, which should output three lines:
  o the sequence;
  o a row of | (pipe character) the same length as the sequence (this is really just decorative/cute);
  o the sequence's complement, in which each A becomes T, T becomes A, G becomes C, C becomes G, and ? remains unchanged.
  For example, the output from this option for the above input would be:
  GA?TT?ACA
  |||||||||
  CT?AA?TGT
- Transcribe the entire sequence, which should display the mRNA equivalent of the entire sequence (i.e., the sequence with all Ts converted to Us). Given the example above, the program would display 'GA?UU?ACA'.
- Transcribe a section of the sequence. The user should be asked to give the start position (0-based) and length of the section to transcribe. Given the example above, if the user entered the start position 2 and length 5, the program would output '?UU?A'.
- Show possible repair, which will display a version of the sequence in which all ?s have been replaced by a valid, lower case DNA character (the lower case indicates uncertainty over its value). It should apply the following rules (which are certainly not the way a recorded sequence would be repaired in real science): find the first valid DNA character in the sequence, convert that to lower case and then replace all ? with that lower case character. There is advice below on how to achieve this using library methods, and you can assume that there will be at least one valid DNA character in the user's input.

Implementation advice

Cleaning the input

Strings have a replaceAll(String regex, String replacement) method that will replace all substrings that match the given regular expression (a pattern that can match a variety of String values) with the given replacement. As we don't cover regular expressions in the unit, here are some patterns that you will find useful:

- "[XYZ]" matches any single character from the set 'X', 'Y' or 'Z'
- "[^XYZ]" matches any single character that is not in the set 'X', 'Y' or 'Z'

You should not write a loop to clean the input text.

Displaying the row of |s

Remember your WordList program? Take a look at how you output an underline of characters and adapt it for this program.

Constructing the sequence's complement for display

This is possible with the use of several String methods, but is actually easier if you write a loop to either construct a new String from the current value or to print, one at a time, each complementary character. If taking the first approach, remember that, given a String variable s, initialised as "", s += "some text"; will append some text to the end of s.

Transcribing the sequence

This is most easily done using a String method (mentioned above) instead of a loop. Remember you only need to change one of the characters. There's another String method, discussed in the notes, that will help with transcribing a section. You will need to do a little bit of arithmetic with the start and length given by the user to determine the end point to give to that String method.

Implementing the 'repair' algorithm

There are two other String methods that can be useful in implementing this behaviour: one replaces Strings (not regular expressions) and so could be used to create a copy of the sequence with all ? removed (the first character of that copy will be the first valid DNA character in the original sequence); and another replaces individual characters. Remember to convert the valid DNA character you extract to lower case.
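The implementation advice above can be sketched as a handful of small helper methods. This is a minimal sketch, not the sample solution: the class name DnaSketch and every method name are hypothetical, the menu loop and user prompts are left out, and it assumes the String methods hinted at are replaceAll, replace and substring.

```java
// Hypothetical helper class; all names here are illustrative assumptions.
class DnaSketch {

    // Upper-case the input, then replace every character that is not
    // A, T, G or C with '?' using the "[^XYZ]" style of pattern.
    static String clean(String raw) {
        return raw.toUpperCase().replaceAll("[^ATGC]", "?");
    }

    // Build a row of '|' characters of the given length with a simple loop.
    static String pipes(int length) {
        String row = "";
        for (int i = 0; i < length; i++) {
            row += "|";
        }
        return row;
    }

    // Build the complement one character at a time:
    // A<->T, G<->C, anything else (i.e. '?') is copied unchanged.
    static String complement(String seq) {
        String result = "";
        for (int i = 0; i < seq.length(); i++) {
            char base = seq.charAt(i);
            switch (base) {
                case 'A': result += "T"; break;
                case 'T': result += "A"; break;
                case 'G': result += "C"; break;
                case 'C': result += "G"; break;
                default:  result += base; break;
            }
        }
        return result;
    }

    // Transcription only changes one character: every T becomes U.
    static String transcribe(String seq) {
        return seq.replace('T', 'U');
    }

    // substring takes an end index (exclusive), so the end point is start + length.
    static String transcribeSection(String seq, int start, int length) {
        return transcribe(seq.substring(start, start + length));
    }

    // Remove every '?' to find the first valid base, lower-case it,
    // then replace each '?' in the original with that character.
    // Assumes at least one valid base is present, as the task states.
    static String repair(String seq) {
        String validOnly = seq.replace("?", "");
        char first = Character.toLowerCase(validOnly.charAt(0));
        return seq.replace('?', first);
    }

    public static void main(String[] args) {
        String seq = clean("GaxtT/ACa");
        System.out.println("Sequence: " + seq);          // Sequence: GA?TT?ACA
        System.out.println(seq);
        System.out.println(pipes(seq.length()));
        System.out.println(complement(seq));             // CT?AA?TGT
        System.out.println(transcribe(seq));             // GA?UU?ACA
        System.out.println(transcribeSection(seq, 2, 5)); // ?UU?A
        System.out.println(repair(seq));                 // GAgTTgACA
    }
}
```

Note that the sketch does no range checking: substring throws a StringIndexOutOfBoundsException if the start and length given by the user fall outside the sequence, so a full program may want to validate them first.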
