Question: Your code: 1 . Create a function called load _ data. a . It takes as argument the name of the file to be used

Your code:

1 .

Create a function called load

_

data.

.

It takes as argument the name of the file to be used

(

a string

) .

.

It returns a data structure

(

or more than one

)

that contains all of the information from the input file.

2 .

Create a function called count

_

nucl

_

freq.

.

It takes as argument the data structure

(

)

generated by load

_

data.

.

It returns a new data structure

(

or more than one

)

that contains the frequencies of the nucleotides for each column in each sequence.

3 .

Create a function called find

_

consensus.

.

It takes as argument the data structure

(

)

generated by count

_

nucl

_

freq.

.

It returns a string; the consensus sequence.

4 .

Create a function named process

_

results.

.

It takes as arguments the data structure

(

)

created by count

_

nucl

_

freq and the name of the output file

(

a string

) .

.

It writes the results, in the format previously described, to the output file.

.

It doesn

t return anything.

Other Important information

1 .

Sample files are provided, but they are for testing purposes only. In other words, the sample DNAOutput.txt provided should be the result of executing your program with the sample file provided

(

DNAInput

.

fasta or DNAInput.txt

) .

Your program should be able to work with any FASTA file where all sequences are of the same length.

2 .

You should NOT prompt for the file name; you should ALWAYS try to open a file named DNAInput.txt and your output should ALWAYS be to a file named DNAOutput.txt

.

You may assume that:

Every combination of description

+

sequence takes up

2

lines

(1

line for each

) .

All sequences in the file have the same length. The exact length is not initially known; you may determine it from any of the sequences.

All nucleotides are in capital letters.

There will be no characters other than A

,

,

,

and G in the sequences.

There will be no ties for the most highly

-

occurring nucleotide in any column. This means that, when determining the consensus, there will be a single nucleotide that is the highest occuring.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Hi, what is the python solution to this? You are required to create a back-end service that will help capture basic information about prospective students who come to inquire here at Umuzi. In this...

plz do in python idle Task 1: Creating Simple Functions 1. Open a new Python file and name it "functions_intro.py". 2. Write a function called greet_user ( ) that prints the message "Hello, new...

problem2.py contents: # INSERT ALL FUNCTION IMPLEMENTATIONS HERE # Document all functions if __name__ == "__main__": print("Running the problem2 module (Bank Transactions problem)") cdict =...

In Python, Implement a function called customer dictionary with a single parameter balfilename, the name of the file containing current account balance information for the customers of the bank in...

Please solve in python Please inlclude all the conditions specified in the problem Complete the code based on the program below Problem 1 Create a function called histogram that takes as input a...

Using python creating the following functions specifying all the instructions and the provided script: assume 2 directories exists called, fake_data_files_1 and fake_data_files_2 appropreatly wherein...

Language: Python Program used: PyCharm Interpreter: 3.6.1 at ~/anaconda/bin/python ----- ----- Starter code: https://pastebin.com/NgrECZQ7 candidates_2016:...

Please do in java and put comments as im trying to understand what does what. thank you! This assignment asks you to write a collection of little functions that all operate on a class of array of...

Java coding package oopizza; /** * * @author */ public class OOPizza { /** * @param args the command line arguments */ public static void main(String[] args) { // TODO code application logic here...

Match the heat treatment process of the steels given in Group 1 with the microstructural feature given in Group 2: Group 1: P: Quenching, Q: Normalizing, R: Tempering, S: Austempering Group 2: 1:...

To treat a burn on your hand, you decide to place an ice cube on the burned skin. The mass of the ice cube is 19.5 g, and its initial temperature is -11.2 C. The water resulting from the melted ice...

Proficiency tests are based solely on the performance of an analyst. Group of answer choices True False

Kohler Corporation reports the following components of stockholders' equity on January 1. Common stock-$10 par value, 100,000 shares authorized, 40,000 shares issued and outstanding Paid-in capital...