Question: Stage 1 (max score 80): Write a python script that accepts two parameters on the command line. These will be two sequence similarity survey files

Stage 1 (max score 80): Write a python script that accepts

Stage 1 (max score 80): Write a python script that accepts two parameters on the command line. These will be two sequence similarity survey files comparing proteins from two organisms Survey 1: Organism 1 vs organism 2 Survey 2: Organism 2 vs organism 1 The script should read both survey files. From each, remove similarities that have an eval>0.01. Among the remaining similarities, select the ones with the lowest evalue for each query sequence- these are the best hits for each query sequence. Compare the best hits from each survey and find similarities where best hits are reciprocal. In other words, find best hits where the same proteins are the query and subject in one survey, and the subject and query in the other. Print out the pair of ids for each reciprocal best hit, one pair to a line, with organism 1 ids first and organism 2 ids second Examples (example outputs on compile, long commands are line wrapped, note the redirected output) Example 1 (H. pylori and E. coli): s python orthologs.py h pylori_vs_e_coli.tsv e_coli_vs_h_pylori.tsv> Example 2 (H. pylori and H. influenzae): orthologs_h_pylori_and_e_coli_stagel.txt python orthologs.py h_pylori_vs_h_influenzae.tsv h_influenzae_vs_h_pylori.tsv orthologs_h_pylori_and_h_influenzae_stagel.txt

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Computer Organization and Networks Practicals 2021/22 October 9, 2021 Computer Organization and Networks Practicals 2021/22 b68495714b Contents Contents 0 Introduction 3 0.1 Registration . . . . . ....

Introduction and learning objectives When you were learning about operational analysis earlier in the term, we talked about jobs that require multiple visits to the CPU (or servers) to receive their...

- DO NOT SEND CODE THAT HAS ALREADY BEEN POSTED ON CHEGG. THOSE ANSWERS DON'T WORK - DO NOT HAVE COMMENTS IN THE CODE - WRITE CODE IN JAVA Objectives This is one of two major programming projects...

Python and most Python libraries are free to download or use, though many users use Python through a paid service. Paid services help IT organizations manage the risks associated with the use of...

What are the biggest ah-ha! moments from Oracy Development? 6 English-Language Oracy Development Learning Outcomes After reading this chapter, you should be able to ... . Describe the basics of...

GRADUATE CERTIFICATE IN PROJECT MANAGEMENT PROJ5010: PROJECT PROCUREMENT AND STRATEGIC SOURCING. CASE STUDIES CONTENTS 1. Proj5010: The World Bank RFP Case Study covers 1. Assignment 1: Marks = 5 2....

I gat this C++ assignment, but I have no idea from where I should start, I would really appreciate if someone helps me out. this assignment should be written in C++. A. OBJECTIVES: Doing this...

Write the program that allows users to enter the file name to store the information of students then provides the menu to allow users to select the following tasks and only terminate the program when...

This project is 2 5 % of the final exam mark. You are to create a WCF Service and a client to consume this service. The client can be a console app ( max score 6 0 % ) , Windows Forms app ( max score...

Brain volume 1007 Listed below are the brain volumes (in cm" ) of twins and the corresponding IQ score. Construct a scatterplot, find the value of the linear correlation coefficient r, and find the...

Calculate the heat transfer for the process described in Problem 4.46.

If the opening in Problem 4 is in an in situ stress field such that h = 0.01v, is there a possibility of failure? Justify your answer.

Consider the following table: \ table [ [ , , Stock Fund,Scenario ] , [ Scobability , Rate of Return, \ table [ [ Bond Fund ] , [ Rate of Return ] ] , ] , [ Severe recession, 0 . 1 0 , - 3 8 % , - 1...

Below, atomic radius, crystal structure, electronegativity, and the most common valence are tabulated, for several metals. Please apply for conditions for substitutional solid solution (Hume -...

How has Departmental Computing increased the need for HCM Professionals and Technical Staff to be skilled in Business Computing Software and Systems?

Describe the difference between Two- and Three-Tier Computing Systems.

Explain the differences between On Premises, SaaS, PaaS, IaaS, and Hybrid Computing environments.