Question: Help me create a pipeline uisng python wrapper script. Make sure to have comments on each session functions, and important areas in the code. We

Help me create a pipeline uisng python wrapper script. Make sure to have comments on each session functions, and important areas in the code. We are comparing HCMV transcriptomes

2 -

and

6 -

days post infection as documents been converted into fastq files.

The pipeline should automate steps

2 - 5 (

Steps are written below

)

by running one wrapper Python script that can take either the full data or your sample test data as input. Your wrapper script can call other scripts, but the user should be able to run steps

2 - 5

with one command as directed in your README

(

I will be creating the README, but it will be great if you can write how to run the code

) .

Be explicit about how the user should run the test data.

Write a Python script to automate steps

2 - 5

and produce the log file requested named

PipelineProject

.

log

and other output files in a directory named

PipelineProject

__(

where you

ve used your first and last name

) .

ALL results generated by you or the programs called should be written to this directory. The easiest way to guarantee this is to create the directory using an os

.

system

()

call and then move into it via an os

.

chdir

()

call.

2 .

Which strains are most similar to these patient samples? To compare to other strains, you will assemble these transcriptome reads. We don

t expect assembly to produce the entire genome from transcripts, but enough to be useful in BLAST. Virus sequencing experiments often include host DNAs. It is difficult to isolate the RNA of just the virus

(

as it only transcribes during infection of the host cell

) .

Before assembly, let

s make sure our reads map to the HCMV genome. Using Bowtie

2,

create an index for HCMV

(

NCBI accession NC

_006273.2) .

Next, save only the reads

that map to the HCMV index for use in assembly. Write to your log file the number of reads in each transcriptome before and after the Bowtie

2

mapping. For instance, if I was looking at the Donor

1 (2

dpi

)

sample, I would write to

the log

(

numbers here are arbitrary

)

Donor

1 (2

dpi

)

had

230000

read pairs before Bowtie

2

filtering and

100000

read

pairs after.

3 .

Using the Bowtie

2

output reads, assemble all four transcriptomes together to produce

1

assembly via SPAdes.Write the SPAdes command you used to the log file.

4 .

Write Python code to calculate the number of contigs with a length

> 1000

and write the # to the log file as follows

(

replace # with the correct integer

)

There are # contigs

> 1000

bp in the assembly.

Write Python code to calculate the length of the assembly

(

the total number of bp in all of the contigs

> 1000

bp in length

)

and write this # to the log file as follows

(

replace # with the correct integer

)

There are # bp in the assembly.

5 .

Does your assembly align with other virus strains? Write Python code to retrieve the longest contig from your SPAdes assembly. Use the longest contig as blast

+

input to query the nr nucleotide database limited to members of the Betaherpesvirinae subfamily. Think, which blast should you use? You will need to make a local database of just sequences from the Betaherpesvirinae subfamily. Your blast

+

run should only keep the best alignment

(

HSP

)

for any

single query

-

subject pair of sequences. For the top

10

hits, write the following to your log file: Subject accession,Percent identity, Alignment length, Start of alignment in query, End of alignment in query, Start of alignment in subject, End of alignment in subject, Bit score, E

-

value, and Subject Title. Include the following header row in the log file, followed by the top

10

hits, and tab

-

delimit each item:

sacc pident length qstart qend sstart send bitscore evalue stitle

If you need anymore informaiton, let me know. I need this within the next

2

hours!

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

I need help on my python assignment. they are the same assignments broken into groups. > Project Two Guidelines and Rubric Competencies In this project, you will demonstrate your mastery of the...

Hi, Can you please help me with assignment, I am failing to create the train_nn function. Please advise how I can get data to you, my previous efforts have failed. Tensorflow_NeuralNetworkspdf May 1,...

CHA P TER 9 Understanding Software: A Primer for Managers 1. INTRODUCTION L E A R N I N G O B J E C T I V E S 1. Recognize the importance of software and its implications for the rm and strategic...

What are the biggest ah-ha! moments from Oracy Development? 6 English-Language Oracy Development Learning Outcomes After reading this chapter, you should be able to ... . Describe the basics of...

Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...

Find attached Ingredients: Water, MCC, Salt, Nicotine, pH regulator, sweeteners and flavours I am wondering why the decision to go with LYFT. I understand migrating to LYFT would be easier from sells...

What am I required to do in this assignment? Shared Power is an information system to help tradesmen share expensive and specialist tools rather than buying them themselves. Registered owners add...

Hello, I wrote this report and need them to be brought down to 3000 words (or a little more is ok- excluding the references). And I am just wondering which repetitive part I could get rid of to make...

Can someone help me with part (b) and (c)? Cwww.cs.fsu.edu/-jayarama/org2/Homeworks/Homework2.pdf Apps New Tab Online Derivative Ca M ITSDocs: Create, Cop In Unik, how do I list db vi/vim notesIn...

Objective Create your own business plan and make a presentation at the end of the semester. Use the template provided to develop your business plan. Write out your own business plan by using the...

A company sells personal computers for $2,300 each. The price includes a two-year warranty. During the current year, the company sells 400 computers. On the basis of the past experiences, the...

Eye Co., a shipping company headquartered and incorporated in State I, signed a contract with Kay Co., a company headquartered and incorporated in State K, to transport goods for Kay Co. from State K...

Nicki Minal purchased a horre in Morgan, Berland. She applied for and received a home loan from Bowit Building and laan Aswociation ( Bowie ) for the purpose of hully renowating her home. Bowie took...

Using the integral method please determine the order of reaction and the parameter K \begin{tabular}{|lccccccc|} \hlinet(min) & 0 & 50 & 100 & 150 & 200 & 250 & 300 \\ CA(mol/dm3) & 0.05 & 0.038 &...

Holding national saving constant, does an increase in net capital outflow increase, decrease, or have no effect on a countrys accumulation of domestic capital?

An article in USA Today (December 16, 2004) began President Bush said Wednesday that the White House will shore up the sliding dollar by working to cut record budget and trade deficits. a. According...

International trade in each of the following products has increased over time. Suggest some reasons this might be so. a. wheat b. banking services c. computer software d. automobiles