Previously we looked at proteins and genes identified to be correlated to Covid-19 severity from this...
Question:
Previously we looked at proteins and genes identified as correlated with Covid-19 severity in this study. The raw fastq data is deposited at GEO: GSE157103 (see if you can identify the BioProject ID from the GEO page). In this assignment we will download a few RNA-seq samples from the study and take a look at basic statistics.

First, we will obtain the sequencing run information with:

    esearch -db sra -query PRJNA660067 | efetch -format runinfo > runinfo.csv

Obtain the SRR run IDs with (take the first 9, for example):

    cut -d"," -f 1 runinfo.csv | head

1. Report at least 5 SRR run IDs for the fastq samples of this study.
2. Report the sex of these 5 samples based on the metadata provided here: SraRunTable.txt

If you use my server, the file can also be found here: /data/bio2023/Assignments/Lec09/SraRunTable.txt

You can use the command below to get the sex information from the 40th column (please use the command below to help, but you need to revise it to answer the questions):

    cut -d"," -f1,40 SraRunTable.txt

Second, we will download the fastq files for one of those samples with the command:

    fastq-dump -X 1000 --split-files SRR12544599

Look at the statistics and output of this sequencing run with the command.

3. Is this a paired-end library?

Third, run:

    seqkit stats SRR12544599_1.fastq

Please note:
(i) If you use your own environment, the environment in the classroom, or the biou20 image I provided, you need to run conda install -c bioconda seqkit to install the package first.
(ii) If you are using the environment I provide on my server, you do not need to install it.
(iii) Some of you use my server but created your own environment; in this case, you still need to install the packages yourself.

4. How many reads are in this sample?
5. What are the read lengths of this library?
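The question relies on a fixed column position (the 40th) to pull the sex field out of SraRunTable.txt. As a hedged alternative, the column can be located by its header name instead, which still works if the column order differs. The sketch below runs on a tiny made-up table; the header name `sex`, the file name `demo_table.csv`, and the run IDs are assumptions for illustration, not values from the study. It also assumes no field contains a quoted comma, which plain `cut` and `awk -F','` both mis-split.

```shell
# Build a tiny stand-in table (hypothetical values, NOT real study metadata).
cat > demo_table.csv <<'EOF'
Run,Assay Type,sex
RUN0001,RNA-Seq,female
RUN0002,RNA-Seq,male
EOF

# Find the column whose header equals "sex" (assumed header name),
# then print column 1 (the run ID) together with that column.
result=$(awk -F',' -v col="sex" '
  NR == 1 { for (i = 1; i <= NF; i++) if ($i == col) c = i; next }
  c       { print $1 "," $c }
' demo_table.csv)
echo "$result"
```

To apply the same idea to the real file, replace `demo_table.csv` with SraRunTable.txt and check its header line first (for example with `head -1`) to confirm the actual column name.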
Expert Answer:
For questions 4 and 5 you mentioned the need to analyze RNA-seq data and obtain read information and ...
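The read count and read lengths asked about in questions 4 and 5 can be cross-checked without seqkit. The following is a minimal sketch, not the course's method: it assumes a standard 4-line-per-record FASTQ (no wrapped sequence lines) and runs on a small made-up file, `demo_1.fastq`, rather than on data from SRR12544599.

```shell
# Tiny stand-in FASTQ (made-up reads, NOT data from the study).
cat > demo_1.fastq <<'EOF'
@read1
ACGTACGT
+
IIIIIIII
@read2
ACGTAC
+
IIIIII
@read3
ACGTACGTAC
+
IIIIIIIIII
EOF

# In a 4-line record the sequence is line 2; count records and track
# the shortest and longest sequence seen.
stats=$(awk 'NR % 4 == 2 {
    n++; len = length($0)
    if (min == "" || len < min) min = len
    if (len > max) max = len
  }
  END { print n, min, max }' demo_1.fastq)
echo "reads min_len max_len: $stats"
```

Two points are worth keeping in mind when reading the real output. Because the download used `-X 1000`, the count reflects only the first 1000 spots of the run, not the whole library. For question 3, `--split-files` writes each read of a spot to its own file, so the presence of both a `_1.fastq` and a `_2.fastq` file after the download is a quick indication of a paired-end library.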