Question: Please use R language or Python, to complete the code to generate a Manhattan plot for the following Genome Wide Association Studies assignment. (R language

Please use R language or Python, to complete the code to generate a Manhattan plot for the following Genome Wide Association Studies assignment. (R language proides a package called QQMan that makes the Manhattan Plot creation way easier.) I struggled with formatting my data properly. The link to download the specified datasets are here: https://files.fm/u/ktxb65pu

Please use R language or Python, to complete the code to generate

We wil visualize Genome-Wide Association Studies (GWAS) with the Manhattan plot for psychiatric disorders. You can download the two data sets of SNPs and phenotypes that contain genome data of two groups: psychiatric disorders (y-1) and control (y 0) at the course web page. In "SNP.csv", there are 37,853 SNPs of 130 samples. A value in the file indicates the numbers of minor allele on each SNP (i.e., x E f0,1,23). In "Phenotype.csv", zero indicates that the sample is a control, while one shows a psychiatric disorder (one of bipolar disorder, schizophrenia, and major depression) Compute p-values by using t-test (you can use any libraries for t-test). Perform t-test pairwise between a SNP and phenotype. I.?., you need to perform 37,853 t-tests and compute p-values. Then, make a Manhattan plot with bonferroni multiple testing correction (i.e., consider the p- value cutoff: 0.05/37,853). See Figure 1. Unfortunately, it does not show significant SNPs associated to psychiatric disorders 10000 20000 30000 SNPs Figure 1. Manhattan Plot We wil visualize Genome-Wide Association Studies (GWAS) with the Manhattan plot for psychiatric disorders. You can download the two data sets of SNPs and phenotypes that contain genome data of two groups: psychiatric disorders (y-1) and control (y 0) at the course web page. In "SNP.csv", there are 37,853 SNPs of 130 samples. A value in the file indicates the numbers of minor allele on each SNP (i.e., x E f0,1,23). In "Phenotype.csv", zero indicates that the sample is a control, while one shows a psychiatric disorder (one of bipolar disorder, schizophrenia, and major depression) Compute p-values by using t-test (you can use any libraries for t-test). Perform t-test pairwise between a SNP and phenotype. I.?., you need to perform 37,853 t-tests and compute p-values. Then, make a Manhattan plot with bonferroni multiple testing correction (i.e., consider the p- value cutoff: 0.05/37,853). See Figure 1. Unfortunately, it does not show significant SNPs associated to psychiatric disorders 10000 20000 30000 SNPs Figure 1. Manhattan Plot

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Python and most Python libraries are free to download or use, though many users use Python through a paid service. Paid services help IT organizations manage the risks associated with the use of...

Journal of Autism and Developmental Disorders, Vol. 32, No. 3, June 2002 ( 2002) Descriptive Epidemiology of Autism in a California Population: Who Is at Risk? Lisa A. Croen,1,3 Judith K. Grether,1...

CHA P TER 9 Understanding Software: A Primer for Managers 1. INTRODUCTION L E A R N I N G O B J E C T I V E S 1. Recognize the importance of software and its implications for the rm and strategic...

can someone solve this Modern workstations typically have memory systems that incorporate two or three levels of caching. Explain why they are designed like this. [4 marks] In order to investigate...

3 COLLEGE ALGEBRA - TRIGONOMETRY Business and Finance (MAT115) This course will start with a review of basic algebra (factoring, solving linear equations, and equalities, etc.) and proceed to a study...

i want complete solution for my assignment and it should be without plagiarism COIT20274: Information Systems for Business Professionals, Term One 2016 Assignments 1 & 2 Requirements Assignment 1 -...

GRADUATE CERTIFICATE IN PROJECT MANAGEMENT PROJ5010: PROJECT PROCUREMENT AND STRATEGIC SOURCING. CASE STUDIES CONTENTS 1. Proj5010: The World Bank RFP Case Study covers 1. Assignment 1: Marks = 5 2....

Module Case Study Information A Module Case Study is a critical analysis and evaluation of a specific case or subject. For this course a Module Case Study must: Be two pages in length, double-spaced....

Can a recourse debt of a partnership increase the basis of a limited partners partnership interest? Explain.

An insured becomes disabled on March 1 and submits a claim statement to the insurance company on March 31, while still disabled. The company fails to remit payment promptly and the insured decides to...

7 Multiple Oroice 0 . 3 7 5 points false true

You are a marketing manager who needs provide a summary of the marketing trends from the data shared (3 files). Based on the marketing data attached, which highlights the growth of the business, what...

Why is the System Build Process an iterative process?

What phase normally comes directly after the System Build process in a Project?

Name two other algorithms available in SSAS Data Mining other than Decision Trees.