Question: R Code only 2 variety plants ( VarX & VarY ) , each with 2 conditions ( stress and control ) var _ x _

R Code only

2

variety plants

(

VarX & VarY

),

each with

2

conditions

(

stress and control

)

var

_

_

all

< -

read

_

csv

('

all

_

VarX

_

TwoTimePoints.csv

')

Header: gene

_

name,VarXCRep

. 1,

VarX

1

Rep

. 1,

VarX

2

Rep

. 1,

VarXCRep

. 2,

VarX

3

Rep

. 2,

VarX

1

Rep

. 2,

VarX

2

Rep

. 2,

VarXCRep

. 3,

VarX

3

Rep

. 3,

VarX

1

Rep

. 3,

VarX

2

Rep

. 3

var

_

_

all

< -

read

_

csv

('

all

_

VarY

_

TwoTimePoints.csv

')

Each variety has differentially expressed genes

(

DEGs

)

var

_

_

degs

< -

read

_

csv

('

Leaf

_

DEGs

_

VarX.csv

')

Header: gene

_

name, log

2

FoldChange, padj, Athaliana

_

gene

_

nameID, Gene

_

Function, VarXCRep

. 1,

VarXCRep

. 2,

VarXCRep

. 3,

VarX

1

Rep

. 1,

VarX

1

Rep

. 2,

VarX

1

Rep

. 3

var

_

_

degs

< -

read

_

csv

('

Leaf

_

DEGs

_

VarY.csv

')

#INVESTIGATE THE DISTRIBUTION OF EXPRESSION VALUES FOR ALL GENES IN EACH SAMPLE

(

Variety X

) .

var

_

_

all.long

< -

pivot

_

longer

(

var

_

_

all, cols

=

VarXCRep

. 1

:VarX

1

Rep

. 3,

names

_

=

"sample", values

_

=

"expression"

)

var

_

_

plot

< -

ggplot

(

var

_

_

all.long, aes

(

=

sample, y

=

expression

)) +

geom

_

boxplot

() +

DO SAME for VarY

#INVESTIGATE THE DISTRIBUTION OF EXPRESSION VALUES FOR THE DEGs IN EACH SAMPLE

(

Variety X

) .

var

_

_

degs.long

< -

pivot

_

longer

(

var

_

_

degs,cols

=

VarXCRep

. 1

:VarX

1

Rep

. 3,

names

_

=

"sample", values

_

=

"expression"

))

var

_

_

degs

_

plot

< -

ggplot

(

var

_

_

degs.long, aes

(

=

sample, y

=

expression

)) +

geom

_

boxplot

()

DO SAME for VarY

#HOW MANY DIFFERENTIALLY EXPRESSED GENES ARE THERE IN EACH VARIETY?

var

_

_

dup

< -

CODE HERE

var

_

_

dup

< -

CODE HERE

#INVESTIGATE IF THE SAME OR DIFFERENT GENES ARE DIFFERENTIALLY EXPRESSED IN THE TWO VARIETIES. Create a suitable plot to look at the overlap in the DEGs between the two Varieties. CODE HERE

#SEPARATE OUT THE UP

-

AND DOWN

-

REGULATED DEGs

(

BETWEEN STRESS AND CONTROL CONDITION

) .

By looking at

`

var

_

_

degs

`

and

`

var

_

_

degs

`

data frames, you can see that some genes have a positive log

2

fold change and others have a negative log

2

fold change.

*

Create a data frame called

`

var

_

_

degs.up

`

containing only genes that are upregulated in Stress Treatment compared to control in Variety X

.

CODE HERE

*

Create a data frame called

`

var

_

_

degs.down

`

containing only genes that are downregulated in Stress Treatment compared to control in Variety X

.

*

Same for VarY. CODE HERE

#INVESTIGATE THE FOLD CHANGE IN GENE EXPRESSION FOR THE DEGs, BETWEEN STRESS AND CONTROL CONDITION.

*

Create a box plot to show the distribution of log

2

fold change for all DEGs by variety. Hint: the base R boxplot

()

command and the abs

()

function could be helpful here.

*

Create a box plot to show the distribution of log

2

fold change for upregulated DEGs by variety. Hint: the base R boxplot

()

command could be helpful here.

*

Create a box plot to show the distribution of log

2

fold change for downregulated DEGs by variety. Hint: the base R boxplot

()

command could be helpful here.

#INVESTIGATE THE FUNCTIONS OF THE DIFFERENTIALLY EXPRESSED

(

UPREGULATED

)

GENES WITH THE LOWEST FOLD CHANGE

*

Find out the function of the bottom most upregulated gene in Variety X

(

lowest fold change

)

and assign the result to variable called

`

bottom

_

gene.x

` .

*

Find out the function of the bottom most upregulated gene in Variety Y

(

lowest fold change

)

and assign the result to variable called

`

bottom

_

gene.y

` .

#INVESTIGATE THE BEHAVIOUR OF THE BIOLOGICAL REPLICATES FOR THE DEGs in Variety X IN THE TREATMENT TIME POINT.

*

Create a set of scatterplots to visually inspect how well the different replicates agree

/

correlate for the DEGs in Variety X in the treatment time point.

#INVESTIGATE THE BEHAVIOUR OF THE BIOLOGICAL REPLICATES FOR THE DEGs in Variety X IN THE CONTROL TIME POINT.

*

Create a set of scatterplots to visually inspect how well the different replicates agree

/

correlate for the DEGs in Variety X in the control time point.

#COMPARE THE MEAN EXPRESSION IN TREATMENT VERSUS CONTROL REPLICATES FOR EACH DEG.

*

Modify your data frame

`

var

_

_

degs

`

to include two new

(

additional

)

columns as follows:

*

The first new column should be named

`

control

_

mean

`

and contain the mean expression value for the three control replicates.

*

The second new column should be named

`

stress

_

mean

`

and contain the mean expression value for the three stress treatment replicates.

#PRIORITISE GENES OF INTEREST FOR FURTHER INVESTIGATION.

*

Create a data frame called

`

var

_

_

degs.up

.

big

`

containing only genes in Variety y that are upregulated in Stress Treatment compared to control, have at least an

2

fold absolute change in expression and have a p value less than

1

- 06 . *

Hint: remember you are dealing with log

2

fold change.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

A study is interested in assessing whether there is a linear relationship between the distance traveled with a gallon of fuel and the weight of cars. The preloaded data set mtcars includes fuel...

Managerial Decision Making Six Decision Stages in Chapter 5 I. Identify and Diagnose the Problem Consider the following questions when identifying and diagnosing the problem: Is there a difference...

View the 2013 Annual Report for the Ford Motor Company, a Fortune 50 company, linked here as well as on the Course Information page. Using this report, answer the following questions: Does the...

On the profitability side, you should specifically address the following points: a) Is the cost structure of the two companies similar or different? Does the cost structure reflect their respective...

I'm trying to develop Pro Forma financial statements (Balance Sheet and Income Statement) for the next two fiscal years, assuming a 10% growth rate in sales and Cost of Goods Sold (COGS) for each of...

a. Example 13-1: Batch Reactor with an Exothermic Reaction Wolfram 1. Adiabatic Case: Use Wolfram to see whether you can find a trajectory that is ready to ignite and whose trajectory looks like a...

Chapter 7 from Mastering Strategic Management was adapted by The Saylor Foundation under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 license without attribution as requested by the...

Read Classroom Glimpse. Discuss stress, rhythm, pitch, and intonation based on the tale in the classroom 2 Language Structure and Use Learning Outcomes After reading this chapter, you should be able...

subject: Differential Equations pls read instructions do not use ai. drop all references and link Instructions ODE application. - find an article related to ODE application - provide a short...

Hello, I need some assistance with this project. I have everything completed except the third and fourth question (located in the instruction project document. 3.)You are a security analyst...

A firm plans to sell 1,000 units of its only product at a price of $20 per unit, with variable manufacturing costs of $8 per unit and fixed manufacturing costs of $2 per unit. Expected SGA costs were...

When university employees are aclassified into five classes in terms of their departments, this scale is considered a interval scale bratio scale c nominal scale od ordinal scale

Budgeting for production 0 . . e , units to ? t o be produced in an upcoming budget periody Multiple Cholce Involves the sales budget and both begining and ending frished goods inventory amounts. Is...

Journal entries recorded at the end of each accounting period to prepare the revenue, expense, and withdrawals accounts for the upcoming year and to update the owner's capital account for the events...