Use R language code and English interpretation to answer all questions Clustering Stock Returns When building portfolios of stocks, investors seek to obtain good returns while limiting the variability in those returns over time This can be achieved by selecting stocks that show different patterns of returns In this question, we will use clustering to identify clusters of stocks that have similar returns over time an investor would select a diverse portfolio by selecting stocks from different clusters For this question, we will use the dataset NasdaqReturns csv , which contains monthly stock returns from the NASDAQ stock exchange during 2 0 0 0 2 0 0 9 The companies selected in this dataset are limited to those that were listed on the stock exchange for this entire time period and whose stock price never fell below $ 1 The NASDAQ is the second largest stock exchange in the world, and it lists many technology companies The variables in the dataset are described in Table 2 Table 2 Variables in the dataset NasdaqReturns csv Variable StockSymbol Industry SubIndustry Ret 2 0 0 0 0 1 Ret 2 0 0 9 1 2 1 Let us start by exploring the dataset ( a ) How many companies are there in this dataset ( 2 points ) How many companies are there in each of the industries ( 2 points ) ( b ) In the aftermath of the dot com bubble bursting in the early 2 0 0 0 s , the NASDAQ was quite tumultuous In December 2 0 0 0 , how many stocks in this dataset saw their value increase by 1 0 ( including 1 0 ) or more ( 2 points ) Decrease by 1 0 ( including 1 0 ) or more ( 2 points ) ( c ) Entering the Great Recession, most stocks lost significant value, but some sectors were hit harder than others In October 2 0 0 8 , which 3 industries had the worst average return ( 3 points ) 2 Let us now cluster the stocks according to the monthly returns For the remainder of this question, make sure that you are just clustering the observations based on the variables Ret 2 0 0 0 0 1 Ret 2 0 0 9 1 2 ( i e , StockSymbol, Industry, and SubIndustry should not be used to cluster the observations ) ( 2 points ) ( Hint You can do this by creating a new data frame without irrelevant variables using the function within ( ) we learned in the lecture Model selection ) ( a ) In this analysis, we will not normalize our data prior to clustering Why is this a valid approach for this question and dataset ( 3 points ) ( b ) Cluster the data using Hierarchical clustering ( 2 points ) Clearly indicate which distance metrics you used for point distances and cluster distances ( 2 points ) Plot the resulting dendrogram ( 2 points ) What do you think are reasonable choices for the number of clusters to select, based on the dendrogram ( 3 points ) A further consideration for the stock selection problem is that we should include enough stocks to create our well diversified portfolio Based on the dendrogram and this specific concern, select a number of clusters to use for the rest of the question, and justify your choice ( 3 points ) ( c ) Extract cluster assignments from your hierarchical clustering model, using the number of clusters you selected in ( b ) ( 2 points ) Describe each cluster, using the number of observations in the cluster ( 3 points ) , the most common industry of the companies in the cluster ( 3 points ) , and the most common subindustry of the companies in the cluster ( 3 points ) ( Hint Since we never changed the order of the observations, you can create a data frame including the number of observations in each industry subindustry that is counted by the function table ( ) ( recall what you learned in the 3 rd tutorial ) You can then use the order ( ) function to sort this data frame in the order of frequency ) ( d ) For some months, we expect there to be significant differences between the returns of stocks in different clusters For February 2 0 0 0 , do some clusters have negative average returns while other clusters have positive average returns ( 2 points ) How about for March 2 0 0 0 ( 2 points ) ( e ) Now run the K means clustering algorithm on this data ( when clustering, only use the variables Ret 2 0 0 0 0 1 Ret 2 0 0 9 1 2 ) You should select the same number of clusters that you used for Hierarchical clustering ( 3 points ) Extract cluster assignments from your K means clustering model, and compare them to the Hierarchical cluster assignments by common industries ( 3 points ) Open ended question Are there any similar clusters ( 1 point )

The Answer is in the image, click to view ...

Question: Use R language code and English interpretation to answer all questions: Clustering Stock Returns When building portfolios of stocks, investors seek to obtain good returns

Use R language code and English interpretation to answer all questions: Clustering Stock Returns

When building portfolios of stocks, investors seek to obtain good returns while limiting the variability in those returns over time. This can be achieved by selecting stocks that show different patterns of returns. In this question, we will use clustering to identify clusters of stocks that have similar returns over time; an investor would select a diverse portfolio by selecting stocks from different clusters.

For this question, we will use the dataset NasdaqReturns.csv

,

which contains monthly stock returns from the NASDAQ stock exchange during

2000 - 2009 .

The companies selected in this dataset are limited to those that were listed on the stock exchange for this entire time period and whose stock price never fell below $

1 .

The NASDAQ is the second

-

largest stock exchange in the world, and it lists many technology companies. The variables in the dataset are described in Table

2 .

Table

2

: Variables in the dataset NasdaqReturns.csv

Variable: StockSymbol

/

Industry

/

SubIndustry

/

Ret

2000.01 -

Ret

2009.12

1 .

Let us start by exploring the dataset.

(

)

How many companies are there in this dataset?

(2

points

)

How many companies are there in each of the industries?

(2

points

)

(

)

In the aftermath of the dot

-

com bubble bursting in the early

2000

,

the NASDAQ was quite tumultuous. In December

2000,

how many stocks in this dataset saw their value increase by

10 % (

including

10 %)

or more?

(2

points

)

Decrease by

10 % (

including

- 10 %)

or more?

(2

points

)

(

)

Entering the Great Recession, most stocks lost significant value, but some sectors were hit harder than others. In October

2008,

which

3

industries had the worst average return?

(3

points

)

2 .

Let us now cluster the stocks according to the monthly returns. For the remainder of this question, make sure that you are just clustering the observations based on the variables Ret

2000.01 -

Ret

2009.12 (

.

.,

StockSymbol, Industry, and SubIndustry should not be used to cluster the observations

) . (2

points

)

(

Hint: You can do this by creating a new data frame without irrelevant variables using the function within

()

we learned in the lecture

Model selection

.)

(

)

In this analysis, we will not normalize our data prior to clustering. Why is this a valid approach for this question and dataset?

(3

points

)

(

)

Cluster the data using Hierarchical clustering.

(2

points

)

Clearly indicate which distance metrics you used for point distances and cluster distances.

(2

points

)

Plot the resulting dendrogram.

(2

points

)

What do you think are reasonable choices for the number of clusters to select, based on the dendrogram?

(3

points

)

A further consideration for the stock selection problem is that we should include enough stocks to create our well

-

diversified portfolio. Based on the dendrogram and this specific concern, select a number of clusters to use for the rest of the question, and justify your choice.

(3

points

)

(

)

Extract cluster assignments from your hierarchical clustering model, using the number of clusters you selected in

(

) . (2

points

)

Describe each cluster, using the number of observations in the cluster

(3

points

),

the most common industry of the companies in the cluster

(3

points

),

and the most common subindustry of the companies in the cluster

(3

points

) .

(

Hint: Since we never changed the order of the observations, you can create a data frame including the number of observations in each industry

/

subindustry that is counted by the function table

() (

recall what you learned in the

3

rd tutorial

) .

You can then use the order

()

function to sort this data frame in the order of frequency.

)

(

)

For some months, we expect there to be significant differences between the returns of stocks in different clusters. For February

2000,

do some clusters have negative average returns while other clusters have positive average returns?

(2

points

)

How about for March

2000 ? (2

points

)

(

)

Now run the K

-

means clustering algorithm on this data

(

when clustering, only use the variables Ret

2000.01 -

Ret

2009.12) .

You should select the same number of clusters that you used for Hierarchical clustering.

(3

points

)

Extract cluster assignments from your K

-

means clustering model, and compare them to the Hierarchical cluster assignments by common industries.

(3

points

)

Open

-

ended question: Are there any similar clusters?

(1

point

)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Clustering Stock Returns USE EXCEL AND SHOW STEP BY STEP PLEASE!! When building portfolios of stocks, investors seek to obtain good returns while limiting the variability of those returns over time....

I need help with a question regarding the attached document, Question: The author proposes several lessons about market efficiency that we can learn from the financial crisis. One is that there are...

Hi, I have an Assignment for my Finance Subject. I have attached the necessary documentation here for you to view including the Lecture slides of all the Topics covered for this assignment. Please...

Question #1: The author gives several reasons why the blame cast on the EMH for the global financial crisis is unfounded. Which one makes the most sense to you? Why? Which argument do you disagree...

What are the implications for corporate financial managers of the EMH as articulated and defended by the author? Frame you answer around implications for issues such as capital raising, dividend...

I have to do an accounting assignment. I have attached the instructions, the questions, and pdf's containing the information requested for the questions. Company Estimates & Opinions Sanmina Corp...

Hello. I need to write a three page review for the two articles attached. Each review should include a brief summary that include the major points of the article, as well as a summary of my own...

You helped me a couple of weeks ago. I was wondering if i could setup a time for tomorrow that you could be available to help me with some questions. I will pay $20 per question. They are short and I...

Please help me to answer the following file ............................................................................................... INVESTMENT AND PORTFOLIO MANAGEMENT COURSE ASSESSMENT 1...

Please help me to answer the attached files ................................... INVESTMENT AND PORTFOLIO MANAGEMENT COURSE ASSESSMENT 1 Submission deadline without penalties is 12 October 2017....

The following data represent a dependent variable and four independent variables: a. Use the standard stepwise regression to produce an estimate of a multiple regression model to predict y. Use 0.15...

Figure P12.56 shows a truss that supports a downward force of 1 000 N applied at the point B. The truss has negligible weight. The piers at A and C are smooth. (a) Apply the conditions of equilibrium...

Which one of the following statements related to unexpected gains and losses is not correct?O a . Asset gains occur when the actual return is greater than the expected return b . Asset gains and...

tell me more about LEAN