UNIT5:TEAMWORK AND CORRELATION For the Unit 5 discussion, you will expand your working knowledge of Excel and
Question:
UNIT5:TEAMWORK AND CORRELATION
For the Unit 5 discussion, you will expand your working knowledge of Excel and explore the concept of correlation versus causation. You will use Excel to complete this discussion post.
First, download the sample of 100 housing units that were sold in Melbourne between 2016 and 2017. Please note the following:
- Price is given in Australian Dollars.
- Date is the date of the sale.
- Days is the number of days between 1 January, 2016 and the date of the sale.
- Distance is given by kilometers from the city center.
- Postcode is the postal code.
- Bedroom is the number of bedrooms in the house.
- Bathroom is the number of bathrooms in the house.
- Car is the number of cars that can fit in the garage.
- Landsize is the size of the lot in square metres.
- BuildingArea is the area of the home in square metres.
- Yearbuilt is the year in which the home was built.
POST 1: INITIAL RESPONSE
Perform the first task that is available.
Task 1: Answer the following questions:
- Choose 2 quantitative variables that each have at least 10 different values. Which 2 quantitative variables from the dataset will your team use for the assignment?
- Why did you select these 2 variables?
- Do you believe there is likely to be a relationship between these variables?
o If so, what evidence supports your position?
o If not, why not?
Task 2: Using the 2 variables listed in task 1, determine the independent and dependent variable.
- Which of the variables listed in task 1 will be used for the independent (x) variable.
o Why did you select this variable to be independent?
o How might this variable impact the dependent (y) variable?
- Do you believe there is likely to be a relationship between these variables?
o If so, what evidence supports your position?
o If not, why not?
Task 3: Using the independent and dependent variables from task 2, generate a scatterplot.
- Post a picture of the scatterplot.
- Comment on the data points shown on the scatterplot:
o Do the points appear to have a linear relationship?
o Are the points widely scattered or closely clustered?
- Do you believe there is likely to be a relationship between these variables?
o If so, what evidence supports your position?
o If not, why not?
Task 4: Using the variables provided in task 2 or the scatterplot provided in task 3, find the correlation coefficient.
- How would you characterize the correlation?
- Do you believe there is a relationship between these variables?
o If so, what evidence supports your position?
o If not, why not?
Task 5: Using the 2 variables listed in task 2, reverse the independent and dependent variables and generate a new scatterplot.
- Post a picture of the new scatterplot.
- Compare the new scatterplot to the one in Task 3:
o Does there appear to be a change in the linear relationship?
o Are the points more or less widely scattered or closely clustered?
- Do you believe there is likely to be a relationship between these variables?
o If so, what evidence supports your position?
o If not, why not?
Task 6: Using the variables or scatterplot provided in task 5, find the new correlation coefficient.
- How would you characterize the correlation?
- How does the correlation compare to the one in Task 4?
- Do you believe there is a relationship between these variables?
o If so, what evidence supports your position?
o If not, why not?