Question: In this assignment, you will be working with a dataset on housing prices. The dataset contains information about various houses, including their location, size, and price. Your task is to read the dataset, clean it, and perform some basic data analysis.
The dataset is: Housing.csv
1. Read the file Housing.csv in RStudio, making sure strings are read as factors. Save it to a data frame called housing_data. Print the first 10 observations to verify that the file was read correctly.
2. Overwrite the housing_data dataset with ONLY the following variables (columns): price, area, bedrooms, stories, basement, furnishingstatus.
3. Determine the variable type (int, num, or factor) of each variable (column) in the new data frame.
4. Generate summary statistics for all variables of housing_data.
5. Create bar charts of ALL the variables that are of class factor. Describe what you see (i.e., which level is most common).
6. Create histograms of the variables price and area. Describe what you see (i.e., the shape of the distributions).
7. Create a scatter plot with price on the Y-axis and area on the X-axis. Describe what you see in the relationship.
8. Subset the data into a data frame called housing_basement that includes only homes with a basement (that is, with values for the variable basement equal to "yes"). Use the head() function to print the first few lines of the resulting data frame.
9. Create a pie chart for the variable furnishingstatus on the housing_basement dataset.

Step by Step Solution
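
One possible way to work through the tasks in base R is sketched below. It is not an official solution. It assumes Housing.csv is in the working directory and contains the column names given in the question. The fallback data frame (including its parking column and its furnishingstatus levels) is a made-up stand-in, used only so the script runs when the file is absent; its values are not from the real dataset.

```r
# Read the data with strings as factors.
# Assumption: Housing.csv is in the working directory.
if (file.exists("Housing.csv")) {
  housing_data <- read.csv("Housing.csv", stringsAsFactors = TRUE)
} else {
  # Hypothetical stand-in rows, for illustration only (not real data).
  housing_data <- data.frame(
    price = c(13300000, 12250000, 9100000, 6650000, 4200000, 3500000),
    area = c(7420, 8960, 6000, 6420, 3500, 3000),
    bedrooms = c(4L, 4L, 3L, 3L, 2L, 2L),
    stories = c(3L, 4L, 2L, 2L, 1L, 1L),
    parking = c(2L, 3L, 1L, 0L, 0L, 1L),
    basement = c("no", "no", "yes", "yes", "yes", "no"),
    furnishingstatus = c("furnished", "furnished", "semi-furnished",
                         "unfurnished", "semi-furnished", "unfurnished"),
    stringsAsFactors = TRUE
  )
}

# Print the first 10 observations to verify the read.
print(head(housing_data, 10))

# Keep only the six requested columns, overwriting housing_data.
keep_cols <- c("price", "area", "bedrooms", "stories",
               "basement", "furnishingstatus")
housing_data <- housing_data[, keep_cols]

# Variable types (int, num, or factor) and summary statistics.
str(housing_data)
print(summary(housing_data))

# Bar chart for every column of class factor.
factor_cols <- names(housing_data)[sapply(housing_data, is.factor)]
for (col in factor_cols) {
  barplot(table(housing_data[[col]]),
          main = paste("Count by", col),
          xlab = col, ylab = "Number of homes")
}

# Histograms of price and area.
hist(housing_data$price, main = "Histogram of price", xlab = "price")
hist(housing_data$area, main = "Histogram of area", xlab = "area")

# Scatter plot with area on the X-axis and price on the Y-axis.
plot(housing_data$area, housing_data$price,
     main = "price vs. area", xlab = "area", ylab = "price")

# Keep only homes whose basement value is "yes".
housing_basement <- subset(housing_data, basement == "yes")
print(head(housing_basement))

# Pie chart of furnishingstatus among homes with a basement.
pie(table(housing_basement$furnishingstatus),
    main = "furnishingstatus (homes with basement)")
```

The written descriptions the question asks for have to come from the output on the real file: the tallest bar in each bar chart gives the most common level, the histograms show whether price and area are symmetric or skewed, and the scatter plot shows whether price tends to rise with area. When run non-interactively with Rscript, the plots are written to Rplots.pdf rather than shown in a plot pane.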
