Question: 1 ) Run specified R code lines using the ggplot 2 package. Choose the txhousing database for analysis. Type library ( ggplot 2 ) data

1) Run specified R code lines using the ggplot2 package. Choose the "txhousing" database for analysis.
Type
library(ggplot2)
data(package =ggplot2) txhousing
1) Create a dataframe and a tibble with information on the "txhousing" database.
2) Describe the number of variables, observations, variable names, and definitions.
3) Explain why the data is not in an ideal tidy format and perform required steps to
tidy it.
1
4) Check for missing information or duplicates and explain whether these values provide additional information or whether they should be dropped from the database.
5) Create a new column, "unsold_percentage," measuring the percentage of unsold properties each month.
6) Create the average value of variables per city, named "mean_unsold_percentage."
7) Keep this new variable and city, drop duplicates values if any, and export the data in excel file named yourlastname_HW1

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!