Question: use in R - Data Exploration and Cleaning Use the provided dataset to answer this section. You are given access to 31 variables that are
use in R - Data Exploration and Cleaning Use the provided dataset to answer this section. You are given access to 31 variables that are directly related to property sales from the above-mentioned dataset. Most of these variables are similar to the type of information that an assessor will use to evaluate and assess the price of a property (e.g., when was it built? How big is the lot? What is the size of the living room? Is the basement developed and completed? Number of bathrooms?). You need to answer the following questions with evidence and justifications. 1. a. Which variables are continuous/numerical? Which are ordinal? Which are nominal? b. Whatarethemethodsfortransformingcategoricalvariables? c. Carryoutanddemonstratedatatransformationwherenecessary. 2. a. Calculate the summary statistics: mean, median, max and standard deviation for each of the continuous variables, and count for each categorical variable. b. Isthereanyevidenceofextremevalues?Brieflydiscuss. 3. Plot histograms for each of the continuous variables and create summary statistics. Based on the histogram and summary statistics answer the following and provide brief explanations: a. Whichvariableshavethelargestvariability? b. Whichvariablesseemskewed? c. Arethereanyvaluesthatseemextreme? BUS5PA Predictive Analytics - 2024 4. a. b. Whatarethemethodsofhandlingmissingvalues? c. Apply the 3 methods of missing values and demonstrate the output (summary statistics and transformation plot) for each method in
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
