Question:
Explore the data using the data visualization capabilities of R. Which of the pairs among the variables
seem to be correlated? We plan to analyze the data using various data mining techniques you've
learned about this week. You need to prepare the data for use as follows:
Step 1:
The dataset has two categorical attributes, Fuel Type and Metallic.
Describe how you would convert these to binary variables.
Confirm this using R's functions to transform categorical data into dummies.
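One way to sketch Step 1 in base R uses `model.matrix()`, which expands factors into 0/1 indicator columns. The data frame below is a hypothetical stand-in: the column names `Price`, `Fuel_Type`, and `Metallic`, and all the values, are assumptions for illustration and are not taken from the actual dataset. A two-level attribute such as Metallic needs only one binary column, and an attribute with m levels needs m - 1 columns once one level is treated as the baseline.

```r
# Hypothetical toy data; column names and values are assumptions.
cars <- data.frame(
  Price     = c(13500, 13750, 13950, 14950, 16900, 12950),
  Fuel_Type = factor(c("Diesel", "Petrol", "CNG", "Petrol", "Diesel", "Petrol")),
  Metallic  = factor(c("Yes", "No", "Yes", "No", "No", "Yes"))
)

# model.matrix() turns each factor into 0/1 indicator columns.
# With an intercept in the formula, the first level of each factor
# (here "CNG" and "No") is dropped and serves as the baseline.
dummies <- model.matrix(~ Fuel_Type + Metallic, data = cars)[, -1]

# Combine the numeric column with the new dummy columns.
cars_dummies <- cbind(cars["Price"], as.data.frame(dummies))
print(cars_dummies)
```

In this sketch a row with 0 in both `Fuel_TypeDiesel` and `Fuel_TypePetrol` is a CNG car, so no information is lost by dropping the baseline column.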
Step 2:
Prepare the dataset (as factored into dummies) for data mining techniques of supervised learning by
creating partitions in R.
Select all the variables and use default values for the random seed and partitioning percentages for
training (50%), validation (30%), and test (20%) sets.
Describe the roles that these partitions will play in modeling.
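The 50% / 30% / 20% split in Step 2 can be sketched in base R by shuffling the row indices once and cutting them into three blocks. The seed value, the data frame `df`, and its columns are assumptions made for illustration; the question's "default" seed is not stated, so an arbitrary one is used here.

```r
set.seed(1)  # arbitrary seed for reproducibility; an assumption, not the course default

# Hypothetical stand-in for the dummy-coded dataset.
n  <- 100
df <- data.frame(id = seq_len(n), x = rnorm(n))

# Shuffle the row indices, then cut them into 50% / 30% / 20% blocks.
idx     <- sample(n)
n_train <- floor(0.5 * n)
n_valid <- floor(0.3 * n)

train <- df[idx[1:n_train], ]
valid <- df[idx[(n_train + 1):(n_train + n_valid)], ]
test  <- df[idx[(n_train + n_valid + 1):n], ]

# Report the size of each partition.
print(c(train = nrow(train), valid = nrow(valid), test = nrow(test)))
```

In the usual supervised-learning workflow, the training partition is used to fit the models, the validation partition to compare and tune them, and the test partition to estimate how the chosen model performs on data it has not seen.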
