Question: Extract three archival data sets from a reputable data repository, such as Kaggle or UCI machine learning data repository. Each of the dataset should be
Extract three archival data sets from a reputable data repository, such as Kaggle or UCI machine learning data repository. Each of the dataset should be in relation to classification target or dependent variable is discrete regression target is continuous and clustering no target
The following tasks are expected to be addressed by each group:
Describe each dataset wrt the data source, features, missing values, etc.
Discuss what kind of problems each of the dataset can be used to address.
Discuss how you will preprocess each dataset. Justify, based on literature, the preprocessing techniques that will be considered.
For each of the data set, which machine learning techniques will be used to setup the prediction or clustering models. Justify, based on literature, the machine learning techniques that will be considered.
Discuss how you will go about the training, validation and testing for each dataset.
Prepare a PowerPoint presentation summary for this assignment and note that you will be presenting it during class session.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
