Question: Extract three archival data sets from a reputable data repository, such as Kaggle or UCI machine learning data repository. Each of the dataset should be

Extract three archival data sets from a reputable data repository, such as Kaggle or UCI machine learning data repository. Each of the dataset should be in relation to classification

(

target or dependent variable is discrete

),

regression

(

target is continuous

),

and clustering

(

no target

) .

The following tasks are expected to be addressed by each group:

1 .

Describe each dataset wrt the data source, features, missing values, etc.

2 .

Discuss what kind of problem

(

)

each of the dataset can be used to address.

3 .

Discuss how you will preprocess each dataset. Justify, based on literature, the preprocessing techniques that will be considered.

4 .

For each of the data set, which machine learning technique

(

)

will be used to setup the prediction or clustering models. Justify, based on literature, the machine learning techniques that will be considered.

5 .

Discuss how you will go about the training, validation and testing for each dataset.

6 .

Prepare a PowerPoint presentation summary for this assignment and note that you will be presenting it during class session.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Extract these three archival data sets from a reputable data repository, such as Kaggle or UCI machine learning data repository. classification ( Iris Dataset ) , regression ( housing dataset ) , and...

You can use any software to plot and/or to calculate values/data, but if you do, provide (copy/paste) here the code. Data sets relevant for this HW can be found at the UCI Machine Learning...

Every business project starts with a business problem. You need to select a business problem that can be addressed through data mining. You receive 30 extra points if your topic is novel. Finding a...

very business project starts with a business problem. You need to select a business problem that can be addressed through data mining. You receive 30 extra points if your topic is novel. Finding a...

Objective: Apply supervised learning techniques to a real - world dataset to solve a prediction problem. Use at least two different supervised learning algorithms to train models and perform a...

You are expected to visit data repositories such as Kaggle, UCI Machine learning repository, PROMISE, etc. and extract three different datasets in the areas of classification, clustering and...

DATA ANALYTICS Choose one dataset from the classification category (there are a total of 262 sets) of the UCI Machine Learning Repository. If the data comes in one set, partition the data into...

Assignment: Data Management for AI Applications Objective: To understand and apply the concepts and techniques of data management for AI applications. Tasks: Choose an AI application domain that...

Major tasks required for the project: Step 1: Obtaining a dataset The first step is to find your own domain-specific dataset for your statistical analysis project. There is no restrictions on the...

Write solubility product expressions for the following compounds. a. Ba3(PO4)2 b. FePO4 c. PbI2 d. Ag2S

Idrees engineering (IE is a supplier of various types of industrial machines. It also provides services for the maintenance of these machines. Following transactions were carried out by IE during the...

You have an investment opportunity that promises to pay you $ 2 0 , 0 0 0 in four years. You could earn a 7 % annual return investing elsewhere. What is the maximum amount you would be willing to...

CT Corp Comprehensive Question Canadian Tire Corporation, Limited ( Canadian Tire ) is a family of companies that includes a retail segment and a financial services division, among others. The retail...

What is the environment we are trying to create?

3. How might you apply these learnings to the challenges you face currently? 1. What will you say to Diane? Are there additional questions you would like to ask?

How can we visually describe our goals?