Question: The classic MNIST Digit RecognizerLinks to an external site. problem is a competition on Kaggle.com, and you will compete in this competition. For this assignment,

The classic MNIST Digit RecognizerLinks to an external site. problem is a competition on Kaggle.com, and you will compete in this competition. For this assignment, you will develop a classifier that may be used to predict which of the

10

digits is being written.

Management

/

Research Question

In layman

s terms, what is the management

/

research question of interest, and why would anyone care?

Requirements

Fit a random forest classifier using the full set of explanatory variables and the model training set

(

csv

) .

Record the time it takes to fit the model and then evaluate the model on the csvdata by submitting to Kaggle.com. Provide your Kaggle.com score and user ID

.

Execute principal components analysis

(

PCA

)

on the combined training and test set data together, generating principal components that represent

95

percent of the variability in the explanatory variables. The number of principal components in the solution should be substantially fewer than the explanatory variables.

Record the time it takes to identify the principal components.

Using the identified principal components from step

(2),

use thecsvto build another random forest classifier.

Record the time it takes to fit the model and to evaluate the model on the csvdata by submitting to Kaggle.com. Provide your Kaggle.com score and user ID

.

Use k

-

means clustering to group MNIST observations into

1

10

categories and then assign labels.

(

Follow the example here if needed: kmeans mnist.pdf Download kmeans mnist.pdf

) .

kmeans mnist

- 2 .

pdf Download kmeans mnist

- 2 .

pdf

Submit the RF Classifier, the PCA RF

,

and k

-

means estimations to Kaggle.com, and provide screen snapshots of your scores as well as your Kaggle.com user name.

The experiment we have proposed has a major design flaw. Identify the flaw. Fix it

.

Rerun the experiment in a way that is consistent with a training

-

and

-

test regimen, and submit this to Kaggle.com.

Report total elapsed time measures for the training set analysis. It is sufficient to run a single time

-

elapsed test for this assignment. In practice, we might consider the possibility of repeated executions of the relevant portions of the programs, much as the Benchmark Example programs do

.

Some code that might help you with reporting elapsed total time follows.

start

=

datetime.now

()

2 .

fit

(

trainimages

,

labels

)

end

=

datetime.now

()

(

end

-

start

)

Python Programming

All programming will be done in Python.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

1 2 Milestone Three - Statement of Cost of Goods Sold 3 4 6 Beginning Work in Process Inventory 7 Direct Materials: 8 9 11 12 A Materials: Beginning Add: Purchases for month of January 14 15 16...

STRATEGY How Companies Become Platform Leaders Under the right circumstances, * companies of any size can grow to become platform leaders. And particular business and technology decisions can help...

For some years now, you've owned a small specialty bookshop in a college town. You sell some textbooks but mainly cater to a broader customer base. Your store always stocks the latest fiction,...

Having reviewed the opening lecture and this week's reading inHBR's OnStrategy Vol. 2,think about all the changes we've experience in the global marketplace in just the last 3 months.Where are you...

Case Summary Read the Discussion Assignment 1-1 on p.24 of the text Winning and Longevity. Select a health care entity to focus on, this could be a clinic or hospital of your choosing. Apply the case...

MGMT 479- Strategic Management ANALYZING A CASE STUDY (Revised January 25, 2016) Rubric for Case Study - Maximum Points Available (100 Points) The purpose of the case study is to let you apply the...

Sawchyn Guitars Case Study Questions: Who are the main players within the organization affected by the challenges? In what business and industry is the company operating? Describe the characteristics...

play.google.com + (105) The SAG-AFTRA Strike: Streaming Revenue, Al drama+did going WOKE c... Course Hero Strategy, Create and Implement the Best Strategy for Your Business - Google P... Strategy,...

(Note: this is not the same problem as last week's Problems of the Week.) Suppose that f(x, y) is a function such that fx 0, fxx > 0, fyy > 0, and fay

What would you do in each of the scenarios described in Table, Ethical Decision Making in the International Context?

In terms of competitive rivalry within an industry, what is the impact of a business having surplus production capacity? Costs of spare capacity are likely to reduce the business's ability to be...

Which of the following are problems with identifying users of ABC? Multiple select question. ABC means different things to different organizations. Organizations will announce the discontinuance of...

Compare the different types of employee separation actions.

Assess alternative dispute resolution methods.

Distinguish between intrinsic and extrinsic rewards.