Question: For Analytic Solver, partition the data sets into 5 0 % training, 3 0 % validation, and 2 0 % test and use 1 2

For Analytic Solver, partition the data sets into 50% training, 30% validation, and 20% test and use 12345 as the default random seed. When searching for the optimal value of k, search within possible k values from 1 to 10. If the predictor variable values are in the character format, then treat the predictor variable as a categorical variable. Otherwise, treat the predictor variable as a numerical variable.
Being able to predict machine failures before they happen can save millions of dollars for manufacturing companies. Manufacturers want to be able to perform preventive maintenance or repairs in advance to minimize machine downtime and often install electronic sensors to monitor the machines and their surrounding environment. However, the more sophisticated the machine, the more difficult it is to diagnose and predict the failure rate. Data mining has been used to analyze environmental factors to predict whether or not complex machines such as nanotechnology equipment will fail from one production period to another. The accompanying data file contains 480 observations and three environmental variables: level of humidity in the room where the equipment is located (Humid, in percentage); overall temperature in the room (Temp, in Fahrenheit); and a target variable that indicates whether or not the equipment broke down during the next production period (Breakdown =1 if breakdown, 0 otherwise).
a. Perform KNN analysis on the Machine_Data worksheet to determine the optimal k. Use 0.5 as the cutoff value for this analysis. Enter the optimal k in the box below:

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related General Management Questions!