The purpose of this exercise is to develop models to predict forest cover type using a number
The purpose of this exercise is to develop models to predict forest cover type using a number of cartographic measures. The given data set (Online File W6.1) includes four wilderness areas found in the Roosevelt National Forest of northern Colorado. A total of 12 cartographic measures were utilized as independent variables; seven major forest cover types were used as dependent variables. The following table provides a short description of these independent and dependent variables: This is an excellent example for a multiclass classification problem. The data set is rather large (with 581,012 unique instances) and feature rich. As you will see, the data is also raw and skewed (unbalanced for different cover types). As a model builder, you are to make necessary decisions to preprocess the data and build the best possible predictor. Use your favorite tool to build the models for neural networks, support vector machines, and nearest neighbor algorithms, and document the details of your results and experiences in a written report. Use screenshots within your report to illustrate important and interesting findings. You are expected to discuss and justify any decision that you make along the way. The reuse of this data set is unlimited with retention of copyright notice for Jock A. Blackard and Colorado State University.
Step by Step Answer: