This question should be answered using the Ionosphere data set, which is part of the mlbench...
Question:
This question should be answered using the Ionosphere data set, which is part of the mlbench package. This radar data was collected by a system in Goose Bay, Labrador. The data frame consists of 351 observations on 35 independent variables. The last column in the data frame is a categorical variable, "Class", classifying radar returns from free electrons in the ionosphere: "good" radar returns are those showing evidence of some type of structure in the ionosphere; "bad" returns are those that do not.
(a) Produce some numerical and graphical summaries of the Ionosphere data. Do there appear to be any patterns?
(b) Notice that the second column contains only a single value, so remove that column and answer the remaining questions using the new data set. Perform a K-Nearest Neighbors (KNN) algorithm with K = 1, where Class is the response and the remaining columns in the data set are the predictors.
(c) Compute the confusion matrix and the overall fraction of correct predictions. Explain what the confusion matrix is telling you about the types of mistakes made by the KNN algorithm.
(d) Split the data randomly into a training set (70%) and a test set (30%). Make sure to use set.seed(4323) for reproducible results. Fit the KNN model (K = 3).
(e) Repeat (d) using K = 5.
(f) Repeat (d) using K = 7.
(g) Which of these methods appears to provide the best results on this data?
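One way parts (a) through (g) could be approached in R is sketched below. This is a hedged outline, not the posted answer. It assumes the CRAN packages mlbench and class are installed, that knn() from class is an acceptable KNN implementation, and that the 70% training size is rounded to the nearest whole observation. The object names (ion, X, y, train_idx, conf1, acc) and the choice of V5 for the plot are illustrative.

```r
# Assumption: the CRAN packages 'mlbench' (data) and 'class' (knn) are installed.
library(mlbench)
library(class)

data(Ionosphere)

# (a) Numerical and graphical summaries.
str(Ionosphere)
summary(Ionosphere)
table(Ionosphere$Class)
# One example plot; V5 is an arbitrary choice of predictor for illustration.
boxplot(V5 ~ Class, data = Ionosphere, main = "V5 by Class")

# (b) Drop the second column, which holds a single value.
ion <- Ionosphere[, -2]
# knn() needs numeric predictors; this conversion is safe whether V1 is
# stored as a factor or as a number.
ion$V1 <- as.numeric(as.character(ion$V1))

X <- as.matrix(ion[, names(ion) != "Class"])
y <- ion$Class

# KNN with K = 1, trained and evaluated on the full data set.
# Note: each point is then its own nearest neighbor, so this accuracy
# is optimistic and says little about out-of-sample performance.
pred1 <- knn(train = X, test = X, cl = y, k = 1)

# (c) Confusion matrix and overall fraction of correct predictions.
conf1 <- table(Predicted = pred1, Actual = y)
print(conf1)
print(mean(pred1 == y))

# (d)-(f) 70/30 split with the seed from the question, then K = 3, 5, 7.
set.seed(4323)
n <- nrow(ion)
train_idx <- sample(n, size = round(0.7 * n))  # rounding is an assumption

ks <- c(3, 5, 7)
acc <- sapply(ks, function(k) {
  pred <- knn(train = X[train_idx, ], test = X[-train_idx, ],
              cl = y[train_idx], k = k)
  print(table(Predicted = pred, Actual = y[-train_idx]))
  mean(pred == y[-train_idx])  # test-set accuracy for this k
})
names(acc) <- paste0("K=", ks)

# (g) Compare test accuracies across K.
print(acc)
```

For part (g), the comparison is most meaningful among the K = 3, 5, and 7 fits, since those are scored on held-out data. The K = 1 result from (b) and (c) is computed on the training data itself and is not directly comparable.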
Expert Answer: