Question: For data sets that contain many missing values, methods for estimating the missing values called imputation algorithms may be applied. In the journal,

For data sets that contain many missing values, methods for estimating the missing values — called imputation algorithms — may be applied. In the journal, Data & Knowledge Engineering (March 2013), researchers compared several imputation algorithms based on using nearest neighbors to estimate missing values. The five methods studied are named KMI, EACI, IKNNI, KNNI, and SKNN. Each of the methods was applied to each of four different data sets, one data set with 10% missing values, one with 30% missing, one with 50% missing, and one with 70% missing. After each imputation algorithm was applied, the normalized root mean square error (NRMSE) — a measure of the accuracy of the missing value predictions — was determined. These NRMSE values (based on information provided in the journal article) are given in the following table. Conduct a nonparametric analysis of the data. Is there evidence to indicate that the NRMSE distributions differ for the five imputation algorithms? Test using α = .01.

IMPUTE Missing % KMI EACI IKNNI KNNI SKNN 10 42 .29 .23

IMPUTE Missing % KMI EACI IKNNI KNNI SKNN 10 42 .29 .23 .23 .23 30 40 .30 24 .25 .24 50 .39 .30 .25 .26 26 70 .40 .31 .26 .27 .26

Step by Step Solution

★★★★★

3.48 Rating (155 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock

To conduct a nonparametric analysis of the data and test whether the NRMSE distributions differ for the five imputation algorithms we can use a nonpar... View full answer

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Statistics For Engineering And The Sciences Questions!

Define cognitive scripts and how they may be applied in situations where spontaneous violence could occur.

Discuss how the various valuation approaches may be applied in the determination of the fair value of the computer program

Researchers at Carnegie Mellon University developed an algorithm for estimating the sensor motion of a robotic arm by mounting a camera with inertia sensors on the arm (International Journal of...

Could you please explain the findings of the study? A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models Evangelia...

Rieg, Robert, Zarzycka, Ewelina, & Dobroszek, Justyna. (2021). Determinants of separating management accounting from financial accounting in SMEs and Family Firms - evidence from Poland and Germany....

Al-Driven Contextual Advertising: Toward Relevant Messaging Without Personal Data E. Haglund and J. Bjorklund Department of Computing Science, Umea University, Umed, Sweden ABSTRACT In programmatic...

Evaluation of an imputation method for missing data. When analyzing big data (large data sets with many variables), business researchers often encounter the problem of missing data (e.g.,...

nodes, but at least its bias can be quantified by Markov Chain L. INTRODUCTION analysis and thus can be corrected via appropriate re-weighting The popularity of online social networks (OSNs) in...

Big Data for Social Innovation By Kevin C. Desouza & Kendra L. Smith Stanford Social Innovation Review Summer 2014 Copyright 2014 by Leland Stanford Jr. University All Rights Reserved Stanford Social...

Calculate the income tax liability of Mr. Smith for the financial year 2022-23. Mr. Smith's total income for the year is $150,000, and he has made the following investments: i) Invested $20,000 in...

In Figure a closed loop carries current i = 200 mA. The loop consists of two radial straight wires and two concentric circular arcs of radii 2.00 m and 4.00 m. The angle is /4 rad. What are the (a)...

Which one of the following transactions represents a primary market transaction? Multiple Choice Purchase of 5 0 0 shares of GE stock from a current sharehoider leM's purchase of GE stock from a...

1. Explain the three stages of social media management as a profession. Why do you think the profession has developed in this way? Please explain your answer.

71. Here is a description from Minitab of the strength data given in Exercise 13. d1 5 x1 2 y1, c, dn 5 xn 2 yn F 5 9 5C 1 32 sy s 2 x y 2 i 5 1, 2, x c, n yi 5 axi 1 b x 2 m g(xg(x i 2 m)2 i 2 x)2...

72. Anxiety disorders and symptoms can often be effectively treated with benzodiazepine medications. It is known that animals exposed to stress exhibit a decrease in benzodiazepine receptor binding...

73. The article Can We Really Walk Straight? (Amer. J. of Physical Anthropology, 1992: 1927) reported on an experiment in which each of 20 healthy men was asked to walk as straight as possible to a...

Break-Even Analysis. Dime a Dozen Diamonds makes synthetic diamonds by treating carbon. Each diamond can be sold for $100. The materials cost for a standard diamond is $40. The fixed costs incurred...

any help would be greatly appreciated Calculate the yield of the following stock. (Round your answer to the nearest tenth percent.) Company Yearly dividend Closing price per share Best Buy $0. 28...

I nclude the title for the step in each of your problem solutions ( example:a. population, b. parameter, etc. ) a.population b.parameter c.null hypothesis and alternative hypothesis d.data /...