For data sets that contain many missing values, methods for estimating the missing values called imputation

Question:

For data sets that contain many missing values, methods for estimating the missing values — called imputation algorithms — may be applied. In the journal, Data & Knowledge Engineering (March 2013), researchers compared several imputation algorithms based on using nearest neighbors to estimate missing values. The five methods studied are named KMI, EACI, IKNNI, KNNI, and SKNN. Each of the methods was applied to each of four different data sets, one data set with 10% missing values, one with 30% missing, one with 50% missing, and one with 70% missing. After each imputation algorithm was applied, the normalized root mean square error (NRMSE) — a measure of the accuracy of the missing value predictions — was determined. These NRMSE values (based on information provided in the journal article) are given in the following table. Conduct a nonparametric analysis of the data. Is there evidence to indicate that the NRMSE distributions differ for the five imputation algorithms? Test using α = .01.

Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Related Book For  answer-question

Statistics For Engineering And The Sciences

ISBN: 9781498728850

6th Edition

Authors: William M. Mendenhall, Terry L. Sincich

Question Posted: