Question: Given the following FIT and TEST data sets: FIT data set: = = = = = = = = = = = = = Program

Given the following FIT and TEST data sets:

FIT data set:

= = = = = = = = = = = = =

Program Modules Actual no

.

of Fault

-

Prone

(

)

LOC

Faults

(

)

Not Fault

-

Prone

(

NFP

)

Indep. Variable

(

)

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

0

NFP

19

1

NFP

25

0

NFP

20

2

NFP

28

1

NFP

21

3

NFP

30

2

NFP

36

5

NFP

38

7

41

11

45

16

50

21

60

TEST data set:

= = = = = = = = = = = = =

? ? 42

? ? 37

Where

(?)

means that we do NOT know its value and have to predict

/

estimate

.

(

)

Use CBR with Euclidean Distance as similarity function and un

-

weighted

average of THREE most similar cases to predict the number of faults in modules M

and N

.

SHOW ALL YOUR WORK!

(

)

Use CBR with Euclidean Distance as similarity function and Majority Voting

method with THREE most similar cases to classify modules M and N as FP or NFP

when C

= 2.25 .

SHOW ALL YOUR WORK!

(

)

Repeat part

(

)

with Data Clustering method. SHOW ALL YOUR WORK!

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Computer Science Data Mining Machine Learning Given the following FIT and Test data sets: FIT DATA SET: Program Modules Actual no of Faults (Y) 0 A B 1 0 2 Fault-Prone (FP) Not Fault-Prone (NFP) NEP...

. 5. Given the following FIT and TEST data seta: FIT data set: Program Modules Actual no. of Fault-Prone (FP Fault-prone Faults Y Not (NFF) indep. var x NFP NEP NFF NFP NEP NEP NEP NEP FP FP 23 25 28...

use the code r Script below to Answer the questions from number 3 to 7 Questions : 3. Model #1 - First Logistic Regression Model Reporting Results Report the results of the regression model. Address...

This assignment involves ondering of software modules, based on the nunmber of fulls predicted by a software quality prediction model. Use the predictions obuained with Linear Regression Mode we...

Small Project 2: Module Order Models DUE DATE: March 29, 17 This assignment involves ondering of software modules, based on the nunmber of fulls predicted by a software quality prediction model. Use...

Create the following program using java and post the output as well. Assignment In this assignment you are to implement a hash table for use in a simple spell checker. The spell checker should be set...

Use the C programming language and also put comments in the code. Take the pic to a new tab and zoom in. D Micrasoft Word- SP 201 x Secure | http/p thyrul 1 yp id225412psuh4/SP 20 1 8...

Problem Construct a module by the name of DataAnalysis . py , which has a class LinearRegression, and two functions split data ( ) and pairwiseplot ( ) . Use this module in a different file to fit...

Show that each point of the Cantor set F is a cluster point of F.

Assume x and y are functions of t. Evaluate dy/dt for 4xy-3x+4y^3= -76 , with conditions dx/dt= -8, x=4, y=-2

BE2.4 (LO 2, 3) Using the data in BE2.3, journalize the entry on July 1 and the adjusting entry on December 31 for Zubin Insurance. Zubin uses the accounts Unearned Insurance Revenue and Insurance...

5. Develop a scenario comparing two PH programs and involving the use of a CBA.

Create a word table that gives definitions for six types of multimedia presentation aids. Use the APA style to format and place the number and title of the table. Assume that this table is the first...

Prepare a visual aid that shows the steps in developing an oral presentation. (Objectives 3 and 7)

What is the difference between a pictograph and a drawing? (Objective 5)