Question: Overview: In this analysis you will develop a logistic regression model based on the provided data set to predict whether or not the specimens are genuine.

Data Set Information: Data (A6DATA.csv) were extracted from images taken of genuine and forged banknote-like specimens. For digitization, an industrial camera usually used for print inspection was used. The final images have 400 x 400 pixels. Due to the object lens and the distance to the investigated object, gray-scale pictures with a resolution of about 660 dpi were obtained. A wavelet transform tool was used to extract features from the images.

Attribute Information:

V1: variance of Wavelet Transformed image (continuous)

V2: skewness of Wavelet Transformed image (continuous)

V3: kurtosis of Wavelet Transformed image (continuous)

V4: entropy of image (continuous)

V5: class (0-forged, 1-genuine)

PART 1

Read the A6DATA.csv data file into RStudio. Run set.seed(222) for partitioning of the dataset into training (50%) and testing (50%). Report on the number of forged and genuine banknote-like specimens in the training and testing data.
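A minimal sketch of the partitioning step in R, under stated assumptions. A6DATA.csv is not reproduced here, so a synthetic data frame with the same column names (V1 to V5) stands in for it, and the counts it prints are not the assignment's answer. The split draws an exact 50% sample of row indices; a course may instead expect a different scheme (for example, sampling a group label per row), which would give different counts under the same seed.

```r
# Stand-in for: data <- read.csv("A6DATA.csv")
# Synthetic values only; column names follow the attribute list above.
set.seed(1)  # seed for the synthetic stand-in only, not part of the task
n <- 200
data <- data.frame(V1 = rnorm(n), V2 = rnorm(n), V3 = rnorm(n), V4 = rnorm(n))
data$V5 <- rbinom(n, 1, plogis(0.5 + 1.5 * data$V1 + data$V2))  # 0 = forged, 1 = genuine

# 50% training / 50% testing partition with the seed named in the task
set.seed(222)
idx <- sample(nrow(data), size = round(0.5 * nrow(data)))
train <- data[idx, ]
test <- data[-idx, ]

# Number of forged (0) and genuine (1) specimens in each partition
table(train$V5)
table(test$V5)
```

With the real file, only the first block changes: replace the synthetic data frame with the `read.csv` call shown in the comment.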

PART 2

Develop a logistic regression model using the training data. Provide the final logistic regression model (with only significant variables), the equation for calculating the probability that a specimen is genuine, the confusion matrix for both training and testing data, the misclassification error for both training and testing data, and a comment on the performance of the model.
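One way the pieces requested in Part 2 could fit together in R, sketched on synthetic stand-in data rather than A6DATA.csv. The reduced model (V1 and V2 only), the 0.5 cutoff, and the helper name `evaluate` are illustrative assumptions; which variables are significant has to be read from `summary()` on the real data. With coefficients b0, b1, b2, the probability that a specimen is genuine is 1 / (1 + exp(-(b0 + b1*V1 + b2*V2))), which is what `predict(..., type = "response")` returns.

```r
# Synthetic stand-in for read.csv("A6DATA.csv"); values are not the real data.
set.seed(1)
n <- 400
data <- data.frame(V1 = rnorm(n), V2 = rnorm(n), V3 = rnorm(n), V4 = rnorm(n))
data$V5 <- rbinom(n, 1, plogis(0.5 + 1.5 * data$V1 + data$V2))  # 0 = forged, 1 = genuine

set.seed(222)
idx <- sample(nrow(data), size = round(0.5 * nrow(data)))
train <- data[idx, ]
test <- data[-idx, ]

# Full model first; inspect the p-values, then refit without weak terms.
full <- glm(V5 ~ V1 + V2 + V3 + V4, data = train, family = binomial)
summary(full)

# Hypothetical reduced model (assumes V3 and V4 turned out non-significant).
final <- glm(V5 ~ V1 + V2, data = train, family = binomial)
b <- coef(final)

# Probability equation written out by hand, and the same via predict().
p_manual <- 1 / (1 + exp(-(b[1] + b[2] * test$V1 + b[3] * test$V2)))
p_test <- predict(final, newdata = test, type = "response")

# Confusion matrix and misclassification error at an assumed 0.5 cutoff.
evaluate <- function(model, d, cutoff = 0.5) {
  p <- predict(model, newdata = d, type = "response")
  pred <- factor(ifelse(p > cutoff, 1, 0), levels = c(0, 1))
  cm <- table(Predicted = pred, Actual = factor(d$V5, levels = c(0, 1)))
  list(confusion = cm, error = 1 - sum(diag(cm)) / sum(cm))
}
evaluate(final, train)
evaluate(final, test)
```

Comparing the training and testing error from `evaluate` is a simple check for overfitting: a testing error far above the training error suggests the model does not generalize well.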

PART 3

Develop logistic regression models with 60%/40%, 70%/30%, and 80%/20% partitioning into training and testing data sets using set.seed(222). Summarize training and testing accuracy, sensitivity and specificity for each and compare with 50%/50% performance using the table below. Recommend and comment on the best model for future use.

Partitioning	Accuracy %	Sensitivity %	Specificity %
Training - 50%
Testing - 50%
Training - 60%
Testing - 40%
Training - 70%
Testing - 30%
Training - 80%
Testing - 20%
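The table can be filled by looping over the four training fractions. The sketch below does so on synthetic stand-in data, so its numbers are not the assignment's results. It assumes genuine (1) is the positive class, a 0.5 cutoff, and the full four-variable model for every split; the helper name `metrics` is hypothetical. Sensitivity is TP / (TP + FN) and specificity is TN / (TN + FP).

```r
# Synthetic stand-in for read.csv("A6DATA.csv"); values are not the real data.
set.seed(1)
n <- 400
data <- data.frame(V1 = rnorm(n), V2 = rnorm(n), V3 = rnorm(n), V4 = rnorm(n))
data$V5 <- rbinom(n, 1, plogis(0.5 + 1.5 * data$V1 + data$V2))  # 0 = forged, 1 = genuine

# Accuracy, sensitivity and specificity in percent (positive class = 1, assumed).
metrics <- function(actual, prob, cutoff = 0.5) {
  pred <- as.integer(prob > cutoff)
  tp <- sum(pred == 1 & actual == 1)
  tn <- sum(pred == 0 & actual == 0)
  fp <- sum(pred == 1 & actual == 0)
  fn <- sum(pred == 0 & actual == 1)
  c(Accuracy = 100 * (tp + tn) / length(actual),
    Sensitivity = 100 * tp / (tp + fn),
    Specificity = 100 * tn / (tn + fp))
}

results <- do.call(rbind, lapply(c(0.5, 0.6, 0.7, 0.8), function(frac) {
  set.seed(222)  # same seed for every partition, as the task requires
  idx <- sample(nrow(data), size = round(frac * nrow(data)))
  train <- data[idx, ]
  test <- data[-idx, ]
  fit <- glm(V5 ~ V1 + V2 + V3 + V4, data = train, family = binomial)
  out <- rbind(
    metrics(train$V5, predict(fit, newdata = train, type = "response")),
    metrics(test$V5, predict(fit, newdata = test, type = "response"))
  )
  rownames(out) <- c(sprintf("Training - %.0f%%", 100 * frac),
                     sprintf("Testing - %.0f%%", 100 * (1 - frac)))
  out
}))
round(results, 1)
```

The resulting matrix has one row per line of the table, in the same order, so the testing rows can be compared directly when recommending a partition.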

PART 4

Compare the best and the worst logistic regression model in the previous question using ROC curve, AUC and best threshold values based on testing data. Discuss your results.
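A base-R sketch of the ROC comparison, again on synthetic stand-in data. Several details are assumptions: the two fractions compared (0.5 and 0.8) are placeholders for whichever partitions turn out best and worst, the "best threshold" is taken as the cutoff maximizing Youden's J (sensitivity + specificity - 1), since the question does not name a criterion, and the helper names are hypothetical. AUC is computed from ranks (the Mann-Whitney formulation) rather than with a dedicated ROC package.

```r
# Synthetic stand-in for read.csv("A6DATA.csv"); values are not the real data.
set.seed(1)
n <- 400
data <- data.frame(V1 = rnorm(n), V2 = rnorm(n), V3 = rnorm(n), V4 = rnorm(n))
data$V5 <- rbinom(n, 1, plogis(0.5 + 1.5 * data$V1 + data$V2))  # 0 = forged, 1 = genuine

# Fit on a training fraction and return testing labels and probabilities.
fit_and_score <- function(frac) {
  set.seed(222)
  idx <- sample(nrow(data), size = round(frac * nrow(data)))
  fit <- glm(V5 ~ V1 + V2 + V3 + V4, data = data[idx, ], family = binomial)
  test <- data[-idx, ]
  list(actual = test$V5,
       prob = unname(predict(fit, newdata = test, type = "response")))
}

# ROC points: true and false positive rates at every observed cutoff.
roc_points <- function(actual, prob) {
  cuts <- sort(unique(c(0, prob, 1)), decreasing = TRUE)
  data.frame(threshold = cuts,
             tpr = sapply(cuts, function(t) mean(prob[actual == 1] >= t)),
             fpr = sapply(cuts, function(t) mean(prob[actual == 0] >= t)))
}

# AUC from ranks; average ranks account for tied probabilities.
auc_rank <- function(actual, prob) {
  r <- rank(prob)
  n1 <- sum(actual == 1)
  n0 <- sum(actual == 0)
  (sum(r[actual == 1]) - n1 * (n1 + 1) / 2) / (n1 * n0)
}

a <- fit_and_score(0.5)  # placeholder for one candidate model
b <- fit_and_score(0.8)  # placeholder for the other
roc_a <- roc_points(a$actual, a$prob)
roc_b <- roc_points(b$actual, b$prob)

plot(roc_a$fpr, roc_a$tpr, type = "l",
     xlab = "1 - Specificity", ylab = "Sensitivity")
lines(roc_b$fpr, roc_b$tpr, lty = 2)
abline(0, 1, col = "grey")  # chance line

# AUC and Youden-optimal threshold for each model (testing data).
summary_tbl <- rbind(
  "50/50" = c(AUC = auc_rank(a$actual, a$prob),
              BestThreshold = roc_a$threshold[which.max(roc_a$tpr - roc_a$fpr)]),
  "80/20" = c(AUC = auc_rank(b$actual, b$prob),
              BestThreshold = roc_b$threshold[which.max(roc_b$tpr - roc_b$fpr)])
)
summary_tbl
```

A curve closer to the top-left corner and a larger AUC indicate better separation of genuine from forged specimens; the best threshold shows whether the default 0.5 cutoff is a reasonable choice for that model.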
