The file UniversalBank.csv contains data on 5000 customers of Universal Bank. The data include customer demographic information

Question:

The file UniversalBank.csv contains data on 5000 customers of Universal Bank. The data include customer demographic information (age, income, etc.), the customer’s relationship with the bank (mortgage, securities account, etc.), and the customer response to the last personal loan campaign (Personal Loan). Among these 5000 customers, only 480 (=9.6%) accepted the personal loan that was offered to them in the earlier campaign. In this exercise, we focus on two predictors: Online (whether or not the customer is an active user of online banking services) and Credit Card (abbreviated CC below) (does the customer hold a credit card issued by the bank) and the class label Personal Loan (abbreviated Loan below).

Partition the data into training (60%) and holdout (40%) sets.

a. Create a pivot table for the training data with Online as a column grouping attribute, CC as a row attribute (i.e., group by attribute), and Loan as a secondary row attribute (i.e., group by attribute). The values inside the table should convey the count. Consider using the Turbo Prep view for building the pivot table.

b. Consider the task of classifying a customer who owns a bank credit card and is actively using online banking services. Looking at the pivot table, what is the probability that this customer will accept the loan offer? [This is the probability of loan acceptance (Loan = true) conditional on having a bank credit card (CC = true) and being an active user of online banking services (Online = true).

c. Create two separate pivot tables for the training data. One will have Loan (rows) as a function of Online (columns), and the other will have Loan (rows) as a function of CC.

d. Compute the following quantities [P(A | B) means “the probability of A given B”:

i. P(CC = true | Loan = true) (the proportion of credit card holders among the loan acceptors)

ii. P(Online = true | Loan = true)

iii. P(Loan = true) (the proportion of loan acceptors)

iv. P(CC = true | Loan = false)

v. P(Online = true | Loan = false)

vi. P(Loan = false)

e. Use the quantities computed above to compute the naive Bayes probability P(Loan = true | CC = true, Online = true).

f. Compare this value with the one obtained from the pivot table in (b). Which is a more accurate estimate?

g. In RapidMiner, run naive Bayes on the data. Examine the model output on the training data, and find an entry that corresponds to P(Loan = true | CC = true, Online = true). Compare this with the number you obtained in (e).

Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Answer rating: 100% (QA)

To address the given tasks well follow these steps a Create a pivot table for the training data with Online as a column grouping attribute CC as a row attribute and Loan as a secondary row attribute d...View the full answer

Answered By

Ashington Waweru

I am a lecturer, research writer and also a qualified financial analyst and accountant. I am qualified and articulate in many disciplines including English, Accounting, Finance, Quantitative spreadsheet analysis, Economics, and Statistics. I am an expert with sixteen years of experience in online industry-related work. I have a master's in business administration and a bachelor’s degree in education, accounting, and economics options. I am a writer and proofreading expert with sixteen years of experience in online writing, proofreading, and text editing. I have vast knowledge and experience in writing techniques and styles such as APA, ASA, MLA, Chicago, Turabian, IEEE, and many others. I am also an online blogger and research writer with sixteen years of writing and proofreading articles and reports. I have written many scripts and articles for blogs, and I also specialize in search engine I have sixteen years of experience in Excel data entry, Excel data analysis, R-studio quantitative analysis, SPSS quantitative analysis, research writing, and proofreading articles and reports. I will deliver the highest quality online and offline Excel, R, SPSS, and other spreadsheet solutions within your operational deadlines. I have also compiled many original Excel quantitative and text spreadsheets which solve client’s problems in my research writing career. I have extensive enterprise resource planning accounting, financial modeling, financial reporting, and company analysis: customer relationship management, enterprise resource planning, financial accounting projects, and corporate finance. I am articulate in psychology, engineering, nursing, counseling, project management, accounting, finance, quantitative spreadsheet analysis, statistical and economic analysis, among many other industry fields and academic disciplines. I work to solve problems and provide accurate and credible solutions and research reports in all industries in the global economy. I have taught and conducted masters and Ph.D. thesis research for specialists in Quantitative finance, Financial Accounting, Actuarial science, Macroeconomics, Microeconomics, Risk Management, Managerial Economics, Engineering Economics, Financial economics, Taxation and many other disciplines including water engineering, psychology, e-commerce, mechanical engineering, leadership and many others. I have developed many courses on online websites like Teachable and Thinkific. I also developed an accounting reporting automation software project for Utafiti sacco located at ILRI Uthiru Kenya when I was working there in year 2001. I am a mature, self-motivated worker who delivers high-quality, on-time reports which solve client’s problems accurately. I have written many academic and professional industry research papers and tutored many clients from college to university undergraduate, master's and Ph.D. students, and corporate professionals. I anticipate your hiring me. I know I will deliver the highest quality work you will find anywhere to award me your project work. Please note that I am looking for a long-term work relationship with you. I look forward to you delivering the best service to you.

3.00+ 2+ Reviews 10+ Question Solved

Related Book For book-img-for-question

Machine Learning For Business Analytics

ISBN: 9781119828792

1st Edition

Authors: Galit Shmueli, Peter C. Bruce, Amit V. Deokar, Nitin R. Patel

See More Books

Question Posted: Mar 28, 2024 07:01 AM

See More Questions

The file UniversalBank.csv contains data on 5000 customers of Universal Bank. The data include customer demographic information

Question:

Step by Step Answer:

To address the given tasks well follow these steps a Create a pivot table for the training data with Online as a column grouping attribute CC as a row attribute and Loan as a secondary row attribute d...View the full answer

Machine Learning For Business Analytics

Students also viewed these Business questions