Question: 6 . 8 Tableau: Clustering ( loans ) Background As a data scientist working for Lending Club, you have been tasked with identifying and describing

6.8 Tableau: Clustering (loans)
Background
As a data scientist working for Lending Club, you have been tasked with identifying and describing the types, or clusters, of customers you have.
Tasks
Complete each step as outlined in the questions below.
Data Source
Use the lc_large.csv file available to download below.
Drag the table: lc_Loans into the entity view in Tableau. Select the "Extract" option for the connection in Tableau.
Data Dictionary:
Features about the loan
loan_status: current status of the loan
loan_status_numeric: a rank-ordered numeric version of loan_status
loan_amount: the listed amount of the loan applied for by the borrower
issue_d: the date the loan was funded/issued
term: the number of payments on the loan
int_rate: the interest rate on the loan
installment: the monthly payment owed by the borrower
total_pymnt: payments received to date for total amount funded
total_rec_prncp: payments received to date for total amount funded
total_rec_int: interest received to date
total_rec_late_fee: late fees received to date
recoveries: post charge off gross recovery (i.e., if the loan was charged off, how much money was recovered afterward, if any)
title: the loan title provided by the borrower
purpose: a category provided by the borrower for the loan request
Features obtained from the borrower before the loan was issued
emp_title: the job title supplied by the borrower
emp_length: employment length in years
home_ownership: the homeownership status provided by the borrower
annual_income: the self-reported annual income provided by the borrower
verification_status: was income verified by LC, the source, or not verified
Features obtained from the credit bureau about the borrower before issued
acc_now_delinq: the number of accounts on which the borrower is now delinquent
delinq_2yrs: the number of 30+ days past-due incidences of delinquency in the borrower's credit file for the past 2 years
earliest_cr_line: the month the borrower's earliest reported credit line was opened
inq_last_6mths: the number of unsecured inquiries in the past 6 months
mths_since_last_delinq: the number of months since the borrower's last delinquency
mths_since_last_record: the number of months since the last public record
open_acc: the number of open credit lines in the borrower's credit file
pub_rec: number of derogatory public records
revol_bal: total credit revolving balance
revol_util: the amount of credit the borrower is using relative to all available revolving credit
tot_coll_amt: total collection amounts ever owed
tot_cur_bal: total current balance of all accounts
total_acc: the total number of credit lines currently in the borrower's credit file
total_rev_hi_lim: total credit limit on revolving accounts
Features engineered by LC based on the credit bureau data
dti: a ratio calculated using the borrower's total monthly payments on the total debt obligations, excluding mortgages and the requested LC loan, divided by the borrower's combined self-reported monthly income
grade: the likelihood that the loan will be paid back
sub_grade: a more granular version of grade
Deliverables
In addition to answering the questions below, you will upload a .twbx file from Tableau that includes all of the work you did to complete this assessment.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Finance Questions!