Question: 6 . 8 Tableau: Clustering ( loans ) Background As a data scientist working for Lending Club, you have been tasked with identifying and describing
Tableau: Clustering loans
Background
As a data scientist working for Lending Club, you have been tasked with identifying and describing the types, or clusters, of customers you have.
Tasks
Complete each step as outlined in the questions below.
Data Source
Use the lclarge.csv file available to download below.
Drag the table: lcLoans into the entity view in Tableau. Select the "Extract" option for the connection in Tableau.
Data Dictionary:
Features about the loan
loanstatus: current status of the loan
loanstatusnumeric: a rankordered numeric version of loanstatus
loanamount: the listed amount of the loan applied for by the borrower
issued: the date the loan was fundedissued
term: the number of payments on the loan
intrate: the interest rate on the loan
installment: the monthly payment owed by the borrower
totalpymnt: payments received to date for total amount funded
totalrecprncp: payments received to date for total amount funded
totalrecint: interest received to date
totalreclatefee: late fees received to date
recoveries: post charge off gross recovery ie if the loan was charged off, how much money was recovered afterward, if any
title: the loan title provided by the borrower
purpose: a category provided by the borrower for the loan request
Features obtained from the borrower before the loan was issued
emptitle: the job title supplied by the borrower
emplength: employment length in years
homeownership: the homeownership status provided by the borrower
annualincome: the selfreported annual income provided by the borrower
verificationstatus: was income verified by LC the source, or not verified
Features obtained from the credit bureau about the borrower before issued
accnowdelinq: the number of accounts on which the borrower is now delinquent
delinqyrs: the number of days pastdue incidences of delinquency in the borrower's credit file for the past years
earliestcrline: the month the borrower's earliest reported credit line was opened
inqlastmths: the number of unsecured inquiries in the past months
mthssincelastdelinq: the number of months since the borrower's last delinquency
mthssincelastrecord: the number of months since the last public record
openacc: the number of open credit lines in the borrower's credit file
pubrec: number of derogatory public records
revolbal: total credit revolving balance
revolutil: the amount of credit the borrower is using relative to all available revolving credit
totcollamt: total collection amounts ever owed
totcurbal: total current balance of all accounts
totalacc: the total number of credit lines currently in the borrower's credit file
totalrevhilim: total credit limit on revolving accounts
Features engineered by LC based on the credit bureau data
dti: a ratio calculated using the borrower's total monthly payments on the total debt obligations, excluding mortgages and the requested LC loan, divided by the borrower's combined selfreported monthly income
grade: the likelihood that the loan will be paid back
subgrade: a more granular version of grade
Deliverables
In addition to answering the questions below, you will upload a twbx file from Tableau that includes all of the work you did to complete this assessment.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
