Question: In this example we will consider data form a consumer - to - consumer ( C 2 C ) lending market, in which borrowers can
In this example we will consider data form a consumertoconsumer CC lending market, in which borrowers can post loan listings and lenders can invest in these loans. From the lenders perspective, it would be useful to get a sense of how likely the loan is to default, given some information on it Data is available you can see it in the file loandatamodifiedcsv on approximately loans and whether these loans ultimately failed defaulted or were current. In addition to this information, we also have data on the following features:
a The amount of the loan in $
b The age of the loan in months
c The borrower rate the interest rate that the borrower pays the lender
d The borrowers credit rating which is either good or low Since this is a categorical feature, we convert it into a dummy numerical feature which takes value for low and for good. Note: the original data has many credit rating categories that range from AA B C all the way down to High Risk. For simplicity, I clubbed these ratings into only two categories, either good or low
A logistic model is trained on this data, with faildefault being the positive case and loan current being the negative case. The following model is obtained:
coefficients:
Estimate
Intercepte
Amount e
Age e
Borrower.Rate e
RatingLow e
ii From the model above, what is the predicted probability that a loan will default if it is for $ has age months, a borrower rate of and the borrowers credit rating is good Note: this means that the ratinglow feature takes a value of in the model In answering this question, it is helpful to understand the standard e and e notation that you see in the output above as follows: e
ee and e
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
