Objective: Create a model that predicts whether or not a loan will be default using historical data.
Problem Statement:
For companies like Lending Club correctly predicting whether or not a loan will be a default is very important. In this project, using historical data from to you have to build a deep learning model to predict the chance of default for future loans. As you will see later, this dataset is highly imbalanced and includes a lot of features that make this problem more challenging.
Domain: Finance
Analysis to be done: Perform data preprocessing and build a deep learning prediction model.
Content:
Dataset columns and definition:
credit.policy: if the customer meets the credit underwriting criteria of LendingClub.com, and otherwise.
purpose: The purpose of the loan takes values "creditcard", "debtconsolidation", "educational", "majorpurchase", "smallbusiness", and "allother"
int.rate: The interest rate of the loan, as a proportion a rate of would be stored as Borrowers judged by LendingClub.com to be more risky are assigned higher interest rates.
installment: The monthly installments owed by the borrower if the loan is funded.
logannual.inc: The natural log of the selfreported annual income of the borrower.
dti: The debttoincome ratio of the borrower the amount of debt divided by annual income
fico: The FICO credit score of the borrower.
days.with.crline: The number of days the borrower has had a credit line.
revol.bal: The borrower's revolving balance the amount unpaid at the end of the credit card billing cycle
revol.util: The borrower's revolving line utilization rate the amount of the credit line used relative to total credit available
inq.lastmths: The borrower's number of inquiries by creditors in the last months.
delinqyrs: The number of times the borrower has been days past due on a payment in the past years.
pub.rec: The borrower's number of derogatory public records bankruptcy filings, tax liens, or judgments
Steps to perform:
Perform exploratory data analysis and feature engineering and then apply feature engineering. Follow up with a deep learning model to predict whether or not the loan will be default using the historical data.
Tasks:
Feature Transformation
Transform categorical values into numerical values discrete
Exploratory data analysis of different factors in the dataset.
Additional Feature Engineering
You will check the correlation between features and drop those features that have a strong correlation. This will help reduce the number of features and leave you with the most relevant features.
Modeling After applying EDA and feature engineering, you are now ready to build the predictive models. In this part, you will create a deep learning model using Keras with Tensorflow backend.