Credit risk poses a classification problem that’s inherently imbalanced. This is because healthy loans easily outnumber risky loans. For this project I used various techniques to train and evaluate models with imbalanced classes. Using a dataset of historical lending activity from a peer-to-peer lending services company, I built a model that can identify the creditworthiness of borrowers.
The technologies required for the program to run are as follows:
pandas
https://pandas.pydata.org/
numpy
https://numpy.org/
sklearn
https://scikit-learn.org/stable/
imblearn
https://imbalanced-learn.org/stable/
-
This project consists of the following subsections:
-
Split the Data into Training and Testing Sets
-
Create a Logistic Regression Model with the Original Data
-
Predict a Logistic Regression Model with Resampled Training Data
-
Write a Credit Risk Analysis Report
-
Scott J. Marler
LinkedIn Profile: https://www.linkedin.com/in/scott-marler-212040b6/