0% found this document useful (0 votes)

117 views6 pages

PSI and KS Statistic

PSI AND KS STATIC

Uploaded by

Ganesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

117 views6 pages

PSI and KS Statistic

PSI AND KS STATIC

Uploaded by

Ganesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Paper 132

Population Stability and Model Performance Metrics Replication for

Business Model at SunTrust Bank
Bogdan Gadidov, Kennesaw State University; Benjamin McBurnett, Georgia Institute of
Technology

Abstract
Board of Governors of the Federal Reserve System has published Supervisory Guidance on
Model Risk Management (SR Letter 11-7) emphasizing that banks rely heavily on quantitative
analysis and models in most aspects of financial decision making. Ongoing monitoring and
maintenance (M&M) is essential for timely evaluation of model performance to determine
whether changes in business strategies and market conditions require adjustment,
redevelopment, or replacement of the model. A typical M&M plan includes tracking of
Population Stability Index (PSI), Rank Ordering Test, and Kolmogorov-Smirnov Statistic (KS).
As part of an internship program at SunTrust bank, I was able to track these key metrics for one
business critical model. The model uses a logistic regression to predict the probability of default
for a given customer.

To track the three metrics stated above, data from quarter 1 of 2014 is compared with a
baseline distribution, generally the dataset which is used to create the model. PSI quantifies the
shift in the distribution of the population between the baseline and current time periods. Rank
Ordering Testing involves comparing the expected default rate, predicted by the model, to the
actual default rate in the current quarter. The KS statistic assesses model performance by
measuring the model's ability to discern defaults from non-defaults. The npar1way procedure
was used in SAS® to calculate KS. Reports and charts presented in this poster will be sanitized
due to the confidential nature of the data, but methodology and step-by-step procedures
represent actual research results.

Introduction
The three metrics will be used together to assess how well the model is performing in the
validation period. Each observation in the datasets contains a score which is equal to the
probability of default multiplied by 10,000. The score is created in such a way to be similar to a
FICO score. For the calculation of PSI and rank ordering testing, the observations are grouped
into bins based off of the score of each observation. 8 bins are created by using predefined
cutoff points as the intervals. The number of observations in each bin does not necessarily have
to be the same. The percentage of defaulting customers in each of these bins will be used in
calculations in further sections.

PSI
PSI quantifies shifts in population dynamics over time. As models are based on historical
datasets, it is necessary to ensure that present-day population features are sufficiently similar to
the historical population from which the model is based. A higher PSI corresponds to greater
shifts in population. Generally, PSI values greater than 0.25 indicate a significant shift in the
Population Stability and Model Performance Metrics Replication for Business Model at SunTrust Bank, continued SESUG 2015

population, while values less than 0.10 indicate a minimal shift in the population. Values
between 0.10 and 0.25 indicate a minor shift. The formula for PSI is shown below, where ndi is
the number of observations in the ith bin of the development dataset, nvi is the total number of
observations in the ith bin of the validation dataset, and Ndi and Nvi are the total number of
observations in the development and validation datasets respectively. This formula is used to
calculate the PSI for each of the 8 bins, and the total PSI is the summation of the individual
PSI's from each bin.

Eq. 1.

Table 1 below shows the individual calculations of the PSI for each bin. The percentage of the
observations which lie in each bin are shown for both the development and validation datasets
in the Dev Percent and Val Percent columns. The PSI column shows the calculated PSI for
each bin, using the formula from above. The value which is bolded in the Cumulative PSI
column shows the overall value for the PSI across all 8 bins. Using the guidelines as defined
earlier, this value is much less than 0.1, indicating a minimal shift in the population between
development and validation periods.

Score Dev Dev Val Val Cumulative

Bin PSI
Range Frequency Percent Frequency Percent PSI
>1400 1 8846 1.34 8074 1.62 0.00023 0.00023
440-1400 2 18990 2.88 15241 3.05 0.00004 0.00027
200-440 3 35537 5.39 26272 5.26 0.00001 0.00028
90-200 4 74324 11.28 54985 11.01 0.00003 0.00031
40-90 5 92214 14.00 68979 13.81 0.00001 0.00032
18-40 6 105203 15.97 79916 16.00 0.00000 0.00032
8-18 7 223414 33.91 169095 33.85 0.00000 0.00032
<=8 8 100347 15.23 76954 15.41 0.00001 0.00033
Table 1. PSI Calculations

Rank Ordering Testing

The second metric also identifies shifts in population. Rank ordering is calculated by considering
the percentage of “bads” (typically defaults, delinquencies, etc.) per given score ranges within
the development and validation datasets. The expected event rate is calculated for each bin
from the development dataset, using the calculated probabilities of default from the logistic
regression. The actual event rate is calculated from the validation dataset by finding the
percentage of defaults within each bin of observations. A monotonically decreasing pattern
should be seen among the bins, as the higher score ranges have higher rates of default, which
can be seen in Table 2. A 95% confidence interval is then calculated between the difference of
the expected and actual event rates. Confidence intervals which do not contain 0 have a
statistically significant difference between the expected and actual default rates.

2
Population Stability and Model Performance Metrics Replication for Business Model at SunTrust Bank, continued SESUG 2015

Expected vs. Statistically

Score Expected Actual Lower Upper
Bin Actual % Significant
Range Event Rate Event Rate 95% CI 95% CI
Difference Difference
>1400 1 45.47% 40.07% 5.40% 3.91% 6.89% Yes
440-1400 2 10.54% 8.07% 2.47% 1.86% 3.09% Yes
200-440 3 4.43% 3.34% 1.09% 0.79% 1.40% Yes
90-200 4 1.87% 1.48% 0.39% 0.25% 0.53% Yes
40-90 5 0.96% 0.71% 0.25% 0.16% 0.34% Yes
18-40 6 0.41% 0.51% -0.10% -0.16% -0.03% Yes
8-18 7 0.15% 0.21% -0.06% -0.09% -0.03% Yes
<=8 8 0.09% 0.10% -0.02% -0.04% 0.01% No
Table 2. Rank Ordering Testing

KS Statistic
The KS statistic is used as a measure of the ability of the model to separate good and bad
accounts. The KS statistic is calculated manually and also through the proc npar1way
procedure in SAS. To calculate KS manually, each dataset is divided into 10 groups (deciles).
Since the score values occur in discrete intervals, each decile contains approximately 10% of
the dataset. Once the deciles are obtained, the cumulative percentage of defaults and non-
defaults is calculated across the 10 deciles. The KS is the maximum difference between the
cumulative percentage of defaults and non-defaults.

Table 3. KS for Development Dataset

3
Population Stability and Model Performance Metrics Replication for Business Model at SunTrust Bank, continued SESUG 2015

Table 4. KS for Validation Dataset

Table 3 and Table 4 show the calculations for the KS statistic in both the development and
validation datasets. Each table shows the 10 deciles and the number of defaults within each
decile. Using this, the cumulative percentage of the defaults and non-defaults can be calculated
for each decile, and the difference is shown in the last column. The point where the difference is
the largest is the value of the KS statistic. A graph in Figure 1 is used to visualize where this
maximal KS occurs.

100
90
80
Cumulative % Defaults

70
60
50 Development
40 Validation
30 Random
20
10
0
0 20 40 60 80 100
Cumulative % Non-Defaults

Figure 1. Graphical Comparison between Development and Validation KS

It is important to see from the figure above that the shape of the KS curves for both the
development and validation datasets have approximately the same shape, indicating that the
maximal separation occurs around the same point in the second decile. The red line represents
"random" guessing at assigning defaults and non-defaults. The distance between the red line
and the point on green or blue curve represents the value of the KS. It can be seen that the
validation KS is slightly less than the development KS. This is expected as model performance
will deteriorate over time. The difference, however, is relatively small. Some cutoff points for
acceptable KS ranges are:

• Validation KS > 50 indicates excellent model performance

4
Population Stability and Model Performance Metrics Replication for Business Model at SunTrust Bank, continued SESUG 2015

• Validation KS < 50 and less than 20% decrease from development KS indicates
acceptable performance
• Validation KS < 50 and more than 20% decrease from development KS indicates a
deterioration of model performance

In addition to calculating KS manually, the npar1way procedure can be performed in SAS to

achieve similar results. The output from this procedure appear in Output 1 and Output 2 below.
The value of the KS statistic calculated in tables 3 and 4 can be compared to the D value in the
outputs below.

Output 1. SAS Output from Proc Npar1way for Development Dataset

Output 2. SAS Output from Proc Npar1way for Validation Dataset

Conclusion
The three metrics discussed, PSI, rank ordering testing, and KS statistic, can be used in
conjunction to assess model performance. PSI and rank ordering testing focus more on how the
population may have shifted between development and validation periods, while the KS statistic
is used to assess the predictive capability and performance of the model. Of the three metrics,
PSI and KS should be more closely monitored when making decisions regarding the model and
its performance. For the model evaluated in this paper, the PSI was well within acceptable
ranges (<0.1). The KS statistic decreased by approximately 4 between development and
validation periods, but using the guidelines described in the previous section, since the KS is
greater than 50, the model is still performing well within acceptable range. The rank ordering
testing should be used in conjunction with the other two statistics. The rank ordering testing
suggests that the model is over-predicting the actual default rate, but since the other two
statistics agree that the model performance has not deteriorated, it can be determined that the
model is still performing within acceptable standards.

Acknowledgments
Special thanks to Alex Shenkar at SunTrust Bank who provided materials and assisted with this
project throughout the internship program.

Contact Information
Your comments and questions are valued and encouraged. Contact the author at:
Bogdan Gadidov
Kennesaw State University
[email protected]

5
Population Stability and Model Performance Metrics Replication for Business Model at SunTrust Bank, continued SESUG 2015

SAS and all other SAS Institute Inc. product or service names are registered trademarks or
trademarks of SAS Institute Inc. in the USA and other countries. ® indicates USA registration.
Other brand and product names are trademarks of their respective companies.

Solution Manual For Introductory Statistics 8th Edition by Mann
44% (16)
Solution Manual For Introductory Statistics 8th Edition by Mann
5 pages
Exercises Session 03
No ratings yet
Exercises Session 03
5 pages
Exercises 1
100% (1)
Exercises 1
7 pages
Week Two Assignment A
No ratings yet
Week Two Assignment A
1 page
Razavi Monolithic Phase-Locked Loops and Clock Recovery Circuits
No ratings yet
Razavi Monolithic Phase-Locked Loops and Clock Recovery Circuits
39 pages
Solar Battery Charger Circuit Guide
No ratings yet
Solar Battery Charger Circuit Guide
3 pages
Waves Exam Q
0% (1)
Waves Exam Q
24 pages
CH #4 Analysis of Result
No ratings yet
CH #4 Analysis of Result
1 page
Steel Detaing Part1
No ratings yet
Steel Detaing Part1
114 pages
Solenoid Valve 2/2 Way N.O. Direct Acting - Dampness-Proof IP 67
No ratings yet
Solenoid Valve 2/2 Way N.O. Direct Acting - Dampness-Proof IP 67
2 pages
Software Reliability and Quality Management: Version 2 CSE IIT, Kharagpur
No ratings yet
Software Reliability and Quality Management: Version 2 CSE IIT, Kharagpur
6 pages
Sums For Practice in Statistics
No ratings yet
Sums For Practice in Statistics
5 pages
Inverse of A Matrix
100% (1)
Inverse of A Matrix
71 pages
Statistical Data Assignment 1
No ratings yet
Statistical Data Assignment 1
6 pages
Probability & Statistics Problem Set
No ratings yet
Probability & Statistics Problem Set
2 pages
Support Vector Machines Based On K-Means Clustering For Real-Time Business Intelligence Systems
No ratings yet
Support Vector Machines Based On K-Means Clustering For Real-Time Business Intelligence Systems
11 pages
Statistical Foundation For Analytics-Module 1
No ratings yet
Statistical Foundation For Analytics-Module 1
18 pages
Wipro Technical Interview Questions
No ratings yet
Wipro Technical Interview Questions
3 pages
Assignment Module04 Part2 KI 20220407
100% (1)
Assignment Module04 Part2 KI 20220407
6 pages
Set+1 Descriptive+statistics+Probability+
100% (1)
Set+1 Descriptive+statistics+Probability+
4 pages
Ijert Ijert: Decision Making To Predict Customer Preferences in Life Insurance
No ratings yet
Ijert Ijert: Decision Making To Predict Customer Preferences in Life Insurance
4 pages
Modified Compressed Air Engine Two Stroke Engine Working On The Design of A Four Stroke Petrol Engine
No ratings yet
Modified Compressed Air Engine Two Stroke Engine Working On The Design of A Four Stroke Petrol Engine
3 pages
Evaluating Student's Performance Using K-Means Clustering: Rakesh Kumar Arora, Dr. Dharmendra Badal
No ratings yet
Evaluating Student's Performance Using K-Means Clustering: Rakesh Kumar Arora, Dr. Dharmendra Badal
5 pages
Diffusion of Solids in Liquids
No ratings yet
Diffusion of Solids in Liquids
8 pages
Data Science Student Project
No ratings yet
Data Science Student Project
11 pages
Data Mining 4th Is
No ratings yet
Data Mining 4th Is
24 pages
Hydraulics Course for Marine Engineers
No ratings yet
Hydraulics Course for Marine Engineers
1 page
25 Question Paper
No ratings yet
25 Question Paper
4 pages
Business Stats for Managers
No ratings yet
Business Stats for Managers
19 pages
Assignment #2 Confidence Interval Estimation
No ratings yet
Assignment #2 Confidence Interval Estimation
5 pages
Es 103 - Module 4 - Shearing Deformation
No ratings yet
Es 103 - Module 4 - Shearing Deformation
21 pages
Quiz Sample of Business Statistic'S: January 2020
No ratings yet
Quiz Sample of Business Statistic'S: January 2020
79 pages
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
No ratings yet
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
2 pages
ChatGPT in Exploratory Data Analysis
No ratings yet
ChatGPT in Exploratory Data Analysis
6 pages
Set 1 Descriptive Statistics Probability 2 4 Docx Completed
No ratings yet
Set 1 Descriptive Statistics Probability 2 4 Docx Completed
4 pages
PLC Components & Functions Guide
No ratings yet
PLC Components & Functions Guide
2 pages
Thống Kê Trong Kinh Doanh
No ratings yet
Thống Kê Trong Kinh Doanh
5 pages
Regression Analysis Q&A Guide
No ratings yet
Regression Analysis Q&A Guide
2 pages
BTEC Business Statistics Guide
No ratings yet
BTEC Business Statistics Guide
21 pages
Paper 1 73
No ratings yet
Paper 1 73
6 pages
November 2010)
No ratings yet
November 2010)
6 pages
Chapter 2
No ratings yet
Chapter 2
17 pages
ISOM2500Practice - Quiz 1
No ratings yet
ISOM2500Practice - Quiz 1
6 pages
Cmam2022 285 290
No ratings yet
Cmam2022 285 290
6 pages
DWDM Notes Unit-4
No ratings yet
DWDM Notes Unit-4
89 pages
Business Statistics Mock Test
No ratings yet
Business Statistics Mock Test
19 pages
Lecture 9 Statistical Learning
No ratings yet
Lecture 9 Statistical Learning
3 pages
Hyd Cylinder Details Jyo Make
No ratings yet
Hyd Cylinder Details Jyo Make
4 pages
Financial Risk Analysis Guide
No ratings yet
Financial Risk Analysis Guide
49 pages
Grade 7 Science: Heat & Energy
No ratings yet
Grade 7 Science: Heat & Energy
9 pages
Model Evaluation Metrics 1683566651
No ratings yet
Model Evaluation Metrics 1683566651
12 pages
Chemsheets AS 1051 Hesss Law 2 Combustion
100% (1)
Chemsheets AS 1051 Hesss Law 2 Combustion
2 pages
Tree
No ratings yet
Tree
7 pages
Aging Performance and Moisture Solubility of Veg. Oils For Power Trfs.
No ratings yet
Aging Performance and Moisture Solubility of Veg. Oils For Power Trfs.
6 pages
547-Article Text-1844-1-10-20210628
No ratings yet
547-Article Text-1844-1-10-20210628
7 pages
Fortnightly Test Series 2023 24 - RM (P1) Test 01A
No ratings yet
Fortnightly Test Series 2023 24 - RM (P1) Test 01A
20 pages
Capastone - Project - Subash Karnatakapu
No ratings yet
Capastone - Project - Subash Karnatakapu
54 pages
Business Analytics Module2
No ratings yet
Business Analytics Module2
9 pages
Banking Project Final
No ratings yet
Banking Project Final
38 pages
Tutorial DataMiningENG
No ratings yet
Tutorial DataMiningENG
8 pages
Vewlix VLX 1 Base
No ratings yet
Vewlix VLX 1 Base
6 pages
Data Preparation DM
No ratings yet
Data Preparation DM
26 pages
Excel & Python Statistical Functions
No ratings yet
Excel & Python Statistical Functions
44 pages
Sta15m1 CH 1 - Exercises
No ratings yet
Sta15m1 CH 1 - Exercises
6 pages
Name of Subject: BUSINESS STATISTICS
No ratings yet
Name of Subject: BUSINESS STATISTICS
29 pages
Act. 2 - Micropipetting Techni
No ratings yet
Act. 2 - Micropipetting Techni
29 pages
CCW331 Set4
No ratings yet
CCW331 Set4
5 pages
Proplem Chapter 2.pdf - 2023.02.03 - 12.38.41pm
No ratings yet
Proplem Chapter 2.pdf - 2023.02.03 - 12.38.41pm
7 pages
Final - Bank Customer Response Prediction Model
No ratings yet
Final - Bank Customer Response Prediction Model
23 pages
ET - W2021 (2131905) (GTURanker - Com)
No ratings yet
ET - W2021 (2131905) (GTURanker - Com)
2 pages
DSOST2
No ratings yet
DSOST2
44 pages
Vector Addition Activity
No ratings yet
Vector Addition Activity
4 pages
DA Caravan 6672064
No ratings yet
DA Caravan 6672064
26 pages
Peniel Favour
100% (2)
Peniel Favour
4 pages
SCSA1606 - Predictive and Advanced Analytics - Unit II
No ratings yet
SCSA1606 - Predictive and Advanced Analytics - Unit II
50 pages
QUANT EXCELddjdjjddjdjdjdjdjdjdididdddddd
No ratings yet
QUANT EXCELddjdjjddjdjdjdjdjdjdididdddddd
24 pages
Worksheet of Business Statistics Ch-2 & 3
No ratings yet
Worksheet of Business Statistics Ch-2 & 3
3 pages
Introduction To DM
No ratings yet
Introduction To DM
27 pages
Simulation Methods 2
No ratings yet
Simulation Methods 2
19 pages
DMBI FH 2023 Solution
No ratings yet
DMBI FH 2023 Solution
14 pages
A Novel Online Machine Learning Approach For..
No ratings yet
A Novel Online Machine Learning Approach For..
7 pages
Illustration Cluster Analysis (K-Means)
No ratings yet
Illustration Cluster Analysis (K-Means)
8 pages
Class12 CS Practical File Slides Guidelines
No ratings yet
Class12 CS Practical File Slides Guidelines
12 pages
MBS702 Fe 0124
No ratings yet
MBS702 Fe 0124
14 pages
Quiz Questions
No ratings yet
Quiz Questions
2 pages
13 Chapter 5
No ratings yet
13 Chapter 5
38 pages
Xii Aec Objective CA & Cuet Qp-7
No ratings yet
Xii Aec Objective CA & Cuet Qp-7
4 pages

PSI and KS Statistic

Uploaded by

PSI and KS Statistic

Uploaded by

Paper 132

Population Stability and Model Performance Metrics Replication for

Score Dev Dev Val Val Cumulative

Rank Ordering Testing

Expected vs. Statistically

Table 3. KS for Development Dataset

Table 4. KS for Validation Dataset

Figure 1. Graphical Comparison between Development and Validation KS

• Validation KS > 50 indicates excellent model performance

In addition to calculating KS manually, the npar1way procedure can be performed in SAS to

Output 1. SAS Output from Proc Npar1way for Development Dataset

Output 2. SAS Output from Proc Npar1way for Validation Dataset

You might also like