MODELLING THE PROFITABILITY OF CREDIT CARDS FOR DIFFERENT TYPES OF BEHAVIOUR WITH PANEL DATA Professor Jonathan Crook, Denys Osipenko
Content 2 Credit card dual nature System of statuses Multinomial logistic regression Macroeconomics and cycles Panel data and regression Utilization rate modelling interst income Profitability modelling non-interest income Modelling examples Exposure at default prediction Resume and client behaviour
Credit card dual nature 3 Credit Card has dual nature: Payment tool and Loan Payment tool Credit card Convenie nt loan What is the confident definition for revolver? Positive outstanding balance during 3 months, 6 months,? If only once paid off full amount? Outstanding balance is stochastic value in the range from 0 to Limit. Clients is split up two group: revolvers and transactors Revolver user, who carry a positive credit card balance and not pay off the balance in full each month roll over Transactor user, who pay in full on or before the due date of the interest-free credit period Competent user do not incur any interest payments or finance charges Credit cards dual nature and profitability were investigated by: Crook, Hamilton, Thomas (992) Banasik, Crook, Thomas (200) Ma, Crook, Ansell (200) So, Thomas (2008) Cheu, Loke (200) Tan, Steven, Yen (20)
Logistic regression means the binary target. Multinomial regression uses more that two values target. Credit Cards Statuses 4 Inactive The main task to estimate the probability of transition from status to status on the client level. Default Delinquent Revolver (standard) Transactor Transition matrix (classic approach) is calculated on the portfolio/pool level The problem the number of statuses which account can transfer to is more than two (for example, revolver can be transactor, delinquent, stay revolver, or inactive)
Multinomial logistic regression 5 We need to predict the probability of transition to the certain state (0,, 2, ) Multinomial logistic regression is a regression model which generalizes logistic regression by allowing more than two discrete outcomes This kind of the models can show weaker results than ordinal logistic regression, but better than tree of conditional logistic Cumulative probability of the transition for two different statuses
Map of statuses 6 Account status Definition Risk assessme nt Revenue assessment Note closed Account is closed or inactive more than 6 months No No Exclude from analysis inactive transact or OB (-6M) = 0 and Turnover (-6M) = 0 OB (-6M) = 0 and Turnover (-6M) <> 0 No No R No OB TR PR TR No predicted revenue and risk (avg interchange rate + fees rate)*tr current OB > 0 and DPD = 0 Behaviour al Score Limit*IR*PR + Beh. and Revenue Rate Scorecards for B0 R TR Current delinque nt OB > 0 and DPD > 0 and DPD <=90 Behaviour al Score Delinq. R TR + Penalty? Beh. and Revenue Rate Scorecards for B-3 defaulte d OB > 0 and DPD > 90 LGD - Recovery is not revenue
Credit cards income sources 7 Status Interest Fees/Interchang e Penalty Non active - - - Transactor -/+ + - Current (revolver) + + - Delinquent - + + Defaulted - - +/- Different income sources can be applied on different life-time stages Delinquent customer can bring an income, but not defaulted client
Credit Limit = 3000 Credit line Income Prediction 8 For interest income Utilization = Balance / Credit Limit IR_Income = Utilization Rate x Limit x IR For non-interest income (POS, ATM, Interchange etc.) POS Income = TR Debit_POS x POS_fees_rate Interchange = TR Debit_POS x Interchange_fees_rate Cash Withdrawal Income = TR_Debit_ATM x ATM_fees_rate 50% X IR = IR Income Interest Income from Balance: 500 UAH X 36% /2 = 45 UAH Monthly transactions: 000 UAH POS X 2% = 20 UAH 500 UAH ATM X 2,5% = 2,5 UAH Total Non_Interest Income = 32,5 UAH Total Income = 45+32,5 = 77,5 UAH
Rates Modelling 9 First approach logistic regression with binary transformation p i T logit pi ln w w x w x wp x p w x p 0 2 2 i where p i is the probability of particular outcome; w 0...w p are regression weights; x...x p are characteristics. For UT rate is the share of balance - the weighted logistic regression with binarization. Parameters Outcome Binarization Parameters Outcome Weight X 0,75 X 0,75 X 0 0,25 Second approach Linear regression with Beta-transformation Distribution density function given via Gamma function It s possible to build a wide variety of distribution shapes
20042 200503 200506 200509 20052 200603 200606 200609 20062 200703 200706 200709 20072 200803 200806 200809 20082 200903 200906 200909 20092 20003 20006 20009 2002 2003 2006 2009 202 20203 20206 Macroeconomics and cycles 0 Unemployment Rate 0.0% 9.0% 8.0% 7.0% 6.0% 5.0% 2008/03 2009/03 20/02 50.0% 40.0% 30.0% 20.0% 0.0% UnemplRate ln Unempl_mom 4.0% 3.0% 2.0%.0% 0.0% -0.0% -20.0% ln Unempl_qoq ln Unempl_yoy 0.0% -30.0% Macroeconomic indicators contain cycles and fluctuations. They have an impact on client behaviour as systematic factor
ln x Utilization rate, % Correlation of macro- and micro indicators 0.6 The most correlated Macro Indicators and UT rate 0.7 0.4 0.6 0.2 0 0.5 0.4 Unemployment_lny oy UAH- EURRate_lnyoy CPIYear_lnyoy2-0.2 0.3 avgut avgut lag3-0.4 0.2-0.6 0. -0.8 More or less stable correlation can be observed between micro characteristics (like utilization rate) and macro indicators. The correlation between deltas (changes) of indicators is more stable. 0
Panel data and panel 2 regression Types of the data in econometrics: Cross-sectional by economic items at the same point of time (without any relation to the time) Time series observation of the economic values ranked in time In practice often this two dimensions is joined: Independent join (not ranked in time) pooled data Data slices Balance Jan, Balance Feb as Balance row, Balance row 2 Panel data two-dimension array (cross-sectional data ranked as time series) Ranked Data slices - Balance Jan, Balance Feb as Balance t=, Balance t=2
Panel data advantages 3 Higher number of observations results increase in the levels of freedom, gives more efficient estimations Heterogeneity of the sample objects is under control Testing of the effects which is impossible to identify separately in cross-sections and time series Decrease in multicollinearity It s possible to build more complicated behavioural models and decrease the influence of the missing values and incorrectly measured observations
Panel linear regression 4 y uit it x T it u i t it it i index individual, t index time, β vector of regression coefficients, x it T transposed vector of observations independent characteristics. μ i, λ i non-observed individual and time effects, υ it residual idiosyncratic components. y it X it it Pooled model α и β intercept and slope is independent from observation and time Х it - vector of regressors (predictors) Approach with time slices is widely applied as industry standard, for instance, to create development and validation samples from the data set with not enough observation at the point in time or to take into account different seasons. Assumption: dependence between factors is stable in time correlation between observations is not taking into account But in practice it is not true!
Panel linear regression 5 Fixed effect model y it it ui X it vit i - individual intercept (specific effect) Intercept is varying across groups and/or times Error variance is constant Random effect model y u X i v it it u i v it - Random effect Intercept is constant Error variance is varying across groups and/or times
Utilization Rate Modelling 6 UT Regression equation: utilization rate depends from behavioural, application, macroeconomic characteristics, and also from utilization rate with time lag it UT i UT T T t 2 it2 4 i( tl) b bi, t a ai m, t l UT K k B L l A M m M φ, α, β, γ regression coefficients (slopes) B vector of behavioural factors (for example, average balance to maximum balance, maximum debit turnover to average outstanding balance or limit, maximum number days in delinquency, etc for some periods of time) A vector of application factors - client s demographic, financial and product characteristics like age, education, position, income, property, interest rate, etc. M vector of macroeconomic factors (GDP, FX, Unemployment rate changes, etc.) UT utilization rate
Interest Income modelling,, ) ( t m M m ai L l a K k t bi b T l l t i T l it M A B UT T UT T Average utilization rate for the period of T months Income = Avg UT() x IR ()x Limit() Average Income (-T) = Avg UT(-T) x IR x Average Limit (- T) Average Income for period of T months depends of average utilization rate, average credit limit and interest rate
Non-interest income modelling 8,, ) ( ln m t M m m ai L l a K k t ki k T l l t i M A B UT T P P st stage estimation of the probability that the client will use credit cards for POS/ATM transaction during the forecast period 2 nd stage income amount for the period,, ) ( t m M m m ai L l a K k t bi k T n n t i it M A B UT T POS φ, α, β, γ regression coefficients (slopes) B vector of behavioural factors (for example, average balance to maximum balance, maximum debit turnover to average outstanding balance or limit, maximum number days in delinquency, etc for some periods of time) A vector of application factors - client s demographic, financial and product characteristics like age, education, position, income, property, interest rate etc) M vector of macroeconomic factors (GDP, FX, Unemployment rate changes, etc)
0-0. 0.-0.2 0.2-0.3 0.3-0.4 0.4-0.5 0.5-0.6 0.6-0.7 0.7-0.8 0.8-0.9 0.9-9 Utilization rate and Income distributions 6% 4% 2% 0% 8% Utilization Rate Density Distribution for active accounts Utilization rate density may have an U-shape distribution, can be approximated, as option, by beta-distribution 6% 4% 2% 0% POS income (interchange fees) may have exponential distribution It s necessary to filter a lot of insufficient amounts and enormous outliers 35.00% 30.00% 25.00% 20.00% 5.00% 0.00% 5.00% 0.00% Average POS income amount Итог
2-23 24-26 27-29 30-32 33-35 36-38 39-4 42-44 45-47 48-50 5-53 54-56 57-59 Utilization rate by characteristics Ut Rate - Age Ut Rate - Education 70% 60% 50% 40% 30% 20% 0% 0% Ut Rate 70% 60% 50% 40% 30% 20% 0% 0% Ut Rate Ut Rate - Industry Ut Rate - Position 70% 60% 50% 40% 30% 20% 0% 0% Ut Rate 70% 60% 50% 40% 30% 20% 0% 0% Ut Rate
2 Example of utilization rate model coefficients estimation Parameter Estimates Pooled Random Effect Variable Estimate Standard Standard t Value Pr > t Estimate Error Error t Value Pr > t Intercept 0.08887 0.0059.9 <.000 0.22476 0.00539 4.63 <.000 UT to AvgUT3-0.09 0.000588-53.7 <.000-0.0046 0.00045-23.22 <.000 OBalance Avg To MaxOBalance3 0.856652 0.00092 929.9 <.000 0.46893 0.0005 448.55 <.000 Tr_max_debit3_To_Limit 0.2074 0.000795 5.8 <.000 0.2524 0.00074 57.56 <.000 Age -0.007 0.000034-50.8 <.000-0.00285 0.00033-2.42 <.000 Edu_ High 0... 0... Edu_ Secondary 0.047508 0.000844 56.3 <.000 0.085285 0.00334 25.55 <.000 Edu_ Special 0.037003 0.00077 5.57 <.000 0.069385 0.00284 24.46 <.000 Edu_ Two Degree/PhD -0.0847 0.007-0.82 <.000-0.04267 0.0068-6.28 <.000 Marital_ Civil 0.08075 0.0039 3.04 <.000 0.03343 0.00552 6.06 <.000 Marital_ Divorced 0.093 0.00094 2.4 <.000 0.03376 0.00362 9.3 <.000 Marital_ Married 0... 0... Marital_ Single 0.0285 0.000782 6.43 <.000 0.07073 0.003 5.49 <.000 Marital_ Widow 0.040384 0.0074 23.27 <.000 0.066977 0.00687 9.75 <.000 position_ Employee 0... 0... position_ Manager -0.00554 0.000908-6.09 <.000-0.02 0.00362-3.07 0.002 Position_ Other 0.0046 0.00092.37 <.000 0.06682 0.00366 4.56 <.000 position_ Technical 0.029568 0.0027 0.93 <.000 0.046394 0.008 4.3 <.000 position_ Top 0.05995 0.000777 20.59 <.000 0.02554 0.00308 8.28 <.000 UAH_EUR Rate_ln yoy_lag3m -0.24903 0.00563-44.22 <.000-0.2342 0.00389-54.93 <.000 Unempl_ln yoy_lag3m -0.09404 0.04-6.67 <.000-0.02362 0.00974-2.43 0.053 FDI_ln yoy_lag3m 0.3002 0.00946 3.76 <.000 0.9322 0.00652 29.62 <.000
Example of linear regression avg POS amount Variable Standard Estimate Error t Value Pr > t Intercept 2.4686 5.9507 2. 0.036 Limit 0.00053 0.00007 30.66 <.000 customer_income 0.000437 0.000039.3 <.000 other_income 0.00036 0.000064 4.94 <.000 spouse_income 0.000242 0.000032 7.47 <.000 UnemplRate_5-47.793 75.5347 -.96 0.0504 Unempl_lnyoy_3 2.6809.8743.6 0.2474 UAH_EURRate_lnyoy_3 5.580589 2.234 2.52 0.07 b_avg_ut3-3.63053 0.694-5.86 <.000 b_avg_ut6 0.65986 0.5647.7 0.2426 b_avgob3_to_maxob3 -.25768 0.443-3.04 0.0024 b_trmax_deb3_to_limit 9.345766 0.89 49.43 <.000 b_trmax_deb3_to_avgob3 0.085842 0.076 4.88 <.000 b_travg_deb3_to_avgob3-0.28625 0.0348-8.23 <.000 b_trmax_deb6_to_limit 2.682203 0.59 6.86 <.000 b_trmax_deb6_to_avgob6 0.054655 0.05 3.62 0.0003 b_travg_deb6_to_avgob6 0.364 0.0404 7.83 <.000 b_trsum_deb6_to_trsum_crd6-0.06204 0.00992-6.25 <.000 b_deltaut3to46-0.00003 0.00002-2.52 0.08 b_ut_to_avgut6 -.06497 0.0827-2.88 <.000 b_avgnumdeb3 0.0053 0.00398.29 0.972 b_avgnumdeb6 0.022474 0.00478 4.7 <.000 b_deltanumdeb3to46 0.055753 0.0336.66 0.0969 b_max_dpd3 0.09562 0.029 4.34 <.000 b_max_dpd6-0.04252 0.07-3.63 0.0003 b_delbucket3 -.4759 0.305-4.56 <.000 Variable Estimate Standard Error t Value Pr > t Edu_Secondary -0.7666 0.283-5.97 <.000 Edu_Special -0.69455 0.98-5.8 <.000 Edu_TwoDegree 3.36949 0.2604 2.94 <.000 Marital_Civ 0.293893 0.2076.42 0.57 Marital_Div 0.04653 0.54 0.3 0.7627 Marital_Mar 0... Marital_Sin -0.247 0.5 -.08 0.278 Marital_Wid 0.579889 0.3327.74 0.084 position_empl 0... position_man.306543 0.489 8.77 <.000 position_oth -0.379 0.479-0.93 0.35 position_tech -0.42633 0.307-3.26 0.00 position_top 3.34009 0.335 9.97 <.000 sec_agricult -.796 0.394-2.99 0.0028 sec_constr -0.8334 0.4408-0.42 0.6775 sec_energy -0.92066 0.3506-2.63 0.0086 sec_fin 2.725007 0.3003 9.07 <.000 sec_gov -0.647 0.2829-2.27 0.0234 sec_industry.09577 0.4887 2.09 0.037 sec_manufact 0.40207 0.435 0.92 0.3564 sec_mining 0.454424 0.367.24 0.258 sec_service -0.05828 0.2856-0.2 0.8383 sec_trade 0.90074 0.298 0.65 0.548 SSE 78078 MSE 5.9496 R-Square 0.305
23 Example of linear regression POS amount Estimated function: Avg POS transaction amount
Profitability ratios Income part of the ratio: Interest Income = IR x Limit x Utilization Rate Non-Interest Income = POS Income + ATM Income Two approaches of profitability calculation denominator: Average Outstanding Balance real profitability Credit Limit profitability on allocated sources Profitability = (Interest Income + Non-Interest Income) / Average Outstanding Balance Or Profitability = (Interest Income + Non-Interest Income) / Credit Limit
Credit Limit = 3000 Exposure at Default estimation 25 Expected Loss EL = PD x LGD x EAD +40% +200 at Default Point Exposure at Default for credit card: EaD L UR L UR CF 50% CF (conversion factor) the percent (share) of the additional usage of remaining credit line at the default point. The credit conversion factor (CCF) converts the amount of a free credit line and other off-balance-sheet transactions (with the exception of derivatives) to an EAD (exposure at default) amount. L credit limit Some investigations of EaD - Jacobs (2008), Qi (2009) 500 CF = 200/500 = 80%
26 Utilization rate and Profitability estimation in Credit Limit Strategy Segment Utilization Profitability 0.0%.0% 2.0% 3.0% 4.0% 5.0% 6.0% 7.0% 8.0% 9.0% 0.0% 0.2% 3% 5% 5% 0% 0% 0% 0% 0% -5% -5% -5% -5% 2 6.3% 5% 0% 5% 5% 5% 5% 0% 0% 0% 0% -5% -5% 3 8.0% 7% 0% 0% 0% 0% 5% 5% 5% 5% 0% 0% 0% 4 9.5% 9% 5% 5% 0% 0% 0% 0% 5% 5% 5% 5% 0% 5 39.5% % 20% 20% 5% 5% 5% 0% 0% 0% 0% 5% 5% 6 36.3% 2% 20% 20% 20% 20% 5% 5% 5% 0% 0% 0% 0% 7 85.0% 3% 25% 20% 20% 20% 20% 5% 5% 5% 0% 0% 0% 8 78.4% 4% 25% 25% 20% 20% 20% 20% 5% 5% 5% 0% 0% 9 60.3% 6% 30% 30% 30% 25% 25% 25% 20% 20% 20% 5% 5% 0 35.3% 8% 35% 35% 35% 30% 30% 25% 25% 25% 20% 20% 20% 78.7% 9% 40% 40% 35% 35% 30% 30% 30% 25% 25% 25% 20% 2 77.6% 2% 45% 45% 40% 40% 35% 35% 35% 30% 30% 25% 25% 3 9.9% 24% 55% 50% 50% 45% 45% 45% 40% 40% 35% 35% 30% 4 93.3% 25% 60% 55% 55% 50% 50% 45% 45% 45% 40% 40% 35% Credit Limit changes depends on the profitability and probability of default segment. Utilization rate illustrates the fact that credit line profitability doesn t depend on the utilization rate pro rata. PD
Resume Client Behaviour 27 Transactor Revolver Competent Reveolver Interest Income Low/ No High No Non-Interest Rate High Low Low Risk Level Low Moderate Moderate/ High Competent Revolver the worst client from the profitability point of view (but not the sales volume point of view) It s recommended to build strategies in the Card business with the risk revenue principle to maximize the profitability. Areas of application: Limit management segmentation by revolver/transactor risk limitation and usage motivation Pricing not only risk-based, but use motivation Marketing differentiate target groups Use of panel data helps to avoid the impact of time heterogeneity on model results
Thank you for your attention! 28 The Business School The University of Edinburgh Professor Jonathan Crook Jonathan.Crook@ed.ac.uk Denis Osipenko, Doctoral Student D.Osipenko@sms.ed.ac.uk