Journal of Credit 15(2), 1–19 DOI: 10.21314/JCR.2018.244

Copyright Infopro Digital Limited 2019. All rights reserved. You may share using our article tools. This article may be printed for the sole use of the Authorised User (named subscriber), as outlined in our terms and condi- tions. https://www.infopro-insight.com/termsconditions/insight-subscriptions

Research Paper A consumer structural model based on affordability: balance at risk

Marcelo S. Perlin, Marcelo B. Righi and Tiago P. Filomena

Escola de Administrac¸ao,˜ Universidade Federal do Rio Grande do Sul, Rua Washington Luiz, 855 – Centro Historico´ de Porto Alegre – RS, 90010-460 Brazil; emails: [email protected], [email protected], tiago.fi[email protected]

(Received August 3, 2017; revised June 11, 2018; accepted August 20, 2018)

ABSTRACT This paper introduces an approach designed for personal credit risk, with possible applications in risk assessment and optimization of debt contracts. We define a struc- tural model related to the financial balance of an individual, allowing for cashflow seasonality and deterministic trends in the process. Based on the proposed model, we develop risk measures associated with the probability of default rates conditional on time. This formulation is best suited to short-term loans, where the dynamics of individuals’ cashflow, such as seasonality and uncertainty, can significantly impact future default rates. In the empirical section of this paper, we illustrate an application by estimating risk measures using simulated data. We also present the specific case of optimization of a financial contract, where, based on an estimated model, we find the yield rate/time to maturity pair that maximizes the expected profit or minimizes the default risk of a short-term debt contract.

Keywords: credit risk; balance at risk (BaR); personal finance; cashflow.

Corresponding author: M. S. Perlin Print ISSN 1744-6619 Online ISSN 1755-9723 c 2019 Infoproj Digital Risk (IP) Limited

1 2 M. S. Perlin et al

1 INTRODUCTION The assessment of consumer credit quality is an important aspect in banking and finance (Thomas 2000). The existence of a developed financial market allows an indi- vidual to sell future cashflows in exchange for present purchases of goods, such as a TV, car or real estate (Shiller 2013). Over the years, a higher demand for consumer credit products and competitive incentives for cost minimization in credit analysis have motivated a transition from the subjective evaluation of credit risk to the use of quantitative models for so-called credit and behavior scoring (Crook et al 2007; Hand and Henley 1997; Thomas 2000). Traditional models for consumer credit risk assessment tend to rely on a broad range of techniques, such as artificial neural networks (Khashman 2010; Oreski et al 2012), support vector machines (Huang et al 2007), logistic regressions (Crook and Bellotti 2010; Wiginton 1980), decision trees (Matuszyk et al 2010) and mathemati- cal programming (Crook et al 2007), to cite some examples.1 Another, less explored approach to consumer credit risk assessment is a structural model, which is more oriented toward obtaining the probability of default (PD) of a loan. The first structural model for credit risk was an option-based approach proposed by Merton (1974) that focused on modeling the stochastic dynamics of a firm’s value. If the firm’s value falls below its debt value on its maturity date, then the firm is in default. This phenomenon is equivalent to stating that equity holders did not exercise their call option on a company’s assets: the option expired out-of-the-money. Thus, the PD on the loan equals the probability of the option being out-of-the-money on the expiration date (Allen et al 2004). Since Merton’s seminal work, other corporate structural credit risk models have appeared, such as Longstaff and Schwartz (1995), Leland and Toft (1996) and Collin-Dufresne and Goldstein (2001).2 Despite the developments related to structural corporate credit risk models, not much attention has been paid to structural models for consumer credit risk. Perli and Nayda (2004) discuss an option-based model that is similar to one for corporate credit risk: if a consumer’s assets are lower than a threshold, then they will default on the loan. De Andrade and Thomas (2007) propose a structural model in which the PD is based on the consumer’s reputation. In their model, the consumer has a call option on the value of their reputation, and the strike price of this option is the debt value. The credit/behavior score is the proxy for the value of the reputation. Thomas (2009) classifies this model as a consumer structural model on the basis of reputation.

1 Reviews of other techniques can be found in Baesens et al (2003) and Lessmann et al (2013). 2 Reduced-form models, in which default is modeled exogenously, are also alternatives to structural models (see Jarrow et al 2004).

Journal of Credit Risk www.risk.net/journals BaR: balance at risk 3

Our approach is a consumer structural model based on affordability (Thomas 2009), which differs not only from Perli and Nayda (2004) but also from De Andrade and Thomas (2007). We model directly the balance of the individual client by explic- itly defining a stochastic process for their income and expenses. For the purposes of this paper, we define an applicant’s balance as the available funds in their bank account. This is not a strict definition; one can clearly define any asset as part of the balance, as long as it is liquid enough to be cashed without price restrictions. This or other definitions can be adapted to the use of our proposed model. This approach allows for greater flexibility in simulating the scenarios and a more realistic assessment of the PD because it does not require any characteristic infor- mation from the applicant. The method is best suited to short-term loans, for which the distribution of cashflows over time can significantly impact the default rate of the debt. The model is easily justified because a better assessment of the applicants’ default rate is very important, given that short-term loans can improve the long-term relationship between a bank and its clients (Bodenhorn 2003) as well as the financial stability of the economy (Wagner and Marsh 2006). Further, as discussed by Thomas (2010) and Crook and Bellotti (2010), intro- ducing economics and market conditions into consumer risk assessment is still a challenge. Thus, we also contribute to the literature by directly considering the con- ditions of the economy, such as unemployment and wage levels. We call our model a balance-at-risk (BaR) model. Note that there is a literature combining transaction data with credit bureau infor- mation based on a variety of machine learning techniques (Butaru et al 2016; Khan- dani et al 2010; Oreski and Oreski 2014; Oreski et al 2012). These studies are reliant on finding patterns in different sets of data to make inferences about consumer credit ratings. Our BaR model, however, is organized around the consumer’s account bal- ance, which means it is closely related to corporate cashflow-at-risk (CFaR) models. This type of model is a downside risk metric based on past cashflow distributions used to evaluate corporate (Andren´ et al 2005; Oral and Akkaya 2015; Stein et al 2001). The theory that supports this proposal, along with derivations of risk measures, is provided in this paper. Using an artificial data set, we illustrate the model’s usefulness by first estimating an empirical model and then using it to calculate forward PDs that are conditional on time. We also show how the model can be used in a loan application by finding the parameters of a debt contract that maximize the of the financial transaction. This paper is organized as follows. In Section 2, we discuss the theory that sup- ports the empirical BaR model. In Section 3, an application model is examined. In Section 4, we finish the paper with some concluding remarks. www.risk.net/journals Journal of Credit Risk 4 M. S. Perlin et al

2 BALANCE AT RISK In this section, we present our proposed approach, BaR, to measure and manage credit risk in a mathematical and theoretical framework. Unless otherwise stated, the content is based on the following notation. Consider the filtered atom-less probability

space XT X.˝; F ;.FT /T T; P/ of monetary values, where ˝ is the sample WD 2 space; F is the set of possible events in ˝; FT is a filtration with F0 ; ˝ , S D f; g F . T >0 FT /, T R and the usual assumptions; and P is a probability D WD C [ 1 measure that is defined in ˝ of the events contained in F . We consider adapted random processes XT XT R to represent the variables W ! in our approach. Thus, EŒXT  is the expected value of XT under P. All equalities and

inequalities are considered to be almost surely in P. FXT FX FT is the probability WD j function of X , with inverse F 1 and density f . At time T , B , R , C , I , E T XT XT T T T T T and LT represent, respectively, balance, risk-free rate, net cashflow, income, expense and loans to be paid. We assume that RT and LT are FT dt measurable because their values are known by contract in the previous period. The main idea is to consider the balance of an agent as a stochastic process, com- posed of their net cashflows, to measure the credit risk incurred for a financial insti- tution when it loans money to this agent. The BaR approach is the analysis of risk measures through this stochastic process. Thus, in this section, we define and expose some properties of the stochastic process under analysis as well as the credit risk measures that are derived from it.

DEFINITION 2.1 Let BT , RT , CT , IT , ET XT R and CT IT ET . The W ! D closed form for the stochastic process that represents the balance is given by

 Z T à Z T  Z T à BT B0 exp Rt dt Ct exp Rs ds dt; (2.1) D 0 C 0 t ÂÄ ÂZ T Ã à dBT B0 exp Rt dt RT CT dt: (2.2) D 0 C

REMARK 2.2 The stochastic process in Definition 2.1 depends only on cashflows and can be considered information that summarizes the distinct dimensions (eco- nomic, social, geographic and others) of the agent’s behavior. Equations (2.1) and

(2.2) can be recursively generalized for any time T  6 T and, respectively, conform to ÂZ T Ã Z T ÂZ T Ã BT BT exp Rt dt Ct exp Rs ds dt; D  C T  T  t ÂÄ ÂZ T Ã Ã dBT BT exp Rt dt RT CT dt: D  C T 

Journal of Credit Risk www.risk.net/journals BaR: balance at risk 5

Moreover, it is possible to obtain simpler formulations under a null initial balance, ie, B0 0. D Obviously, the properties of BT as a stochastic process directly depend on those possessed by CT and, consequently, by IT and ET . At this point, we do not make assumptions about the stochastic process that governs such random variables. Our goal is to keep the model as general as possible. Nonetheless, for a special case, it is possible to analytically derive FBT given FIT and FET . To do so, we use the concept of probability function convolution. Below, we present the definition of this concept and two well-known lemmas in probability theory. Given this background, we prove a theorem.

DEFINITION 2.3 The convolution between two probability densities fX and fY is given by Z 1 fX fY .x/ fX .x u/fY .u/ du: (2.3) D 1 LEMMA 2.4 Let X and Y be two independent random variables. Then, fX Y C D fX fY .x/.

LEMMA 2.5 Let k R. Then, 2 1 Âx à f .x/ fX : D k k j j PROPOSITION 2.6 Let BT , RT , CT , IT , ET XT R, CT IT ET and R T W ! D t Rs ds. If IT , ET are independent for all T T, and CT , CS are D t 2 independent for all T;S T, T S, then 2 ¤ Z x h ˇT i  ˇ FBT .x/ lim fB00 .s/ fCt t .s/ ds; (2.4) D dt 0 ˇ0 1 ! Ä 1  x Ã Ä 1  x à  fCt t .x/ fIt f Et for all t Œ0; T ; (2.5) D t t t t 2 j j j j T    fCt t .x/ 0 fC00 .x/ fC0 dt 0 dt .x/ fCT T .x/: (2.6) j D C C   

PROOF Because RT is FT dt measurable by assumption, t R for all t Œ0; T . 2 2 Because Ct t It t Et t for all t Œ0; T , we have by Lemma 2.4 and D 2 Lemma 2.5 that Ä 1  x Ã Ä 1  x à  fCt t .x/ fIt t Et t fIt f Et D D t t t t j j j j for all t Œ0; T . 2 www.risk.net/journals Journal of Credit Risk 6 M. S. Perlin et al

Repeating the argument for Ct t ; t Œ0; T , which are independent by assump- 2 tion, we obtain ˇT ˇ fR T .x/ lim fCt t .x/ : 0 Ct t dt D dt 0 ˇ0 !

Because fB00 is a Dirac measure, which is independent of any probability function, R x and FX .x/ fX .s/ ds, we have D 1 Z x h ˇT i  ˇ FBT .x/ lim fB00 .s/ fCt t .s/ ds: D dt 0 ˇ0 1 ! This concludes the proof. 

REMARK 2.7 Naturally, such assumptions of independence in Proposition 2.6 are too restrictive and can be questioned in real life. However, even with such constraints, the degree of complexity becomes significant and leads to an analytical expression

that is difficult to compute. Moreover, obtaining FBT from cashflow distributions is at least an alternative. One can eliminate the assumption that IT and ET are inde- pendent if the distribution for CT is directly known. Nonetheless, in practical situa-

tions, it is also possible to directly estimate FBT from balance data using numerical simulations.

Having explained the stochastic process, we turn our focus to the next step in the BaR approach, which is measuring credit risk. We concentrate on two risk measures: the PD and the loss given default (LGD). As both PD and LGD are very well-known risk measures for credit risk, we shall not pursue a debate of their theoretical prop- erties in this paper. Nonetheless, we formally define them and briefly discuss their financial meaning.

DEFINITION 2.8 Let BT , LT XT R. The probability of default, PDT W ! W XT Œ0; 1, and the loss given default, LGDT XT R, are defined, respectively, ! W ! as follows:

6 PDT P.BT LT / FBT .LT /; (2.7) WD D 1 LGDT EŒBT LT BT 6 LT F .PDT /: (2.8) WD j D BT REMARK 2.9 The PD is the probability of an agent not having enough balance to pay their loan at time T . It is a very simple and intuitive credit risk measure. However, PD alone does not give the entire picture of a situation, because it does not consider the loss magnitudes in the event of a default. This shortcoming is handled by the LGD, which indicates how much money is necessary to fulfill the loss in the event of a default. This measure has similarities to the well-known measure

Journal of Credit Risk www.risk.net/journals BaR: balance at risk 7 expected shortfall (ES), but for LGD the expectation is truncated by LT instead of a quantile of interest. Moreover, because (2.7) and (2.8) both depend directly on FBT , under the hypothesis of Proposition 2.6 one can obtain analytical formulations for these two risk measures.

3 AN EMPIRICAL BaR MODEL Now, we consider a specific BaR model nested in the previous formulations. From this point forward, we change the notation so that the time index is represented by t 1; : : : ; T , which is the usual notation in empirical time series models. D We use a discrete version of the model by setting the income and the expense equation of an individual to separate stochastic processes. We expect income to be far more predictable than expenses, because most of society is bound by work contracts that explicitly define the amount of financial reward that one receives per unit of time. However, when looking at expenses, we expect a higher quantity of noise because the individual decision to purchase goods and services is a function of diverse economic, social and personal factors. The heterogeneous stochastic properties of the inflow and outflow of cash separate the process of income and expenses. Another important aspect of empirically modeling the inflow and outflow of finan- cial resources is recognizing the existence of seasonalities. An individual might receive more cash in particular periods than others. For instance, a worker in Brazil is entitled to an extra paycheck and an additional cash-based holiday premium through- out the year. These amounts are usually paid in June and December. Identifying these particular months is essential to developing a realistic credit risk model, particularly in the case of short-term loans. The expectation of receiving more cash in particular months affects the balance and adds a time dependency to the dynamic of cashflows and, therefore, the forward PD. A realistic empirical model should also consider the effect of unemployment. If an individual loses their work contract for any reason, their cash income ceases, thus affecting the PD. When an individual loses their job, all that is left as support for expenses is the current value of the balance. In considering these effects, we propose a discrete version of (2.1) and (2.2) for our empirical example:

Bt Bt 1.1 rt / Ii;t Ei;t ; (3.1) D C C ( 0 if St 1 (unemployed); Ii;t D (3.2) D FI MIi;t if St 2 (employed); C D

Ei;t FE MEi;t ; (3.3) D C www.risk.net/journals Journal of Credit Risk 8 M. S. Perlin et al

where Bt is the balance at time t (t 1; : : : ; T ); rt is the risk-free yield (monthly), D assumed constant (rt r); Ii;t is the total income for month i and time t; Ei;t is D the total expense for month i and time t; FI is the fixed monthly income; MI is the monthly extra income for month i; FE is the fixed monthly expense; and ME is the monthly extra expense for month i. Following the ideas described in Huh et al (2010) and Malik and Thomas (2012), the states of employment and unemployment (St ) follow a discrete Markov chain, given by transition matrix P : " # p11 1 p22 P : (3.4) D 1 p11 p22 Because only two states and two transition probabilities exist, we estimate the transition probabilities by inverting the expected duration of the states E.Dur.St D k// 1=.1 pkk/: D p11 1 1=E.time employed/; (3.5) D p22 1 1=E.time unemployed/: (3.6) D Thus, depending on how much time it takes for a worker in the same profession as the applicant to get a job or lose their job, transforming this information into transition probabilities using (3.5) and (3.6) is easy. Given this setup, we allow for economic conditions in the job market to affect the applicant’s PD. If the unemploy- ment duration increases, so does the likelihood of a lower income. Therefore, we can expect an increase in the PD on a loan. Note that the balance in (3.1) changes over time, indicating that the PD should be recalculated as time moves forward, in a man- ner similar to the behavioral models (Thomas 2000). If the effective balance level increases with an applicant’s good financial status, the likelihood of a future default decreases, and the individual should receive the reward of paying lower interest rates.

3.1 Simulation of the income and expense data As an example of an empirical application, we assume the existence of a history of inflows and outflows of a balance account over five years. That is, in the data- generating stage, we also assume the individual was employed or had passive earn- ings throughout the period. Individual banking statements should be easily available within a commercial bank. In commercial applications of the model, it might be interesting to consider the creation of a central organization using applicants’ aggre- gate bank statements. On the one hand, banks would benefit from gaining a larger amount of information from applications. On the other hand, clients would benefit from competition between banks, which should create a downward pressure on the interest rates of debt.

Journal of Credit Risk www.risk.net/journals BaR: balance at risk 9

To illustrate the use of this model, we simulate the inflows and outflows of money as random normal variables, with different means and standard deviations according to the month of the year. This simulation should emulate the expected noise from empirical banking data. In the simulation, we set an initial balance of $1000, where, on the income side, the applicant makes an average of $3050 every month. In the months of June and December, they receive a bonus of 50%. The income in each month is a random nor- mal variable, with its mean defined using the previous rules and a standard deviation of $50. On the expense side, the applicant expends an average of $3000, with 40% increases in June and December. The instability, however, is higher for expenses, with a standard deviation of $250. Given the previous information, we can simulate a banking history for any period of time. In Figure 1, we present the time series plot of the simulated income and expense data. Note that the behavior of income is far more stable than that of expenses, with clear seasonality in June and December. The balance of this individual has a clear upward trend because their income is usually higher than their expenditure. Further, we assume that, generally, the applicant is employed for three years, and they can acquire a new job within three months when unemployed. This information can be retrieved from the applicant’s job records or other sources. For example, the Bureau of Labor Statistics reports that the average duration of employment in the United States for March 2016 was twenty-nine weeks (7.25 months).3

3.2 Estimation of the coefficients Given the general mathematical formulation discussed in the previous section, the empirical model can be formulated in different ways. For simplicity, we assume that income and expenses follow a linear model conditional on time. However, it is impor- tant to point out that more sophisticated methods, such as cubic splines, can be used to model the seasonality of the banking data. Such an approach has been used with success in finance to model volatility and term structures (Audrino and Buhlmann¨ 2009; Engle and Rangel 2008; Jarrow et al 2004). The model’s estimation is straightforward. We define a statistical model for income and expenses as 12 X Ii;t ˛I ˇi Di "i;t ; (3.7) D C C i 2 D 12 X Ei;t ˛E i Di i;t ; (3.8) D C C i 2 D

3 See www.bls.gov/news.release/empsit.t12.htm. www.risk.net/journals Journal of Credit Risk 10 M. S. Perlin et al

FIGURE 1 Simulated data for income and expenses.

Balance

4500

4000

Value 3500

3000

Expenses 4500

4000

3500 Value 3000

2500 Income 6000

4000 Value

2000

0 6 12 18 24 30 36 42 48 54 60 Time (months)

where Di is a dummy variable that takes the value 1 if the current month is i and 0 otherwise. We exclude the dummy for the first month, i 1, to avoid identification D issues in the regression model. Using the simulated data set as input, we estimate with least squares the empirical model defined in (3.7) and (3.8). The resulting coefficients from the estimation are omitted. Note that they capture the seasonality of the data. For reproducibility, the R code that generates the data and estimates the model is available at https://bit.ly/2X9eaS1. As an illustration of the use of this model, we set the expected time of employ- ment to unemployment as three years (thirty-six months) and the expected time

Journal of Credit Risk www.risk.net/journals BaR: balance at risk 11 of unemployment to employment as three months. These are used to define the transition probabilities of (3.4) using (3.5) and (3.6).

3.3 Methodology for and results of simulating an individual’s future balance and the associated PDs Given information on the regression models, transition matrix and an initial bal- ance of $1000, we simulate an individual’s future balance by sampling the estimated residuals from the regression, "i;t and i;t , conditional on the month of the year. This simulation allowed for seasonality not only with expected income and expenses but also with their corresponding volatility. The estimated transition probabilities will drive the Markov chain forward. In the simulation, we first assume state 1 at time 0 (S0 1): employed. Then, we sample a random number from 0 to 1. If D this number is lower than the probability of staying in regime 1, we set S1 1. D If this random number is higher than the probability of staying in regime 1, we switch the state to S1 2: unemployed. Likewise, following (3.2), if St 1, then D D Ii;t 0; else, Ii;t FI MIi;t . We use the same algorithm to drive the Markov D D C chain for t 2; 3; 4; : : : ; T . We perform 10 000 simulations with a time horizon of D twenty-four months. Once the balance is simulated, calculating the PD of the individual is straight- forward: it can be done by simply looking at the number of simulated scenarios in which the resulting balance was negative for each forward point in time (see (2.7)). Because our first example had no loans, this figure represents the probability of the applicant falling short on their expenses. We call this figure the benchmark default rate curve. In Figure 2, we note that the PD is also seasonal, with a small drop in month 12, which is when a larger cash income occurs. We also see that the PDs generally increase over time, indicating that this applicant is more likely to run out of cash as time passes. While not an intuitive result, this is the effect of the chances of unem- ployment, driven by the transition matrix. Losing income puts pressure on the bal- ance of the individual. Even if the overall chance of unemployment is small, it will still have a great impact on the forward PD. We now illustrate a common practical case in which the same applicant asks for a loan that can be paid through monthly installments of $400 for two years or monthly installments of $200 for four years. Given the previous regression for the expenses, implementing this information in the model is easy by defining

12 X Ei;t ˛E  i Di i;t ; D C C C i 2 D www.risk.net/journals Journal of Credit Risk 12 M. S. Perlin et al

FIGURE 2 Probability of default.

0.06

0.04

Probability of default Probability 0.02

0

0 6 12 18 24 Time (months)

PD (without loan)

where  is the expected change in expenses represented as the monthly loan payment value. Figure 3 presents the impact on the PD. Based on Figure 3, the payment of the loan has an explicit impact on the for- ward PD. For monthly payments of $400, the applicant is very likely to default on their loan in month 4. However, when setting a monthly payment of $200, the PD decreases significantly, indicating a better financial contract for both sides. This illus- tration shows the flexibility of the empirical model and could accommodate differ- ent practical scenarios when evaluating credit risk. Next, in Table 1, we present the calculation of the LGD for each scenario. As is observed, the first scenario leads to an LGD of 3346.6, which is significantly higher than the second case with longer maturity and lower monthly installments. As suspected, the second setup presents a lower expected cost for the bank.

Journal of Credit Risk www.risk.net/journals BaR: balance at risk 13

FIGURE 3 Probabilities of default given a new loan.

1.00

0.75

0.50

Probability of default

0.25

0

0 6 12 18 24 Time (months)

PD (without loan) PD (with $400 pm loan) PD (with $200 pm loan)

Here, “pm” denotes “per month”.

TABLE 1 Risk measures for empirical BaR model.

Case LGD

Installments of $400 per month 3346.6 Installments of $200 per month 475.2

3.4 Optimizing parameters of a loan using a BaR model In practice, the bank receives data from the applicant and must decide on the structure of the offered debt contract, including its maturity and rate of return. If the contract www.risk.net/journals Journal of Credit Risk 14 M. S. Perlin et al

is perceived as risky, the bank is likely to charge a higher yield rate and set a lower maturity range. In this section, we present an illustration of how the BaR formulation can be used to optimize the parameters from the point of view of the bank. We consider a case in which the applicant requests a loan for 10 000 units of cash and has the same banking records as used in the previous example (see Figure 1). The debt is paid back by the applicant in fixed monthly installments that are defined for a given maturity and annual yield rate. This loan is equivalent to a fixed rate bond with monthly coupons (Fabozzi and Mann 2012). Using an annual yield rate, we define the installment value as:

1=12 rm .1 ra/ 1; (3.9) D C rm I 10 000 T ; (3.10) D 1 .1 rm/ C where rm is the monthly yield of the debt contract, ry is the yearly yield of the debt contract, T is the maturity of the contract (in months) and I is the fixed value of monthly installments. Equation (3.10) indicates that the increase in the yield rate and the decrease in the maturity of a contract increase the value of the installments and the risk of the contract. In the BaR model, a higher installment leads to a higher level of uncertainty and, consequently, a higher PD. From the bank side, a natural question to investigate is this: given the expected cashflow dynamic of the applicant, which values of yield and maturity maximize the profit of the contract? We analyze this question by defining the objective func- tion as the expected return from the loan, which is calculated on the basis of the simulation procedure defined in the previous section. Note that the risk is also con- sidered because higher installment values increase the likelihood of a default and, thus, decrease the expected profit. We first create a vector of values for maturity (T 1; 2; : : : ; 23; 24) and yields D (ry 0:025; 0:05; : : : ; 0:475; 0:5). For each pair of maturity and yields, we simu- D lated the BaR model and calculated the expected return from the contract. We per- formed a grid search procedure to find the pair of maturities and yields that max- imizes the value of E.R/, the expected cashflow divided by the total value of the loan. Our results are presented in Figure 4. Figure 4 indicates that the pair of maturities and yields that maximizes the con- tract’s expected profit is twenty-three months and a 30% rate of return on the debt. The white spots in the figure show that a low maturity and a higher yield lead to a null return of the contract. This result is explained by the fact that a higher install- ment (see (3.10)) quickly drains the applicant’s balance, leading to a default on the contract. We also point out that the risk of the contract, LGD (2.8), is minimized with a maturity of eighteen months and a 5% yield. This exercise clearly shows how the

Journal of Credit Risk www.risk.net/journals BaR: balance at risk 15

FIGURE 4 Result of grid search procedure.

0.5

0.4

E(R) 0.12 0.3 0.09

0.06 Annual yield rate Annual 0.03

0.2

0.1

5 10 15 20 25 Maturity (months)

BaR model can be used to better design short-term financial contracts by taking into account the applicant’s banking history.

4 CONCLUSIONS In this paper, we propose an alternative model to calculate personal financial risk by defining a stochastic process for an individual’s cash income and expenditures. For- mal theoretical definitions for this approach along with the usual risk measures are presented. Using an artificial data set, we illustrate the model’s usage by estimating www.risk.net/journals Journal of Credit Risk 16 M. S. Perlin et al

an empirical version of the general formulation and calculating the forward PDs on different loan scenarios. We also present an example of an optimization scenario in which we find the optimal maturity and yield rate of a short-term loan. The main advantage of using our approach is its flexibility. By directly employing a stochastic process for income and expenses, we allow seasonality to play a role in the analysis of short-term credit risk. By using this model, one can verify the impact of several economic factors on an individual’s personal credit risk, eg, loan payment increases, specific payment schedule adjustments or job market changes. Empirical applications of the model are straightforward. On the basis of banking data and information from the job market, one can estimate and simulate an indi- vidual’s future income and expenses. As argued in this paper, the creation of a cen- tral organization to store and analyze banking data could facilitate the process and benefits for both parties in the transaction. Because banks deal with portfolios of individual loans, one suggestion for a future study is to investigate a multivariate version of the model. Formulating a model that incorporates systematic dependencies within a pool of applicants’ cashflows, for which economic shocks such as an increase in general unemployment affect the overall PD on loans, would be interesting in the analysis of the general risk of a bank. Future investigations with real banking data are also suggested. Given individ- ual records, one could study the dynamics of cashflows and test for the structure of an empirical model that efficiently represents the data set and, thus, provides realistic default probability values. Understanding and modeling how applicants react when faced with a low balance in their accounts is also suggested. A person is likely to place a hold on their expenses once their balance passes a particular threshold. An investigation using real data could shed light on this effect. These and other ideas are left for future development.

DECLARATION OF INTEREST

The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the paper.

REFERENCES

Allen, L., DeLong, G., and Saunders, A. (2004). Issues in the credit risk modeling of retail markets. Journal of Banking & Finance 28(4), 727–752 (https://doi.org/10.1016/ S0378-4266(03)00197-3). Andren,´ N., Jankensgard,˚ H., and Oxelheim, L. (2005). Exposure-based cash-flow-at-risk: an alternative to VaR for industrial companies. Journal of Applied Corporate Finance 17(3), 76–86 (https://doi.org/10.1111/j.1745-6622.2005.00046.x).

Journal of Credit Risk www.risk.net/journals BaR: balance at risk 17

Audrino, F., and Buhlmann,¨ P. (2009). Splines for financial volatility. Journal of the Royal Statistical Society B 71(3), 655–670 (https://doi.org/10.1111/j.1467-9868.2009 .00696.x). Baesens, B., Van Gestel, T., Viaene, S., Stepanova, M., Suykens, J., and Vanthienen, J. (2003). Benchmarking state-of-the-art classification algorithms for credit scoring. Journal of the Operational Research Society 54(6), 627–635 (https://doi.org/10.1057/ palgrave.jors.2601545). Bodenhorn, H. (2003). Short-term loans and long-term relationships: relationship lending in early America. Journal of Money, Credit and Banking 35(4), 485–505 (https://doi.org/ 10.1353/mcb.2003.0025). Butaru, F., Chen, Q., Clark, B., Das, S., Lo, A. W., and Siddique, A. (2016). Risk and risk management in the credit card industry. Journal of Banking & Finance 72, 218–239 (https://doi.org/10.1016/j.jbankfin.2016.07.015). Collin-Dufresne, P., and Goldstein, R. S. (2001). Do credit spreads reflect stationary leverage ratios? Journal of Finance 56(5), 1929–1957 (https://doi.org/10.1111/0022- 1082.00395). Crook, J. N., and Bellotti, T. (2010). Time varying and dynamic models for default risk in consumer loans. Journal of the Royal Statistical Society A 173(2), 283–305 (https:// doi.org/10.1111/j.1467-985X.2009.00617.x). Crook, J. N., Edelman, D. B., and Thomas, L. C. (2007). Recent developments in con- sumer credit risk assessment. European Journal of Operational Research 183(3), 1447–1465 (https://doi.org/10.1016/j.ejor.2006.09.100). De Andrade, F. W. M., and Thomas, L. (2007). Structural models in consumer credit. European Journal of Operational Research 183(3), 1569–1581 (https://doi.org/10.1016/ j.ejor.2006.07.049). Engle, R. F., and Rangel, J. G. (2008). The spline–GARCH model for low-frequency volatil- ity and its global macroeconomic causes. Review of Financial Studies 21(3), 1187–1222 (https://doi.org/10.1093/rfs/hhn004). Fabozzi, F. J., and Mann, S. V. (2012). The Handbook of Fixed Income Securities. McGraw-Hill Professional. Hand, D. J., and Henley, W. E. (1997). Statistical classification methods in consumer credit scoring: a review. Journal of the Royal Statistical Society A 160(3), 523–541 (https:// doi.org/10.1111/j.1467-985X.1997.00078.x). Huang, C.-L., Chen, M.-C., and Wang, C.-J. (2007). Credit scoring with a data mining approach based on support vector machines. Expert Systems with Applications 33(4), 847–856 (https://doi.org/10.1016/j.eswa.2006.07.007). Huh, J., Chang, W., Lee, J., and Lee, J. (2010). Samsung card lending model. European Journal of Operational Research 207(1), 492–498 (https://doi.org/10.1016/j.ejor.2010 .04.009). Jarrow, R., Ruppert, D., and Yu, Y. (2004). Estimating the interest rate term structure of corporate debt with a semiparametric penalized spline model. Journal of the American Statistical Association 99(465), 57–66 (https://doi.org/10.1198/016214504000000070). Khandani, A. E., Kim, A. J., and Lo, A. W. (2010). Consumer credit-risk models via machine-learning algorithms. Journal of Banking & Finance 34(11), 2767–2787 (https:// doi.org/10.1016/j.jbankfin.2010.06.001). www.risk.net/journals Journal of Credit Risk 18 M. S. Perlin et al

Khashman, A. (2010). Neural networks for credit risk evaluation: investigation of different neural models and learning schemes. Expert Systems with Applications 37(9), 6233– 6239 (https://doi.org/10.1016/j.eswa.2010.02.101). Leland, H. E., and Toft, K. B. (1996). Optimal capital structure, endogenous bankruptcy, and the term structure of credit spreads. Journal of Finance 51(3), 987–1019 (https:// doi.org/10.1111/j.1540-6261.1996.tb02714.x). Lessmann, S., Seow, H.-V., Baesens, B., and Thomas, L. C. (2013). Benchmarking state- of-the-art classification algorithms for credit scoring: a ten-year update. Working Paper, Conference Archive, Credit Research Centre. Longstaff, F. A., and Schwartz, E. S. (1995). A simple approach to valuing risky fixed and floating rate debt. Journal of Finance 50(3), 789–819 (https://doi.org/10.1111/j.1540- 6261.1995.tb04037.x). Malik, M., and Thomas, L. C. (2012). Transition matrix models of consumer credit ratings. International Journal of Forecasting 28(1), 261–272 (https://doi.org/10.1016/j.ijforecast .2011.01.007). Matuszyk, A., Mues, C., and Thomas, L. C. (2010). Modelling LGD for unsecured personal loans: decision tree approach. Journal of the Operational Research Society 61(3), 393– 398 (https://doi.org/10.1057/jors.2009.67). Merton, R. C. (1974). On the pricing of corporate debt: the risk structure of interest rates. Journal of Finance 29(2), 449–470 (https://doi.org/10.1111/j.1540-6261.1974.tb03058 .x). Oral, C., and Akkaya, G. C. (2015). Cash flow at risk: a tool for financial planning. Procedia Economics and Finance 23, 262–266 (https://doi.org/10.1016/S2212-5671(15)00358- 5). Oreski, S., and Oreski, G. (2014). Genetic algorithm-based heuristic for feature selection in credit risk assessment. Expert Systems with Applications 41(4), 2052–2064 (https:// doi.org/10.1016/j.eswa.2013.09.004). Oreski, S., Oreski, D., and Oreski, G. (2012). Hybrid system with genetic algorithm and artificial neural networks and its application to retail credit risk assessment. Expert Sys- tems with Applications 39(16), 12 605–12 617 (https://doi.org/10.1016/j.eswa.2012.05 .023). Perli, R., and Nayda, W. I. (2004). Economic and regulatory capital allocation for revolving retail exposures. Journal of Banking & Finance 28(4), 789–809 (https://doi.org/10.1016/ j.jbankfin.2003.10.007). Shiller, R. J. (2013). Finance and the Good Society. Princeton University Press (https:// doi.org/10.1515/9781400846177). Stein, J. C., Usher, S. E., D. LaGattuta, and Youngen, J. (2001). A comparables approach to measuring cashflow-at-risk for non-financial firms. Journal of Applied Corporate Finance 13(4), 100–109 (https://doi.org/10.1111/j.1745-6622.2001.tb00430.x). Thomas, L. C. (2000). A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers. International Journal of Forecasting 16(2), 149–172 (https:// doi.org/10.1016/S0169-2070(00)00034-0). Thomas, L. C. (2009). Modelling the credit risk for portfolios of consumer loans: analogies with corporate loan models. Mathematics and Computers in Simulation 79(8), 2525– 2534 (https://doi.org/10.1016/j.matcom.2008.12.006).

Journal of Credit Risk www.risk.net/journals BaR: balance at risk 19

Thomas, L. C. (2010). Consumer finance: challenges for operational research. Journal of the Operational Research Society 61(1), 41–52 (https://doi.org/10.1057/jors.2009.104). Wagner, W., and Marsh, I. W. (2006). Credit risk transfer and financial sector stability. Journal of Financial Stability 2(2), 173–193 (https://doi.org/10.1016/j.jfs.2005.11.001). Wiginton, J. C. (1980). A note on the comparison of logit and discriminant models of consumer credit behavior. Journal of Financial and Quantitative Analysis 15(3), 757– 770 (https://doi.org/10.2307/2330408).

www.risk.net/journals Journal of Credit Risk