Next Article in Journal
Board Gender Diversity and Cash Holdings: Empirical Evidence from the European Sport and Leisure Sector
Next Article in Special Issue
Bibliometric Analysis for Working Capital: Identifying Gaps, Co-Authorships and Insights from a Literature Survey
Previous Article in Journal / Special Issue
Do CEO Duality and Ownership Concentration Impact Dividend Policy in Emerging Markets? The Moderating Effect of Crises Period
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Validation of Corporate Probability of Default Models Considering Alternative Use Cases

by
Michael Jacobs, Jr.
Wholesale 1st Line Model Development Validation Services, PNC Financial Services Group—Balance Sheet Analytics & Modeling/Model Development, 340 Madison Avenue, New York, NY 10022, USA
Int. J. Financial Stud. 2021, 9(4), 63; https://doi.org/10.3390/ijfs9040063
Submission received: 16 October 2021 / Revised: 6 November 2021 / Accepted: 8 November 2021 / Published: 24 November 2021
(This article belongs to the Special Issue Corporate Finance)

Abstract

:
In this study, we consider the construction of through-the-cycle (“TTC”) PD models designed for credit underwriting uses and point-in-time (“PIT”) PD models suitable for early warning uses, considering which validation elements should be emphasized in each case. We build PD models using a long history of large corporate firms sourced from Moody’s, with a large number of financial, equity market and macroeconomic variables as candidate explanatory variables. We construct a Merton model-style distance-to-default (“DTD”) measure and build hybrid structural reduced-form models to compare with the financial ratio and macroeconomic variable-only models. In the hybrid models, the financial and macroeconomic explanatory variables still enter significantly and improve the predictive accuracy of the TTC models, which generally lag behind the PIT models in that performance measure. We conclude that care must be taken to judiciously choose the manner in which we validate TTC vs. PIT models, as criteria may be rather different and be apart from standards such as discriminatory power. This study contributes to the literature by providing expert guidance to credit risk modeling, model validation and supervisory practitioners in controlling the model risk associated with such modeling efforts.

1. Introduction

It is expected that financial market participants have accurate measures of a counterparty’s capacity to fulfil future debt obligations, conventionally measured by a credit rating or a score, typically associated with a probability of default (“PD”). Most extant risk rating methodologies distinguish model outputs considered point-in-time (“PIT”) vs. through-the-cycle (“TTC”). Although these terminologies are widely used in the credit risk modeling community, there is some confusion about what these terms precisely mean. In our view, based upon first-hand experience in this domain and a comprehensive literature review, at present a generally accepted definition for these concepts remains elusive, apart from two points of common understanding. First, PIT PD models should leverage all available information, borrower-specific and macroeconomic, which most accurately reflect default risk at any point of time. Second, TTC PD models abstract from cyclical effects and measure credit risk over a longer time period encompassing a mix of economic conditions, exhibiting “stability” of ratings wherein dramatic changes are related mainly to fundamental and not transient economic fluctuations. However, in reality this distinction is not so well defined, as idiosyncratic factors can influence systematic conditions (e.g., credit contagion), and macroeconomic conditions can influence obligors’ fundamental creditworthiness.
There is an understanding in the industry of what distinguishes PIT and TTC constructs, typically defined by how PD estimates behave with respect to the business cycle. However, how this degree of “TTC-ness” vs. “PIT-ness” is defined varies considerably across institutions and applications, and there is no consensus around what thresholds should be established for certain metrics, such as measures of ratings volatility. As a result, most institutions characterize their rating systems as “Hybrid”. While this may be a reasonable description, as arguably the TTC and PIT constructs are ideals, this argument fails to justify the use cases of a PD model where there may be expectations that the model is closer to either one of these poles.
In this study, we develop empirical models that avoid formal definitions of PIT and TTC PDs, rather deriving constructs based upon common sense criteria prevalent in the industry, illustrating which validation techniques are applicable to these approaches. Based upon this empirical approach, we characterize PIT and TTC credit risk measures and discuss the key differences between both rating philosophies. In the process, we address the validation of PD models under both rating philosophies, highlighting that the validation of either system exhibits a particular set of challenges. In the case of the TTC PD rating models, in addition to flexibility in determining measurement of the cycle, there are unsettled questions around the rating stability metric thresholds. In the case of PIT PD rating models, there is the additional question of demonstrating the accuracy of PD estimates at the borrower level, which may not be obvious from observing average PD estimates versus default rates over time. Finally, considering both types of model, there is the question of whether the relative contributions of risk factors are conceptually intuitive, as we would expect that certain variables would dominate in either of these constructs.
There are some additional comments in order to motivate this research. First, there is a misguided perception in the literature and industry that PIT models contain only macroeconomic factors, and that TTC models contain only financial ratios, whereas from a modeling perspective there are other dimensions that define this distinction that we elaborate upon in this research. Furthermore, it may be argued that the validation of a TTC or PIT PD model involves assessing the validity of the cyclical factor, which if not available to the validator, may be accounted for only implicitly. One possibility is for the underlying cycle to be estimated from historical data based upon some theoretical framework, but in this study, we prefer commonly used macroeconomic factors in conjunction with obligor-level default data, in line with industry practice. Related to this point, we do not explicitly address how TTC PD models can be transformed into PIT PD rating models, or vice versa. While the advantage of such alternative constructs is that it can be validated based upon an assumption regarding the systematic factor using the methodologies applicable to each type of PD model, we prefer to validate each as specifically appropriate. The rationale for our approach is that the alternative runs the risk of introducing significant model and estimation risk, thereby leading to the validity of such validation being rendered questionable as compared to testing a pure PIT or TTC PD model.
We employ a long history of borrower level data sourced from Moody’s, around 200,000 quarterly observations from a large population of rated larger corporate borrowers (at least USD 1 billion in sales and domiciled in the U.S. or Canada), spanning the period from 1990 to 2015. The dataset comprises an extensive set of financial ratios, macroeconomic1 and equity market variables as candidate explanatory variables. We build a set of PIT models with a 1-year default horizon and macroeconomic variables, and a set of TTC models with a 3-year default horizon and only financial ratio risk factors.
The position of this research in the academic literature is at the intersection of two streams of inquiry. First, there are a series of empirical studies that focus on the factors that determine corporate default and the forecasting of this phenomenon, which include Altman (1968) and Duffie and Singleton (1999). At the other end of the spectrum, there are mainly theoretical studies that focus on modeling frameworks for either understanding corporate default (e.g., Merton (1974)), or else for perspectives on the TTC vs. PIT dichotomy (e.g., Kiff et al. 2004; Aguais et al. 2008; Cesaroni 2015). In this paper, we blend these considerations of theory and empirics, while also addressing the prediction of default and TTC/PIT construct.
We would like to emphasize that we believe the principal contribution of this paper to be mainly in the domain of practical application rather than methodological innovation. Many practitioners, especially in the wholesale credit and banking book space, still use the techniques employed in this paper. We see our contribution as proposing a structured approach to constructing a suite of TTC and PIT models, while combining reduced form and structural modeling aspects, and then by further proposing a framework for model validation. We would note that many financial institutions in this space do not have such a framework. For example, a lot of banks are still using TTC Basel models that are modified for PIT uses, such as stress testing or portfolio management. Furthermore, a preponderance of banks in this space do not employ hybrid financial and Merton-style models for credit underwriting. In sum, our contribution transcends the academic literature to address issues relevant to financial institution practitioners in the credit risk modeling space, which we believe uniquely positions this research.
The summary of our empirical results are as follows. We present the leading two models in each class of PIT and TTC design, both having favorable rank ordering power, intuitive relative weights on explanatory variables and rating mobility metrics. We also perform predictive accuracy analysis and specification testing, where we observe that the TTC designs are more challenged than the PIT designs in performance, and that unfortunately all designs show some signs of model misspecification. This observation argues for the consideration of alternative risk factors, such as equity market information. In view of this, from the market value of equity and accounting measures of debt for these firms, we are able to construct a Merton model-style distance-to-default (“DTD”) measure and construct hybrid structural reduced-form models, which we compare with the financial ratio and macroeconomic variable-only models. We show that adding DTD measures to our leading models does not invalidate the variables chosen, significantly augments model performance and in particular increases the obligor-level predictive accuracy of the TTC models.
Finally, let us introduce the remainder of this paper, which will proceed as follows. In Section 2, we review the relevant literature, where we address a survey of PD modeling in general, and then the issues around rating philosophy in particular. In Section 3, we address modeling methodology, which we partition into the domains of econometric modeling and statistical assumptions. Section 4 encompasses the empirical analysis of this study, as a description of the modeling data, estimation and validation results. In Section 5, we conclude and summarize the study, discuss policy implications and provide thoughts on avenues for future research.

2. Literature Review

Traditional credit risk models focus on estimating the PD, rather than on the magnitude of potential losses in the event of default (or loss-given-default—“LGD”), and typically specify “failure” to be bankruptcy filing, default, or liquidation, thereby ignoring consideration of the downgrades and upgrades in credit quality that are measured in mark-to-market (“MTM”) credit models. Such default mode (“DM”) models estimate credit losses resulting from default events only, whereas MTM models classify any change in credit quality as a credit event. There are three broad categories of traditional models used to estimate PD: expert systems, including artificial neural networks; rating systems; and credit scoring models.
The most commonly used traditional credit risk measurement methodology is the PD scoring model. The seminal model in this domain is the multiple discriminant analysis (“MDA”) of Altman (1968). Mester (1997) documents the widespread use of credit scoring models amongst banks, with 97% and 70% of them using them to approve credit card and small business loan applications, respectively. Credit scoring models are relatively inexpensive to implement and do not suffer from the subjectivity and inconsistency of expert systems. The spread of these models throughout the world was first surveyed by Altman and Narayanan (1997). The authors find that it is not so much the models’ differences across countries of diverse sizes and in various stages of development that stands out, but rather their similarities. A popularly used vended PD scoring model in the industry is the private firm model of Moody’s Analytics (“MA”; Dwyer et al. 2004).
Merton (1974) models equity in a levered firm as a call option on the firm’s assets with a strike price equal to the debt repayment amount. The PD is determined by valuing the call option using an iterative method to estimate the unobserved variables that determine this, the market value of assets and the volatility of assets, combined with the amount of debt liabilities that have to be repaid at a given credit horizon in order to calculate the firm’s distance-to-default (“DTD”). DTD is the number of standard deviations between the current asset values and the debt repayment amount, so the higher it is, the lower the PD. In an important example of this, in the CreditEdgeTM (“CE”) public firm model of MA, an empirical PD using historical default experience is estimated using a historical database of default rates to determine an empirical estimate of the PD, denoted the expected default frequency (“EDF”). As CE EDF scores are obtained from equity prices, they are more sensitive to changing financial circumstances than external credit ratings that rely predominately on credit underwriting data.
Modern methods of credit risk measurement can be traced to two alternative branches in the asset pricing literature of academic finance. In contrast to the option of the theoretic structural approach, which was pioneered by Merton (1974), a reduced form approach utilizing intensity-based models to estimate stochastic hazard rates follows a study pioneered by Jarrow and Turnbull (1995) and Duffie and Singleton (1999). These two schools of thought offer differing methodologies to accomplish the central task of all credit risk measurement models, which is the estimation of PDs. The structural approach models the economic process of default, whereas reduced form models decompose risky debt prices in order to estimate the random intensity process underlying default. The proprietary model Kamakura Risk Manager (“KRM”), where the econometric approach (the so-called Jarrow-Chava Model—“JCM”) is a reduced-form model based upon the research of Chava and Jarrow (2004), attempts to explicitly adjust for liquidity effects. However, noise from embedded options and other structural anomalies in the default risk-free market further distorts risky debt prices, thereby impacting the results of intensity-based models.
There are several more recent studies of particular relevance to this research that could be mentioned, but for the sake of brevity, we will refer the reader to the comprehensive literature review of Altman (2018). However, we will highlight one important study from a methodological perspective by Jiang (2021). This paper investigates the incentive of credit rating agencies to bias ratings using a semiparametric, ordered-response model. Using Moody’s rating data from 2001 to 2016, the author finds that firms related to Moody’s shareholders were more likely to receive better ratings.
In the recent literature on PD modeling, there has been a proliferation of studies investigating machine learning techniques. Kim (2005) applies adaptive learning networks (ALN), which is a nonparametric model, on both financial and non-financial variables to predict S&P credit ratings. Yu et al. (2008) proposes a six stage neural network ensemble learning model to assess a credit risk measurement on Japanese consumer credit card application approval and UK corporations. Khashman (2010) investigates three neural networks based on a back propagation learning algorithm on the German Credit Approval dataset. The architecture of these neural networks is different according to various parameters used in the model, such as hidden units, learning coefficients, momentum rate and random initial weight range. Pacelli and Azzollini (2011) provide an overview of different types of neural networks used in the credit-rating literature. Among all artificial intelligence techniques, support vector machines (“SVMs”) have demonstrated powerful classification abilities (Cortes and Vapnik 1995; Kim and Sohn 2010; Vapnik 2013; Xiao et al. 2016). Khandani et al. (2010) studies the general classification and regression tree technique (“CART”) on a combination of traditional credit factors and consumer banking transactions to predict consumer credit risk. Veronezi (2016) applied random forest (“RF”) and multilayer perceptron (“MLP”) techniques to predict corporate credit ratings using their financial data. Finally, in addition to all these frequently used techniques, some researchers study other approaches to provide a credit scoring model. Peng et al. (2011) introduce three multiple criteria decision making (“MCDM”) methods to evaluate classification algorithms for financial risk prediction. Chen (2012) investigates the rough set theory (“RST”) approach to classify Asian banks’ ratings. Finally, some researchers take one step further and integrate multiple techniques to achieve a higher accuracy, such as Yeh et al. (2012), who combine random forest feature selection with different approaches such as RST and SVM.
One of the key motivations behind the new generation of PD models being developed in the industry, as well as in this research, is to provide a suite of models that can accommodate multiple uses, such as TTC models for credit underwriting or risk weighted assets (“RWA”), as well as PIT models for credit portfolio management or early warning. One point to highlight is that despite the growing literature on TTC credit ratings, there is still no consensus on the precise definition of this concept, except the general agreement that TTC ratings are adjusted to not reflect cyclical effects. The Basel guidelines (BIS 2006) describe a PIT rating system as a construct that uses all currently available obligor-specific and aggregate information to estimate an obligor’s PD, in contrast to a TTC rating system that, while using obligor-specific information, tends not to adjust ratings in response to changes in macroeconomic conditions. However, the types of such cyclical effects and how they are measured differ considerably in the literature as well as in practice.
First, a number of studies have come up with a formal definition of the concepts of PIT and TTC PD estimates and rating systems. These include Loeffler (2004), who explores the TTC methodology in a structural credit risk model based on Merton (1974), in which a firm’s asset value is separated into a permanent and a cyclical component. In this model, TTC credit ratings are based on forecasting the future asset value of a firm under a stress scenario for the cyclical component. Kiff et al. (2004) investigate the TTC approach in a structural credit risk model in which the definition of TTC ratings follows the one applied by Hamilton et al. (2011), emphasizing that while anecdotal evidence from credit rating agencies confirm their use of the TTC approach, it turns out that there is no single and simple definition of what a TTC rating actually means. In contrast to the majority of studies in the literature that define PIT and TTC credit measures on the basis of a decomposition of credit risk into idiosyncratic and systematic risk factors, Aguais et al. (2008) follow a frequency decomposition view in which a firm’s credit measure is split up into a long-term credit quality trend and a cyclical component which are filtered from the firm’s original credit measure by using a smoothing technique based on the filter in Hodrick and Prescott (1997). Furthermore, the authors argue that in the existing literature, there has been little discussion about whether the C in TTC refers to the business cycle or the credit cycle and highlight that these cycles differ considerably from each other regarding their length. They describe a practical framework for banks to compute PIT and TTC PDs through converting PIT PDs into TTC PDs based on sector-specific credit cycle adjustments to the DTD credit measures of the Merton (1974) model derived from a credit rating agency’s rating or MA’s CE model. Furthermore, they qualitatively discuss key components of PIT-TTC default rating systems and how these systems can be implemented in banks. On the other hand, Cesaroni (2015) analyzes PIT and TTC default probabilities of large credit portfolios in a Merton single-factor model, where the author defines the TTC PD as the expected PIT PD, where the expectation is taken over all possible states of a systematic risk factor. Repullo et al. (2010) propose translating PIT PDs into TTC PDs by ex post smoothing the estimated PIT PDs with countercyclical scaling factors. In connection with the industry next-generation PD model redevelopment efforts and this research, with the objective of supporting TTC vs. PIT ratings, these results support not having formal definitions of TTC vs. PIT ratings, in light of the diversity of approaches seen in the literature.
Second, several studies analyze the ratings of major rating agencies regarding their PIT vs. TTC orientation. These include the Altman and Rijken (2004) who find, based on credit scoring models, that major credit rating agencies pursue a long-term view when assigning ratings, putting less weight on short-term default indicators and hence indicating TTC orientation. Loeffler (2013) shows for Standard and Poor’s and Moody’s rating data that these agencies have a policy of changing a rating only if it is unlikely to be reversed in the future and argues that this can explain the empirical finding that rating changes lag changes of an obligor’s default risk, consistent with the general view of TTC ratings. Altman and Rijken (2006) analyze the TTC methodology of rating agencies from an investor’s PIT perspective and quantify the effects of this methodology on the objectives of rating stability, rating timeliness, and performance in predicting defaults. Among other results, they find that TTC rating procedures delay migration in agency ratings, on average, by ½ a year on the downgrade side and ¾ of a year on the upgrade side, and that from the perspective of an investor’s one-year horizon, TTC ratings significantly reduce the short-term predictive power for defaults. Several papers, such as Amato and Furfine (2004) and Topp and Perl (2010), analyze actual rating data and show that these ratings vary with the business cycle, even though these ratings are supposed to be TTC according to the policies of the credit rating agencies. Loeffler (2013) estimates long-run trends in market-based measures of one-year PDs using different filtering techniques. They show that agency ratings contribute to the identification of these long-run trends, thus providing evidence that credit rating agencies follow to some extent a TTC rating philosophy. To summarize, many studies find that the ratings of major rating agencies show both PIT as well as TTC characteristics, which is consistent with the notion of hybrid rating systems. In connection with this research and industry redevelopment efforts, with the objective of supporting TTC vs. PIT ratings, these results support not having “hard” mobility metric thresholds in evaluating the model output.
Third, the rating philosophy is important from a regulatory and supervisory perspective, as well as from a credit underwriting perspective, not least because capital requirements for banks and insurance firms depend upon credit risk measures. Studies that discuss TTC PDs in the context of Basel II or as a remedy for the potential pro-cyclical nature of Basel II (BIS 2006) include Repullo et al. (2010), who compare smoothing the input of the Basel II formula by using TTC PDs or smoothing its output with a multiplier based on GDP growth. They prefer the GDP growth multiplier because TTC PDs are worse in terms of simplicity, transparency, cost of implementation, and consistency with banks’ risk pricing and risk management systems. Cyclicality of credit risk measures also plays an important role in the context of Basel III (BIS 2011), which states that institutions should have sound internal standards for situations where realized default rates deviate significantly from estimated PDs, and that these standards should take account of business cycles and similar systematic variability in default experience. In two separate consultation papers issued in 2016, The European Banking Authority (2016) proposes to explicitly leave the selection of the rating philosophy to the banks, whereas the Basel Committee for Banking Supervision (“BCBS”; BIS 2016—“Bank for International Settlements”) proposes requiring banks to follow a TTC approach to reduce the variability in PDs and thus RWAs across banks.
Finally, the rating philosophy should influence the validation of rating systems, but the challenges to validate TTC models have been largely ignored in the literature. The BCBS (BIS 2005) further stresses that in order to evaluate the accuracy of PDs reported by banks supervisors need to adapt their PD validation techniques to the specific types of banks’ credit rating systems, in particular with respect to their PIT vs. TTC orientation. However, methods to validate rating systems have paid very little attention to the rating philosophy or focused on PIT models. For example, Cesaroni (2015) observes that predicted default rates are PIT, and thus the validation of a rating system “should” operate on PIT PDs from a theoretical perspective. Petrov and Rubtsov (2016) explicitly mention that they have not yet developed a validation framework consistent with their PIT/TTC methodology.
To conclude this section, we mention an important paper on PIT PD modeling by Đurović (2019) where a framework is proposed for retail PD modeling in accordance with the International Reporting Financial Standards 9 accounting regulation. The model is based upon a term structure of PD conditional to the given forward-looking macroeconomic dynamics. Due to data limitations, a key impediment in forward-looking modelling, the author proposes and illustrates a model averaging technique for the quantification of macroeconomic effects on the PD.

3. Methodology

In this section, we outline the econometric technique and statistical PD modeling in the industry. In principle, for classification tasks including default prediction, while one could use the same loss functions as those used for regression (i.e., the ordinary least squares criterion; “OLS”) in order to optimize the design of the classifier, this would not be the most reasonable way to approach such problems. This is because in classification, the target variable is discrete in nature; hence, alternative measures to those employed in regression are more appropriate for quantifying the quality of model fit. This discussion could be motivated by the classification problem for default prediction through Bayesian decision theory, which has conceptual simplicity and aligns well with common sense and possesses a strong optimality flavor with respect to the probability of an error in classification. However, given that the focus and contribution of this paper does not lie in the domain of econometric technique, we will defer such discussion and focus on the logistic regression model (“LRM”) technique, as it is widely employed and well understood in the literature and practice.
Considering the 2-class { ω i } i = 1 2 case for the LRM that is relevant to PD modeling, the first step is to express the log-odds (or the logit function) of the posterior probabilities as a linear function of the risk factors:
ln ( P ( ω 1 | x ) P ( ω 2 | x ) ) = θ T x ,
where x = ( x 1 , .. , x k ) k is a k dimensional feature vector and θ = ( θ 1 , .. , θ k ) k is a vector of coefficients, and we define x 1 = 1 so that the intercept is subsumed into θ . In that, P ( ω 1 | x ) + P ( ω 2 | x ) = 1 :
P ( ω 1 | x ) = 1 1 + exp ( θ T x ) = σ ( θ T x ) ,
where the function σ ( θ T x ) is known as the logistic sigmoid (or sigmoid link) and has the mathematical properties of a cumulative distribution function that ranges between 0 and 1, with a domain on the real line. Intuitively, this can be viewed as the conditional PD of a score θ T x where higher values indicate greater default risk.
We may estimate the parameter vector θ by the method of maximum likelihood estimation (“MLE”) given a set of training samples, with observations of explanatory variables { x n } n = 1 N and binary dependent variables { y n } n = 1 N , where y n { 0 , 1 } . The likelihood function is given by:
P ( y 1 , , y N | θ ) = n = 1 N ( σ ( θ T x n ) ) y n ( 1 σ ( θ T x n ) ) 1 y n .
The practice is to consider the negative log-likelihood function (or the cross-entropy error), a monotonically increasing transformation of (3), for the purposes of computational convenience:
L ( θ ) = n = 1 N y n ln ( σ ( θ T x n ) ) + ( 1 y n ) ln ( 1 σ ( θ T x n ) ) .
Equation (4) is minimized with respect to θ using iterative methods, such as the steepest descent of Newton’s scheme.
We note an important property of this model that is computationally convenient and leads to stable estimation under most circumstances. Since σ ( θ T x n ) ( 0 , 1 ) according to the properties of the sigmoid link function, it follows that the variance-covariance matrix R is positive definite, which implies that the Hessian matrix 2 L ( θ ) is positive definite. In turn, this implies that the negative log-likelihood function L ( θ ) is convex, and as such this guarantees the existence of a unique minimum to this optimization. However, maximizing the likelihood function may be problematic in the case where the development dataset is linearly separable. In such a case, any point on the hyperplane θ ^ M L E T x = 0 (out of an infinite number of such hyperplanes) that solves the classification task and separates the training samples in each class does so perfectly, which means that every training point is assigned a posterior probability of class membership equal to one (or σ ( θ ^ M L E T x ) = 1 2 ). In this case, the MLE procedure forces the parameter estimate to be infinite ( θ ^ M L E T ), which means geometrically that the sigmoid link function approaches a step function and not an s-curve as a function of the score. This is basically a case of overfitting the development sample, which can be controlled by techniques such as k-fold cross-validation, or including a regularization term inside a corresponding cost function that controls the magnitudes of the parameter estimates (e.g., LASSO techniques for a linear penalty function C ( θ | λ ) = λ θ with a cost parameter λ ).
We conclude this section by discussing the statistical assumptions underlying the LRM model. Logistic regression does not make many of the key assumptions of OLS regression regarding linearity, normality of error terms, homoscedasticity of the error variance and the measurement level. Firstly, LRM does not assume a linearity relationship between the dependent variable and estimator2, which implies that we can accommodate non-linear relationships between the independent and dependent variables without non-linear transformations of the former (although we may choose to do so for other reasons, such as treating outliers), which yields more parsimonious and more intuitive models. Another way to look at this is since we are applying the log-odds transformation to posterior probabilities, by construction we have a linear relationship in the risk drivers and do not necessarily require additional transformations. Secondly, the independent variables do not need to be multivariate normal, which equivalently means that the error terms need not be multivariate normal either. While there is an argument that if the error terms are actually multivariate normal (which is probably not true in practice), then imposing this assumption leads to efficiency gains and possibly a more stable solution; at the same time, there are many more parameters to be estimated. That is because in the normal case we not only have to estimate the k regression coefficients θ = ( θ 1 , .. , θ k ) k , but we also have to estimate the entire variance-covariance matrix (i.e., the variance-covariance matrix in the LRM is a function of θ ), which is O ( k 2 2 ) additional operations and could lead to a more unstable model depending upon data availability as well as more computational overhead. Thirdly, since the variance-covariance matrix also depends on x by construction through the sigmoid link function, variances need not be homoscedastic for each level of the independent variables (while if we imposed a normality assumption, we would require this assumption to hold as well). Lastly, the LRM can handle ordinal and nominal independent variables as they need not be metric (i.e., interval or ratio scaled), which leads to more flexibility in model construction and again avoids counterintuitive transformations and more parameters to be estimated.
However, some other assumptions still apply in the LRM setting. First, the LRM requires the dependent variable to be binary, while other approaches (e.g., ordinal logistic regression—“OLR” or the multinomial regression model—“MRM”) allow the dependent variable to be polytomous, which implies more granularity in modeling. This is because reducing an ordinal or even metric variable to a dichotomous level loses a lot of information, which makes this test inferior compared to OLR in these cases. In the case of PD modeling, if credit states other than default are relevant (e.g., significant downgrade short of default, or prepayment), then this could result in biased estimates and mismeasurement of default risk. However, we note in this regard that for many portfolios, data limitations (especially for large corporate or commercial and industrial portfolios) prevents application of OLR for more states than default (e.g., prepayment events may not be identifiable in data), and conceptually we may argue that observations of ratings have elements of expert judgment and are not “true” events (although in wholesale, the definition of default is partly subjective). An assumption related to this is the independence of irrelevant alternatives, which states that relative odds of a binary outcome should not depend on other possible outcomes under consideration. In the statistics and econometrics literature, there is debate not only about how critical this assumption is, but also on ways to test this assumption and the value of such tests (Cheng and Long 2006; Fry and Harris 1996; Hausman and McFadden 1984; Small and Hsiao 1985).
Another important assumption is that the LRM requires the observations to be independent, which means that that the data points should not come from any dependent samples design (e.g., matched pairings or panel data.) While obviously that is not completely the case in PD modeling in that we have dependent observations, in practice this may not be a very material violation, since if we are capturing most or all of the relevant factors influencing default, then anything else is likely to be idiosyncratic (especially if we are including macroeconomic factors). While in this implementation we are not assuming a parametric distribution for the error terms in the LRM, there are still certain properties that the errors should exhibit, in order for us to have some assurance that the model is not grossly mis-specified (e.g., symmetry around zero, lack of outliers.) However, there is some debate in the literature on the criticality of this assumption, as well as the best way to evaluate LRM residuals (Li and Shepherd 2012; Liu and Zhang 2017).
Finally, we conclude this section by a discussion of the model methodology within the empirical context. The modeling approach as outlined in this section, and the model selection process as elaborated upon in subsequent sections, is common to both PIT and TTC constructs. However, we impose the constraint that only financial factors are considered in the TTC construct, while both the former and macroeconomic variables are considered for the PIT models. This is in addition to the difference in default horizon and other model selection criteria, which results in a differentiation in the TTC and PIT outcomes, in terms of rating mobility and relative factor weights considered intuitive in each construct—i.e., high (lower) rating mobility, and greater (lower) weight on shorter (longer) term financial factors for the PIT (TTC) models.

4. Empirical Analysis

4.1. Description of Modeling Data

The following data are also used for the development of the models in this study:
  • CompustatTM: Standardized fundamental and market data for publicly traded companies including financial statement line items and industry classifications (Global Industry Classification Standards—“GICS” and North American Industry Classification System—“NAICS”) over multiple economic cycles from 1979 onward. These data include default types such as bankruptcy, liquidation, and rating agency’s default rating, all of which are part of the industry standard default definitions.
  • Moody’s Default Risk ServiceTM (“DRS”) Rating History: An extensive database of rating migrations, default and recovery rates across geographies, regions, industries, and sectors.
  • Bankruptcydata.com: A service provided by New Generation Research, Inc. (“NGR”) providing information on corporate bankruptcies.
  • The Center for Research in Security PricesTM (“CRSP”) U.S. Stock Databases: This product is comprised of a database of historical daily and monthly market and corporate action data for over 32,000 active and inactive securities with primary listings on the NYSE, NYSE American, NASDAQ, NYSE Arca and Bats exchanges and include CRSP broad market indexes.
A series of filters are applied to this Moody’s population to construct a population that is closely aligned with the U.S.’s large corporate segment of companies that are publicly rated and have publicly traded equity. In order to construct and achieve this using Moody’s data, the following combination of NAICS and GICS industry codes, region and historical yearly Net Sales are used:
  • Non-commercial and industrial (“C&I”) obligors defined by the following NAICS codes below, are not included in the population:
    • Financials
    • Real Estate Investment Trust (“REIT” or Real Estate Operating Company (“REOC”)
    • Government
    • Dealer Finance
    • Not-for-Profit, including museums, zoos, hospital sites, religious organizations, charities, and education
  • A similar filter is performed according to GICS (see below) classification:
    • Education
    • Financials
    • Real Estate
  • Only obligors based in the U.S. and Canada are included.
  • Only obligors with maximum historical yearly Net Sales of at least USD 1B are included.
  • There are exclusions for obligors with missing GICS codes, and for modeling purposes obligors are categorized into different industry segments on this basis.
  • Records prior to 1Q91 are excluded, the rationale being that capital markets and accounting rules were different before the 1990s, and the macroeconomic data used in the model development are only available after 1990. As one-year change transformations are amongst those applied to the macroeconomic variables, this cutoff is advanced a year from 1990 to 1991.
  • Records that are too close to a default event are not included in the development dataset, which is an industry standard approach, the rationale being that the records of an obligor in this time window do not provide information about future defaults of the obligor, but more likely the existing problems that the obligor is experiencing. Furthermore, a more effective practice is to base this on data that are 6–18 (rather than 1–12) months prior to the default date, as this typically reflects the range of timing between when statements are issued and when ratings are updated (i.e., usually it takes up to six months, depending on time to complete financials, receive them, input, and complete/finalize the ratings).
  • In general, the defaulted obligors’ financial statements after the default date are not included in the modeling dataset. However, in some cases, obligors may exit a default state or “cure” (e.g., emerge from bankruptcy), in which cases, only the statements between default date and cured date are not included.
In our opinion, these data exclusions are reasonable and in line with industry standards, sufficiently documented and supported and do not compromise the integrity of the modeling dataset.
The time periods considered for the Moody’s data is the development period Q191–Q415. Shown in Table 1 below is the comparison of the modeling population by GICS industry sectors, where for each sector defaulted obligors columns represent the percent of defaulted obligors in the sector out of entire population. The data are concentrated in Consumer Discretionary (20%), Industrials (17%), Tech Hardware and Communications (12%), and Energy except E&P (11%).
A similar industry composition is shown below in Table 2 according to the NAICS classification system.
The model development dataset contains financial ratios and default information that are based upon the most recent data available from DRSTM, CompustatTM and bankruptcydata.com, so that the data are timely and a priori should be give the benefit of the doubt with respect to favorable quality. Furthermore, the model development time period of 1Q91–4Q15 spans two economic downturn periods and complete business cycles, the length of which are another factor supporting a verdict of good quality.
Related to this point, we plot the yearly one- and three-year default rates in the model development dataset, shown below in Figure 1. As the goal of model development is to establish for each risk driver that the preliminary trends observed match that of our expectations, there is sufficient variation in this data to support quantitative methods of parameter estimation, further supporting the suitability of the data from a quality perspective.
The following are the categories and names of the explanatory variables appearing in the final candidate models3:
  • Size: Change in Total Assets (“CTA”), Total Liabilities (“TL”)
  • Leverage: Total Liabilities to Total Assets Ratio (“TLTAR”)
  • Coverage Cash Use Ratio (“CUR”), Debt Service Coverage Ratio (“DSCR”)
  • Efficiency: Net Accounts Receivables Days Ratio (“NARDR”)
  • Liquidity: Net Quick Ratio (“NQR”), Net Working Capital to Tangible Assets Ratio (“NWCTAR”)
  • Profitability: Before Tax Profit Margin (“BTPM”)
  • Macroeconomic” Moody’s 500 Equity Price Index Quarterly Average Annual Change (“SP500EPIQAAC”), Consumer Confidence Index Annual Change (“CCIAC”)
  • Merton Structural: Distance-to-Default (“DTD”)
In the subsequent tables (Table 3, Table 4, Table 5 and Table 6) we present the summary statistics for the variables that appear in our final models. These final models were chosen based upon an exhaustive search algorithm in conjunction with 5-fold cross-validation, and we have chosen the leading two models in either the PIT and TTC constructs, as well as incorporating the DTD risk factor or not4. The counts and statistics vary slightly across models, as the Python libraries that we utilize do not accommodate missing values, but nonetheless the differences in these statistics across models are minimal. The counts of observations vary narrowly from about 150 K to observations of about 165 K. The default rate is consistently about 1% (3%) for the PIT (TTC) models.
The Areas Under the Receiver Operating Characteristic Curve (“AUC”) and missing rates for the explanatory variables are summarized in Table 7 at the end of this section5. The univariate AUCs range from 0.6 to 0.8 across risk factors, with some expected deterioration when going from the 1- to 3-year default horizon, which is indicative of strong default rank ordering capability amongst these explanatory variables. The missing rates are generally between 5 and 10%, which is indicative of favorable data quality to support model development.

4.2. Econometric Specifications and Model Validation

In the subsequent tables we present the estimation results and in-sample performance statistics for our final models.
We shall first discuss general features of the model estimation results. Across models, signs of coefficient estimates are in line with economic intuition, and significant levels are indicative of very precisely estimated parameters. AUC statistics indicate that models have a strong ability to rank order default risk, and while as expected this level of discriminatory power declines somewhat at the longer default horizon, in all cases the levels are in line with favorable performance by industry standards.
Regarding the measures of predictive accuracy, the Hosmer–Lemeshow tests (“HL”) show that the PIT models fit the data well, while the TTC models fail to do so. However, we observe that when we introduce DTD into the TTC models, predictive accuracy increases markedly, as the p-values of the HL statistics increase significantly to the point where there is marginal evidence of adequate fit (i.e., the p-values indicate that the TTC models fail only with significance levels greater than 5%). AIC measures are also much higher in the TTC vs. PIT models, but do decline when the DTD risk factors are introduced, consistent with the HL statistics.
We next discuss general features of the estimation that speak to the TTC or PIT qualities of the models. As expected, the TTC models have much lower Singular Value Decomposition (“SVD”) rating mobility metrics as compared to the PIT models, in the range of about 30–35% in the former as compared about a 70–80% range in the latter. The relative magnitude of the factor contribution (“FC”) measures, which quantify the proportion of the total score that is accounted for by an explanatory variable, also support that the models are exhibiting TTC and PIT characteristics. This is because intuitively, we observe that in the TTC models there is greater weight on categories considered more important in credit underwriting (i.e., size, leverage and coverage), whereas in the PIT models this trend is reversed and there is greater emphasis on factors considered more critical to early warning or credit portfolio management (i.e., liquidity, profitability or efficiency).
In Table 8 below, we show the estimation results and in-sample performance measures for PIT Model 1 with both financial and macroeconomic explanatory variables for a 1-year default horizon. FCs are higher on more PIT relevant factors as contrasted to factors considered more salient to TTC constructs. Financial risk factors carry a super-majority of the FC compared to the macroeconomic factors, about 90% in the former as compared to about 10% in the latter, which is a common observation in the industry for PD scorecard models. The model estimation results provide evidence of high discriminatory power, as the AUC is 0.8894. The AIC is 7231.9, which, relative to the TTC models, is indicative of favorable predictive accuracy, which is corroborated by the very high the HL p-value of 0.5945. Finally, the SVD mobility metric 0.7184 supports that this model exhibits PD rating volatility consistent with a PIT model.
In Table 9 below, we show the estimation results and in-sample performance measures for PIT Model 2 with financial, macroeconomic and the Structural–Merton DTD as explanatory variables for a 1-year default horizon. The results are similar to PIT Model 1 in terms of signs of coefficient estimates, statistical significance and relative FCs on financial and macroeconomic variables. DTD enters the model without any deleterious effects on the statistical significance of financial ratios, although the relative contribution of 0.17 absorbs a fair amount of the other variables’ factor weights and eclipses that of the macroeconomic variables. That said, we observe that, collectively, financial and Merton DTD risk factors carry a super-majority of the FC compared to the macroeconomic factors, about 89% in the former as compared to about 11% in the latter, which is a common observation in the industry for PD scorecard models. The model estimation results provide evidence of high discriminatory power as the AUC is 0.8895, which is immaterially lower than then the Model 1 version without DTD. The AIC is 7290.0, which, relative to the TTC models, is indicative of favorable predictive accuracy ad also indicates an improvement in fit as compared to the Model 1 version without the structural model DTD variable, which is corroborated by the very high HL p-value of 0.5782. Finally, the SVD mobility metric 0.7616 supports that this model exhibits PD rating volatility consistent with a PIT model, and moreover the addition of the DTD Merton model proxy improves the PIT aspect of this model relative to its Model 1 counterpart which does not have this feature.
In Table 10 below, we show the estimation results and in-sample performance measures for TTC Model 1 with financial explanatory variables for a 3-year default horizon. The signs of coefficient estimates are intuitive, as all are negative (TL, DSCR, NQR and BTBF), except for TLTAR, which is positive. Parameter estimates are all highly statistically significant. FCs are higher on more TTC relevant factors (i.e., 0.17, 0.31 and 0.23 for TL, TLTAR and DSCR, respectively) as contrasted to the factors considered more salient to PIT constructs (i.e., 0.14 for NQR and BTPM). The model estimation results provide evidence of high discriminatory power, as the AUC is 0.8232, but which as expected is somewhat lower than in the comparable PIT models not containing DTD where they range in the range of 0.88–0.89. The AIC is 17,751.6, which, relative to the comparable PIT models, is indicative of a rather worse predictive power, which is corroborated by the very low HL P-Value of 0.0039, which rejects the null hypothesis that the model is properly specified with respect to a “saturated model” that perfectly fits the data. Finally, the SVD mobility metric 0.3295 supports that this model exhibits PD rating volatility consistent with a TTC model.
In Table 11 below, we show the estimation results and in-sample performance measures for TTC Model 2 having financial and the Structural-Merton DTD explanatory variables for a 3-year default horizon. The signs of coefficient estimates are intuitive, as all are negative (DSCR, NQR and BTBF), except for TLTAR which is positive, and as expected, DTD has a negative parameter estimate. Parameter estimates are all highly statistically significant. FCs are higher on more TTC-relevant factors (i.e., 0.37 and 0.29 for TLTAR and DSCR, respectively) as contrasted to the factors considered more salient to PIT constructs (i.e., 0.08 and 0.09 for NQR and BTPM, respectively). Note that in this model, adding the DTD explanatory variable results in TL not being statistically significant, and we drop it from this specification; additionally, the FC of DTD is 0.17, so that the financial factors still carry most of the relative weight. The model estimation results provide evidence of high discriminatory power, as the AUC is 0.8226, but which as expected is somewhat lower than in the comparable PIT models containing DTD, where they vary in the range of 0.88–0.89. The AIC is 11,834.6, which relative to the comparable PIT models containing DTD (although this is lower than for TTC model 1, so that DTD improves fir materially), is indicative of a rather worse predictive power, which is corroborated by the somewhat low HL p-Value of 0.0973, which rejects the null hypothesis that the model is properly specified with respect to a “saturated model” that perfectly fits the data at the 5% significance level, where we would note that this marginal rejection is an improvement over the comparable TTC version of this model without the Merton DTD variable. Finally, the SVD mobility metric of 0.3539 supports that this model exhibits a PD rating volatility consistent with a TTC model, but we note that the rating volatility measure is somewhat higher than in the comparable TTC model not containing the DTD variable.
We conclude this section by comparing our results to other similar studies in potentially different methodological or empirical contexts. Our results are consistent with a series of empirical studies that focus on the factors that determine corporate default and the forecasting of this phenomenon, (e.g., Altman 1968; Jarrow and Turnbull 1995; Duffie and Singleton 1999), in that we confirm that Merton DTD measures may augment the predictive power of models featuring only financial or macroeconomic factors. Where we innovate in this dimension is in incorporating the TTC vs. PIT constructs as separate models, which addresses this stream of literature (e.g., Kiff et al. 2004; Aguais et al. 2008; Cesaroni 2015), thereby blening these considerations of theory and empirics, while also addressing the prediction of default.

5. Conclusions

In this study, we have developed alternative simple and general econometrically estimated PD models of both TTC and PIT designs. We have avoided formal definitions of PIT vs. TTC PDs, and rather derived constructs based upon common sense criteria prevalent in the industry, and in the process have illustrated which validation techniques are applicable to these different approaches. Based upon this empirical approach to modeling, we have characterized PIT and TTC credit risk measures and have discussed the key differences between both rating philosophies. In the process, we have addressed the validation of PD models under both rating philosophies, highlighting that the validation of either rating systems exhibits particular challenges. In the case of the TTC PD rating models, in addition to the flexibility in determining the nature of the cycle underlying and its measurement, we have answered questions around the thresholds for rating stability metrics that are not settled. In the case of PIT PD rating models, we have spoken to questions around the rigorous demonstration that PD estimates are accurately estimated at the borrower level, which may not be obvious from optically observing the degree to which average PD estimates track default rates over time. Considering both TTC and PIT PD models, we have addressed the issue of whether the relative contributions of risk factors are conceptually intuitive, the expectation being that certain variables would dominate in either of these constructs.
We have observed that the validation of a PD TTC or PIT rating model involves assessing the economic validity of the cyclical factor, which if, with respect to the specific modeling methodology, may not be available to the validator, or else may be accounted for only implicitly. One possibility is for the underlying cycle of the PD rating model to be estimated from historical rating and default data based upon some theoretical framework. However, in this study we have chosen to propose commonly used macroeconomic factors in conjunction with obligor-level default data, in line with the industry practice of building such models.
We have highlighted features of PIT vs. TTC model design in our empirical experiment, yet have not explicitly addressed how TTC PD rating models can be transformed into corresponding PIT PD rating models, or vice versa. While the advantage of such a construct is that the latter can then be validated based upon an assumption regarding the systematic factor and validated using the methodologies applicable to each type of PD rating models, we have chosen to validate each as specifically appropriate. The rationale for our approach is that the alternative runs the risk of introducing significant model risk (i.e., if the theoretical model is mis-specified), as well as additional estimation risk (i.e., if the parameter estimates need to be extracted from historical data), thereby leading to the validity of such validation being rendered questionable as compared to testing a pure PIT or TTC PD rating model.
We have employed a long history of borrower-level data sourced from Moody’s, around 200,000 quarterly observations from a large population of rated larger corporate borrowers (at least USD 1 billion in sales and domiciled in North America), spanning the period from 1990 to 2015. The dataset comprises an extensive set of financial ratios, as well as macroeconomic variables as candidate explanatory variables. We built a set of PIT models with a 1-year default horizon and macroeconomic variables, and a set of TTC models with a 3-year default horizon and only financial ratio risk factors. We presented the leading two models in each class of PIT and TTC designs, both having favorable rank ordering power, and propose the leading model based upon the relative weights on explanatory variables (i.e., certain variables are expected to have different relative contributions in TTC vs. PIT constructs), as well as rating mobility metrics (e.g., PIT models are expected to show more responsive ratings and TTC models more stable ratings.) We also performed specification testing, where we observe that the TTC designs are more challenged than the PIT designs in this dimension of performance. The latter observation argues for the consideration of alternative risk factors, such as equity market information. In view of this, from the market value of equity and accounting measures of debt for these firms, we were able to construct a Merton model-style DTD measure and construct hybrid structural-reduced form models, which we compare with the financial ratio and macroeconomic variable-only models.
We showed that adding DTD measures to our leading models does not invalidate the variables chosen, significantly augments model performance and in particular increases the obligor-level predictive accuracy and fit to the data of the TTC models. We also found that while all classes of models have high discriminatory power, the TTC models actually perform better along the dimension of predictive accuracy or fit to the data when we incorporate the DTD risk factor.
There are various implications for model development and validation practice, as well as supervisory policy, which can be gleaned from this study. First, it is a better practice to take into consideration the use case for a PD model in designing the model, from a fitness for purpose perspective. That said, we believe that a balance must be struck, since it would be infeasible to have separate PD models for every single use6, and what we are arguing for is a parsimonious number of separate designs for major classes that satisfy a set of uses with common requirements. Second, in validating PD models that are designed according to TTC or PIT constructs, in validating such models we should have different emphases on which model performance metrics are scrutinized. In light of these observations and contributions to the literature, we believe that this study provides valuable guidance to model development, model validation and supervisory practitioners. Additionally, we believe that our discourse has contributed to resolving the debates around which class of PD models is best fit for purpose in large corporate credit risk applications, showing evidence that reduced form and Merton structural models can be combined in hybrid frameworks in order to achieve superior performance along the lines of better fit to the data as well as lower measured model risk due to model mis-specification.
Finally, we would like to emphasize that we believe the principal contribution of this paper to be mainly in the domain of practical application rather than methodological innovation. Many practitioners, especially in the wholesale credit and banking book space, still use the techniques employed in this paper. We see our contribution as proposing a structured approach to constructing a suite of TTC and PIT models, while combining reduced form and structural modeling aspects, and then by further proposing a framework for model validation. We would note that many financial institutions in this space do not have such a framework. For example, a lot of banks are still using TTC Basel models that are modified for PIT uses, such as stress testing or portfolio management. Furthermore, a preponderance of banks in this space does not employ hybrid financial and Merton-style models for credit underwriting. In sum, our contribution transcends the academic literature to address issues relevant to financial institution practitioners and prudential supervisors in the credit risk modeling space, which we believe uniquely positions this research.
That said, there are various limitations of this study that should be kept at the front of mind in assessing this contribution. First, there are alternative econometric techniques that we have not considered, such as machine learning models. Second, we have limited our inquiry to a large corporate asset class, and results could differ for other portfolio segments. Third, our framework does not admit the consideration of industry specificity in model specification. Fourth, we have not considered the explicit quantification of model risk in our model validation framework. Finally, we have not addressed jurisdictions apart from the U.S. or a consideration of geographical effects.
Given the wide relevance and scope of the topics addressed in this study, there is no shortage of fruitful avenues along which we could extend this research. Some proposals include, but are not limited to:
  • alternative econometric techniques, such as various classes of machine learning models, including non-parametric alternatives;
  • asset classes beyond the large corporate segment, such as small business, real estate or even retail;
  • applications to stress testing of credit risk portfolios7;
  • the consideration of industry specificity in model specification;
  • the quantification of model risk according to the principle of relative entropy;
  • different modeling methodologies, such as ratings migration or hazard rate models; and
  • datasets in jurisdictions apart from the U.S., or else pooled data encompassing different countries with a consideration of geographical effects.

Funding

This research has received no external funding nor any other form of support from any outside parties.

Institutional Review Board Statement

This is not applicable as this research did not employ human or animal subjects.

Informed Consent Statement

This is not applicable as this research did not employ human subjects.

Data Availability Statement

The data used in this study combines proprietary and publicly available sources, therefore the dataset employed is not available.

Conflicts of Interest

The author declares no conflict of interest.

Notes

1
A key limitation of this construct is that with macroeconomic variables common to all obligors, we are challenged in capturing the cross-sectional variation in the sensitivity to systematic factors across firms. This could be addressed by interaction terms between macroeconomic variables and firm specific factors or industry effects, which can be explored in future research.
2
Note that linearity does not mean that the dependent variable has a linear relationship with the explanatory variables (i.e., we can have non-linear transformations of the latter), but rather that the estimator is a linear function (or weighted average) of the dependent variable, which implies that we can obtain our estimator analytically using linear algebra operations as opposed to iterative techniques such as in the LRM.
3
All candidate explanatory variables are Winsorized at either the 10th, 5th or 1st percentile levels, at either tail of the sample distribution, in order to mitigate the influence of outliers or contamination in data, according to a customized algorithm that analyzes the gaps between these percentiles and caps/floors where these are maximal.
4
Clarifying our model selection process, we balance multiple criteria, both in terms of statistical performance and some qualitative considerations. Firstly, all models have to exhibit the stability of factor selection (where the signs on coefficient estimates are constrained to be economically intuitive) and statistical significance in k-fold cross validation sub-sample estimation. However, this is constrained by the requirement that we have only a single financial factor chosen from each category. Then, the models that meet these criteria are evaluated according to statistical performance metrics such as AIC and AUC, as well as other considerations such as rating mobility and relative factor weights.
5
The plots are omitted for the sake of brevity and are available upon request.
6
We have observed in the industry that a typical bank can have a number of applications for its PD models far into the double digits, and it would be infeasible to have completely separately developed PD models for all such applications.
7
Refer to Jacobs et al. (2015) and Jacobs (2020) for studies that address model validation and model risk quantification methodologies. These studies include supervisory applications such as comprehensive capital analysis and review (“CCAR”) and current expected credit loss (“CECL”), and further feature alternative credit risk model specifications (including machine learning model), macroeconomic scenario generation techniques, as well as the quantification and aggregation of model risk (including the principle of relative entropy).

References

  1. Aguais, Scott D., Lawrence R. Forest Jr., Martin King, and Claire Lennon Marie. 2008. Designing and Implementing a Basel II Compliant PIT–TTC Ratings Framework. White Paper, Barclays Capital, August. Available online: https://mpra.ub.uni-muenchen.de/6902/1/MPRA_paper_6902.pdf (accessed on 30 March 2021).
  2. Altman, Edward I. 1968. Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy. Journal of Finance 23: 589–609. Available online: https://pdfs.semanticscholar.org/cab5/059bfc5bf4b70b106434e0cb665f3183fd4a.pdf (accessed on 3 June 2020). [CrossRef]
  3. Altman, Edward I. 2018. A Fifty-year Retrospective on Credit Risk Models, the Altman Z-Score Family of Models and their Applications to Financial Markets and Managerial Strategies. Journal of Credit Risk 14: 1–34. Available online: https://mebfaber.com/wp-content/uploads/2020/11/Altman_Z_score_models_final.pdf (accessed on 9 February 2021). [CrossRef] [Green Version]
  4. Altman, Edward I., and Herbert A. Rijken. 2004. How Rating Agencies Achieve Rating Stability. Journal of Banking and Finance 28: 2679–714. Available online: https://archive.nyu.edu/bitstream/2451/26557/2/FIN-04-031.pdf (accessed on 1 September 2019). [CrossRef] [Green Version]
  5. Altman, Edward I., and Herbert A. Rijken. 2006. A Point-in-Time Perspective on Through-the-Cycle Ratings. Financial Analyst Journal 62: 54–70. Available online: https://www.tandfonline.com/doi/abs/10.2469/faj.v62.n1.4058 (accessed on 9 May 2019).
  6. Altman, Edward I., and Paul Narayanan. 1997. An International Survey of Business Failure Classification Models. In Financial Markets, Institutions and Instruments. New York: New York University Salomon Center, vol. 6, Available online: https://onlinelibrary.wiley.com/doi/abs/10.1111/1468-0416.00010 (accessed on 2 August 2019).
  7. Amato, Jeffrey D., and Craig H. Furfine. 2004. Are Credit Ratings Procyclical. Journal of Banking and Finance 19: 2541–677. Available online: https://www.sciencedirect.com/science/article/abs/pii/S0378426604001049 (accessed on 8 February 2020). [CrossRef]
  8. Cesaroni, Tatiana. 2015. Procyclicality of Credit Rating Systems: How to Manage It. Journal of Economics and Business 82: 62–82. Available online: https://www.bancaditalia.it/pubblicazioni/temi-discussione/2015/2015-1034/en_tema_1034.pdf (accessed on 20 April 2020). [CrossRef] [Green Version]
  9. Chava, Sudhir, and Robert Jarrow. 2004. Bankruptcy Prediction with Industry Effects. Review of Finance 8: 537–69. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=287474 (accessed on 21 January 2019). [CrossRef]
  10. Chen, Yuo-Shivang. 2012. Classifying credit ratings for Asian banks using integrating feature selection and the cpda-based rough sets approach. Knowledge-Based Systems 26: 259–70. Available online: https://www.semanticscholar.org/paper/Classifying-credit-ratings-for-Asian-banks-using-Chen/b0aee646d5f687709851682df361f1e9b3cbd3fa (accessed on 18 November 2021). [CrossRef]
  11. Cheng, Simon, and J. Scott Long. 2006. Testing for IIA in the Multinomial Logit Model. Transportation Research Part B: Methodological 35: 583–600. [Google Scholar] [CrossRef]
  12. Cortes, Carlos, and Vladimir Vapnik. 1995. Support-vector networks. Machine Learning 20: 273–97. Available online: https://mlab.cb.k.u-tokyo.ac.jp/~moris/lecture/cb-mining/4-svm.pdf (accessed on 18 November 2021). [CrossRef]
  13. Duffie, Darrell, and Kenneth J. Singleton. 1999. Modeling Term Structures of Defaultable Bonds. Review of Financial Studies 12: 687–720. Available online: https://academic.oup.com/rfs/article-abstract/12/4/687/1578719?redirectedFrom=fulltext (accessed on 8 July 2021). [CrossRef] [Green Version]
  14. Đurović, Andrija. 2019. Macroeconomic Approach to Point in Time Probability of Default Modeling—IFRS 9 Challenges. Journal of Central Banking Theory and Practice 1: 209–23. Available online: https://www.econstor.eu/bitstream/10419/217671/1/jcbtp-2019-0010.pdf (accessed on 19 September 2020).
  15. Dwyer, Doug W. Lass, Ahmet E. Kogacil, and Roger M. Stein. 2004. Moody’s KMV RiskCalcTM v2.1 Model. Moody’s Analytics. Available online: https://www.moodys.com/sites/products/productattachments/riskcalc%202.1%20whitepaper.pdf (accessed on 5 September 2019).
  16. Fry, Tim R., and Mark N. Harris. 1996. A Monte Carlo Study of Tests for the Independence of Irrelevant Alternatives Property. Transportation Research Part B: Methodological 31: 19–32. [Google Scholar] [CrossRef] [Green Version]
  17. Hamilton, David T., Zhao Sun, and Ding Min. 2011. Through-the-Cycle EDF Credit Measures. White Paper, Moody’s Analytics, August. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=1921419 (accessed on 15 August 2020).
  18. Hausman, Jerry A., and Daniel McFadden. 1984. Specification Tests for the Multinomial Logit Model. Econometrica 52: 1219–40. Available online: https://ageconsearch.umn.edu/record/267431/files/monash-171.pdf (accessed on 9 September 2020). [CrossRef] [Green Version]
  19. Hodrick, Robert J., and Edward C. Prescott. 1997. Postwar U.S. Business Cycles: An Empirical Investigation. Journal of Money, Credit and Banking 29: 1–16. Available online: http://27.115.42.149/bbcswebdav/institution/%E7%BB%8F%E6%B5%8E%E5%AD%A6%E9%99%A2/teacherweb/2005000016/AdvancedMacro/Hodrick_Prescott.pdf (accessed on 29 June 2020). [CrossRef]
  20. Jacobs, Michael, Jr. 2020. A Holistic Model Validation Framework for Current Expected Credit Loss (CECL) Model Development and Implementation. The International Journal of Financial Studies 8: 27. Available online: http://michaeljacobsjr.com/files/Jacobs_2020_HolMdlValFrmwrkCECL-MdlDev_Impl_IFFS_vol8no27_pp1-36.pdf (accessed on 12 May 2019). [CrossRef]
  21. Jacobs, Michael, Jr., Ahmet K. Karagozoglu, and Frank J. Sensenbrenner. 2015. Stress Testing and Model Validation: Application of the Bayesian Approach to a Credit Risk Portfolio. The Journal of Risk Model Validation 9: 41–70. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2684227 (accessed on 28 May 2019). [CrossRef]
  22. Jarrow, Robert A., and Stuart M. Turnbull. 1995. Pricing Derivatives on Financial Securities Subject to Credit Risk. Journal of Finance 50: 53–85. Available online: https://www.jstor.org/stable/2329239?seq=1 (accessed on 11 February 2020).
  23. Jiang, Yixiao. 2021. Semiparametric Estimation of a Corporate Bond Rating Model. Econometrics 9: 23. Available online: https://doi.org/10.3390/econometrics9020023 (accessed on 24 May 2021). [CrossRef]
  24. Khandani, Amir E., Adlar J. Kim, and Andrew W. Lo. 2010. Consumer credit-risk models via machine-learning algorithms. Journal of Banking & Finance 34: 2767–87. Available online: https://econpapers.repec.org/article/eeejbfina/v_3a34_3ay_3a2010_3ai_3a11_3ap_3a2767-2787.htm (accessed on 18 November 2021).
  25. Khashman, Adnan. 2010. Neural networks for credit risk evaluation: Investigation of different neural models and learning schemes. Expert Systems with Applications 37: 6233–39. Available online: https://www.academia.edu/829842/Neural_networks_for_credit_risk_evaluation_Investigation_of_different_neural_models_and_learning_schemes (accessed on 18 November 2021). [CrossRef]
  26. Kiff, John, Michael Kisser, and Liliana Schumacher. 2004. Rating Through-the-Cycle: What does the Concept Imply for Rating Stability and Accuracy. Working Paper, International Monetary Fund, WP/13/64. Available online: https://www.imf.org/external/pubs/ft/wp/2013/wp1364.pdf (accessed on 18 November 2021).
  27. Kim, Kee S. 2005. Predicting bond ratings using publicly available information. Expert Systems with Applications 29: 75–81. Available online: https://www.sciencedirect.com/science/article/abs/pii/S0957417405000084 (accessed on 18 November 2021). [CrossRef]
  28. Kim, Hong Sik, and So Yong Sohn. 2010. Support vector machines for default prediction of smes based on technology credit. European Journal of Operational Research 201: 838–46. Available online: https://www.sciencedirect.com/science/article/abs/pii/S037722170900215X (accessed on 8 August 2020). [CrossRef]
  29. Li, Chun, and Bryan E. Shepherd. 2012. A New Residual for Ordinal Outcomes. Biometrica 99: 473–80. [Google Scholar] [CrossRef] [Green Version]
  30. Liu, Dungang, and Heping Zhang. 2017. Residuals and Diagnostics for Ordinal Regression Models: A Surrogate Approach. Journal of the American Statistical Association 113: 845–54. [Google Scholar] [CrossRef] [PubMed]
  31. Loeffler, Gunter. 2004. An Anatomy of Rating through the Cycle. Journal of Banking and Finance 28: 695–720. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=275842 (accessed on 27 September 2020). [CrossRef]
  32. Loeffler, Gunter. 2013. Can Rating Agencies Look Through the Cycle? Review of Quantitative Finance and Accounting 40: 623–46. Available online: https://link.springer.com/article/10.1007/s11156-012-0289-9 (accessed on 1 September 2020). [CrossRef]
  33. Merton, Robert C. 1974. On the Pricing of Corporate Debt: The Risk Structure of Interest Rates. Journal of Finance 29: 449–70. Available online: https://onlinelibrary.wiley.com/doi/epdf/10.1111/j.1540-6261.1974.tb03058.x (accessed on 10 October 2021).
  34. Mester, Loreta J. 1997. What’s the Point of Credit Scoring? Federal Reserve Bank of Philadelphia Business Review, September/October. pp. 3–16. Available online: https://fraser.stlouisfed.org/files/docs/historical/frbphi/businessreview/frbphil_rev_199709.pdf (accessed on 10 October 2020).
  35. Pacelli, Vincenzo, and Michele Azzollini. 2011. An artificial neural network approach for credit risk management. Journal of Intelligent Learning Systems and Applications 3: 103. Available online: https://www.researchgate.net/publication/220062573_An_Artificial_Neural_Network_Approach_for_Credit_Risk_Management (accessed on 27 March 2021). [CrossRef] [Green Version]
  36. Peng, Yi, Guoxun Wang, Guoxun Kou, and Yong Shi. 2011. An empirical study of classification algorithm evaluation for financial risk prediction. Applied Soft Computing 11: 2906–15. Available online: https://www.sciencedirect.com/science/article/abs/pii/S1568494610003054 (accessed on 23 January 2021). [CrossRef]
  37. Petrov, Alexander, and Mark Rubtsov. 2016. A Point-in-Time–Through-the-Cycle Approach to Rating Assignment and Probability of Default Calibration. Journal of Risk Model Validation 10: 83–112. Available online: https://www.risk.net/journal-of-risk-model-validation/technical-paper/2460734/a-point-in-time-through-the-cycle-approach-to-rating-assignment-and-probability-of-default-calibration (accessed on 7 July 2021).
  38. Repullo, Rafael, Jesus Saurina, and Carlos Trucharte. 2010. Mitigating the Procyclicality of Basel II. Economic Policy 64: 659–702. Available online: https://econpapers.repec.org/paper/bdewpaper/1028.htm (accessed on 4 April 2019). [CrossRef]
  39. Small, Kenneth A., and Cheng Hsiao. 1985. Multinomial Logit Specification Tests. International Economic Review 26: 619–27. Available online: https://econpapers.repec.org/article/ieriecrev/v_3a26_3ay_3a1985_3ai_3a3_3ap_3a619-27.htm (accessed on 20 August 2019). [CrossRef]
  40. The Bank for International Settlements—Basel Committee on Banking Supervision (BIS). 2005. Studies on the Validation of Internal Rating Systems. Working Paper 14. Basel: The Bank for International Settlements—Basel Committee on Banking Supervision, Available online: https://www.bis.org/publ/bcbs_wp14.htm (accessed on 24 August 2021).
  41. The Bank for International Settlements—Basel Committee on Banking Supervision (BIS). 2006. International Convergence of Capital Measurement and Capital Standards: A Revised Framework. Basel: The Bank for International Settlements—Basel Committee on Banking Supervision, Available online: https://www.bis.org/publ/bcbsca.htm (accessed on 13 May 2021).
  42. The Bank for International Settlements—Basel Committee on Banking Supervision (BIS). 2011. Basel III: A Global Regulatory Framework for More Resilient Banks and Banking Systems. Basel: The Bank for International Settlements—Basel Committee on Banking Supervision, Available online: https://www.bis.org/publ/bcbs189.htm (accessed on 4 April 2020).
  43. The Bank for International Settlements—Basel Committee on Banking Supervision (BIS). 2016. Reducing Variation in Credit Risk-Weighted Assets—Constraints on the Use of Internal Model Approaches. Consultative Document. Basel: The Bank for International Settlements—Basel Committee on Banking Supervision, Available online: http://www.bis.org/bcbs/publ/d362.htm (accessed on 11 November 2020).
  44. The European Banking Authority. 2016. Guidelines on PD Estimation, LGD Estimation and the Treatment of Defaulted Exposures. Consultation Paper. Paris: The European Banking Authority, Available online: https://www.eba.europa.eu/regulation-and-policy/model-validation/guidelines-on-pd-lgd-estimation-and-treatment-of-defaulted-assets (accessed on 6 July 2021).
  45. Topp, Rebekka, and Robert Perl. 2010. Through-the-Cycle Ratings Versus Point-in-Time Ratings and Implications of the Mapping between both Rating Types. Financial Markets, Institutions & Instruments 19: 47–61. Available online: https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1468-0416.2009.00154.x (accessed on 18 March 2020).
  46. Vapnik, Vladimir. 2013. The Nature of Statistical Learning Theory. Berlin/Heidelberg: Springer Science & Business Media, Available online: https://link.springer.com/book/10.1007/978-1-4757-2440-0 (accessed on 6 May 2019).
  47. Veronezi, Pedro H. 2016. Corporate Credit Rating Prediction Using Machine Learning Techniques. Master’s thesis, Stevens Institute of Technology, Hoboken, NJ, USA. [Google Scholar]
  48. Xiao, Hongshan, Zhi Xiao, and Yu Wang. 2016. Ensemble classification based on supervised clustering for credit scoring. Applied Soft Computing 43: 73–86. Available online: https://www.researchgate.net/publication/297586887_Ensemble_classification_based_on_supervised_clustering_for_credit_scoring (accessed on 10 February 2021). [CrossRef]
  49. Yeh, Ching-Chiang, Fengyi Lin, and Chih Yu Hsu. 2012. A hybrid KMV model, random forests and rough set theory approach for credit rating. Knowledge-Based Systems 33: 166–72. Available online: https://isiarticles.com/bundles/Article/pre/pdf/48503.pdf (accessed on 11 September 2021). [CrossRef]
  50. Yu, Lean, Shouyang Wang, and Kin Keung Lai. 2008. Credit risk assessment with a multistage neural network ensemble learning approach. Expert Systems with Applications 34: 1434–44. Available online: https://www.researchgate.net/publication/222530702_Credit_risk_assessment_with_a_multistage_neural_network_ensemble_learning_approach (accessed on 3 January 2021). [CrossRef]
Figure 1. Large Corporate Modeling Data—One- and Three-Year Horizon Default Rates over Time (1991–2015).
Figure 1. Large Corporate Modeling Data—One- and Three-Year Horizon Default Rates over Time (1991–2015).
Ijfs 09 00063 g001
Table 1. Large Corporate Modeling Data—GICS Industry Segment Composition for All Moody’s Obligors vs. Defaulted Moody’s Obligors (1991–2015).
Table 1. Large Corporate Modeling Data—GICS Industry Segment Composition for All Moody’s Obligors vs. Defaulted Moody’s Obligors (1991–2015).
GICS Industry SegmentAll Moody’s ObligorsDefaulted Moody’s Obligors
Consumer Discretionary19.6%30.9%
Consumer Staples8.4%6.4%
Energy7.6%5.9%
Healthcare Equipment and Services2.9%2.9%
Industrials31.6%15.1%
Materials10.5%11.3%
Pharmaceuticals and Biotechnology2.7%0.2%
Software and IT Services2.5%1.8%
Technology Hardware and Communications4.3%11.3%
Utilities7.6%5.6%
Table 2. Large Corporate Modeling Data—NAICS Industry Segment Composition for All Moody’s Obligors vs. Defaulted Moody’s Obligors (1991–2015).
Table 2. Large Corporate Modeling Data—NAICS Industry Segment Composition for All Moody’s Obligors vs. Defaulted Moody’s Obligors (1991–2015).
NAICS Industry SegmentAll Moody’s ObligorsDefaulted Moody’s Obligors
Agriculture, Forestry, Hunting and Fishing0.2%0.4%
Accommodation and Food Services2.3%2.9%
Waste Management % Remediation Services2.4%2.1%
Arts, Entertainment and Recreation0.7%1.0%
Construction1.7%2.5%
Educational Services0.1%0.2%
Healthcare and Social Assistance1.6%1.6%
Information Services11.5%12.1%
Management Compensation Enterprises0.1%0.1%
Manufacturing37.7%34.4%
Mining, Oil and Gas6.8%8.6%
Other Services (e.g., Public Administration)0.4%0.6%
Professional, Scientific and Technological Services2.3%2.5%
Real Estate, Rentals and Leasing0.9%1.6%
Retail Trade9.6%12.4%
Transportation and Warehousing5.4%7.0%
Utilities8.3%5.4
Wholesale Trade7.0%2.7
Table 3. Summary Statistics—Moody’s Large Corporate Financial and Macroeconomic Explanatory Variables and Default Indicators: 1-Year PIT Model 1.
Table 3. Summary Statistics—Moody’s Large Corporate Financial and Macroeconomic Explanatory Variables and Default Indicators: 1-Year PIT Model 1.
VariableCountMeanStandard DeviationMinimum 25th PercentileMedian75th PercentileMaximum
Default Indicator157,3530.010.100.000.000.000.001.00
Change in Total Assets0.140.35−0.40−0.010.060.173.21
Total Liabilities to Total Assets0.600.230.120.450.590.711.53
Cash Use Ratio1.902.84−22.431.412.062.6519.00
Net Accounts Receivables Days130.25101.4411.2668.98106.74159.43754.09
Net Quick Ratio0.341.07−0.85−0.280.060.596.11
Before Tax Profit Margin5.9421.00−146.671.857.0912.8548.70
Moody’s Equity Price Index1.916.09−27.33−0.192.195.6812.81
Consumer Confidence Index2.3421.58−60.97−7.024.8915.3573.21
Table 4. Summary Statistics—Moody’s Large Corporate Financial, Macroeconomic, Merton/Structural Model Distance-to-Default Proxy Measure Explanatory Variables and Default Indicators: 1-Year PIT Model 2.
Table 4. Summary Statistics—Moody’s Large Corporate Financial, Macroeconomic, Merton/Structural Model Distance-to-Default Proxy Measure Explanatory Variables and Default Indicators: 1-Year PIT Model 2.
VariableCountMeanStandard DeviationMinimum 25th PercentileMedian75th PercentileMaximum
Default Indicator160,0020.010.100.000.000.000.001.00
Change in Total Assets0.140.35−0.40−0.010.060.173.21
Total Liabilities to Total Assets0.600.230.120.450.600.711.53
Cash Use Ratio1.902.83−22.431.402.062.6419.00
Net Quick Ratio0.341.06−0.85−0.280.060.596.11
Before Tax Profit Margin5.9820.93−146.671.867.1012.8848.70
Moody’s Equity Price Index1.936.08−27.33−0.192.195.6812.81
Consumer Confidence Index2.3721.56−60.97−7.024.8915.3573.21
Distance-to-Default0.200.43−.1.320.020.070.185.26
Table 5. Summary Statistics—Moody’s Large Corporate Financial and Explanatory Variables and Default Indicators: 3-Year TTC Model 1.
Table 5. Summary Statistics—Moody’s Large Corporate Financial and Explanatory Variables and Default Indicators: 3-Year TTC Model 1.
VariableCountMeanStandard DeviationMinimum 25th PercentileMedian75th PercentileMaximum
Default Indicator150,0640.030.170.00.00.00.01.0
Total Liabilities3640.656741.938.86422.601170.453374.1241,852.00
Total Liabilities to Total Assets0.620.220.120.490.610.721.53
Debt Service Ratio16.4452.82−25.071.744.099.80409.64
Net Quick Ratio0.240.93−0.85−0.300.020.476.11
Before Tax Profit Margin5.5021.08−146.671.576.7212.4048.70
Table 6. Summary Statistics—Moody’s Large Corporate Financial and Merton/Structural Model Distance-to-Default Proxy Measure Explanatory Variables and Default Indicators: 3-Year TTC Model 2.
Table 6. Summary Statistics—Moody’s Large Corporate Financial and Merton/Structural Model Distance-to-Default Proxy Measure Explanatory Variables and Default Indicators: 3-Year TTC Model 2.
VariableCountMeanStandard DeviationMinimum 25th PercentileMedian75th PercentileMaximum
Default Indicator150,0640.030.170.00.00.00.01.0
Total Liabilities3640.656741.938.86422.601170.453374.1241,852.00
Total Liabilities to Total Assets0.620.220.120.490.610.721.53
Debt Service Ratio16.4452.82−25.071.744.099.80409.64
Net Quick Ratio0.240.93−0.85−0.300.020.476.11
Before Tax Profit Margin5.5021.08−146.671.576.7212.4048.70
Distance-to-Default0.200.42−1.320.020.070.285.26
Table 7. Moody’s Large Corporate Financial and Macroeconomic Explanatory Variables Areas Under the Receiver Operating Characteristic Curve (AUC) and Missing Rates for 1-Year Default Horizon PIT and 3-Year Default Horizon TTC Default Indicators.
Table 7. Moody’s Large Corporate Financial and Macroeconomic Explanatory Variables Areas Under the Receiver Operating Characteristic Curve (AUC) and Missing Rates for 1-Year Default Horizon PIT and 3-Year Default Horizon TTC Default Indicators.
PIT 1-Year Default HorizonTTC 3-Year Default Horizon
CategoryExplanatory VariablesAUCMissing RateAUCMissing Rate
SizeChange in Total Assets0.7268.52%
Total Liabilities 0.5824.64%
LeverageTotal Liabilities to Total Assets Ratio0.8434.65%0.7834.65%
CoverageCash Use Ratio0.7887.94%
Debt Service Coverage Ratio 0.79617.0%
EfficiencyNet Accounts Receivables Days Ratio0.6158.17%
LiquidityNet Quick Ratio0.6537.71%0.6177.17%
ProfitabilityBefore Tax Profit Margin0.8272.40%0.7682.40%
MacroeconomicMoody’s 500 Equity Price Index Quarterly Average Annual Change0.6030.00%
Consumer Confidence Index Annual Change0.6070.00%
Merton StructuralDistance-to-Default0.7304.65%0.6694.65%
Table 8. Logistic Regression Estimation Results—Moody’s Large Corporate Financial and Macroeconomic Explanatory Variables 1-Year Default Horizon PIT Reduced Form Model 1.
Table 8. Logistic Regression Estimation Results—Moody’s Large Corporate Financial and Macroeconomic Explanatory Variables 1-Year Default Horizon PIT Reduced Form Model 1.
Explanatory VariableParameter Estimatep-ValueFactor WeightAICAUCHL p-ValueMobility Index
Change in Total Assets−0.48370.00000.04557231.000.88940.59450.7184
Total Liabilities to Total Assets2.61700.01040.1091
Cash Use Ratio−0.04280.00000.1545
Net Accounts Receivables Days Ratio0.00050.00000.2273
Net Quick Ratio−0.46730.00000.0909
Before Tax Profit Margin−0.01610.00000.2736
Moody’s Equity Index Price Index Quarterly Average−0.01890.00000.0759
Consumer Confidence Index Year-on-Year Change−0.00990.00000.0232
Table 9. Logistic Regression Estimation Results—Moody’s Large Corporate Financial, Macroeconomic and Distance-to-Default Explanatory Variables 1-Year Default Horizon PIT Hybrid Reduced Form/Structural-Merton Model 2.
Table 9. Logistic Regression Estimation Results—Moody’s Large Corporate Financial, Macroeconomic and Distance-to-Default Explanatory Variables 1-Year Default Horizon PIT Hybrid Reduced Form/Structural-Merton Model 2.
Explanatory VariableParameter Estimatep-ValueFactor WeightAICAUCHL p-ValueMobility Index
Change in Total Assets−0.46640.00000.04857290.000.88950.57820.7617
Total Liabilities to Total Assets2.53850.00000.1165
Cash Use Ratio−0.04280.00000.1650
Net Quick Ratio−0.01690.00000.0971
Before Tax Profit Margin−0.01690.00000.2913
Moody’s Equity Index Price Index Quarterly Average−0.01860.00000.0801
Consumer Confidence Index Year-on-Year Change−0.01000.00000.0267
Distance to Default−0.19130.00520.1748
Table 10. Logistic Regression Estimation Results—Moody’s Large Corporate Financial and Macroeconomic Explanatory Variables 3-Year Default Horizon TTC Reduced Form Model 1.
Table 10. Logistic Regression Estimation Results—Moody’s Large Corporate Financial and Macroeconomic Explanatory Variables 3-Year Default Horizon TTC Reduced Form Model 1.
Explanatory VariableParameter Estimatep-ValueFactor WeightAICAUCHL p-ValueMobility Index
Value of Total Liabilities−6.97 × 10−60.00000.177317,751.000.82320.00390.3295
Total Liabilities to Total Assets2.02390.00300.3133
Debt Service Coverage Ratio−0.04310.00000.2332
Net Quick Ratio−0.24120.00000.1372
Before Tax Profit Margin−0.01290.00000.1390
Table 11. Logistic Regression Estimation Results—Moody’s Large Corporate Financial, Macroeconomic and Distance-to-Default Explanatory Variables 3-Year Default Horizon TTC Hybrid Reduced Form/Structural-Merton Model 2.
Table 11. Logistic Regression Estimation Results—Moody’s Large Corporate Financial, Macroeconomic and Distance-to-Default Explanatory Variables 3-Year Default Horizon TTC Hybrid Reduced Form/Structural-Merton Model 2.
Explanatory VariableParameter Estimatep-ValueFactor WeightAICAUCHL p-ValueDeviance/Degrees of FreedomPseudo R-SquaredMobility Index
Total Liabilities to Total Assets2.95800.00000.370711,834.000.82260.09730.23650.14910.3539
Debt Service Coverage Ratio−0.04280.00000.2917
Net Quick Ratio−0.24030.00000.0808
Before Tax Profit Margin−0.01290.00000.0902
Distance to Default−0.15410.00000.1666
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Jacobs, M., Jr. Validation of Corporate Probability of Default Models Considering Alternative Use Cases. Int. J. Financial Stud. 2021, 9, 63. https://doi.org/10.3390/ijfs9040063

AMA Style

Jacobs M Jr. Validation of Corporate Probability of Default Models Considering Alternative Use Cases. International Journal of Financial Studies. 2021; 9(4):63. https://doi.org/10.3390/ijfs9040063

Chicago/Turabian Style

Jacobs, Michael, Jr. 2021. "Validation of Corporate Probability of Default Models Considering Alternative Use Cases" International Journal of Financial Studies 9, no. 4: 63. https://doi.org/10.3390/ijfs9040063

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop