Next Article in Journal
A Parameterized Modeling Method for Magnetic Circuits of Adjustable Permanent Magnet Couplers
Previous Article in Journal
A Model-Free Control Scheme for Rehabilitation Robots: Integrating Real-Time Observations with a Deep Neural Network for Enhanced Control and Reliability
Previous Article in Special Issue
Percolation Problems on N-Ary Trees
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Generalized Log Gamma Approach: Theoretical Contributions and an Application to Companies’ Life Expectancy

by
José H. Dias Gonçalves
1,
João J. Ferreira Gomes
2,
Lihki Rubio
3,* and
Filipe R. Ramos
2
1
Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisboa, Portugal
2
Centro de Estatística e Aplicações (CEAUL), Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisboa, Portugal
3
Department of Mathematics and Statistics, Universidad del Norte, Atlántico, Barranquilla 081007, Colombia
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(23), 4792; https://doi.org/10.3390/math11234792
Submission received: 20 July 2023 / Revised: 25 August 2023 / Accepted: 3 September 2023 / Published: 27 November 2023
(This article belongs to the Special Issue Probability Distributions and Their Applications)

Abstract

:
The survival of a company has been a topic of growing interest in the scientific community. Measuring the life expectancy of Portuguese telecommunications companies using generalized log-gamma (GLG) distribution is a new research endeavor. Regarding the new theoretical contributions, original expressions for the moments and mode of the GLG distribution are presented. In this empirical study, data on the entrepreneurial fabric in the Information and Communication sector from 2004 to 2018, when some companies were born or died, were used. In addition to the GLG, three other statistical distributions with two parameters are analyzed: gamma, Weibull, and log-normal. Maximum likelihood parameters and confidence intervals for survival probabilities are estimated and compared. The Akaike information criterion is used to compare the performance of the four estimated models. The results show that GLG distribution is a promising solution to assess the resilience and longevity of a firm.

1. Introduction

In an increasingly global and competitive economic environment, the creation of a company faces major challenges. Their resilience and survival, especially in the early stages, are under constant scrutiny. Many do not hold up and fail. Analyzing the specific reasons companies fail, general trends, or even the macroeconomic conditions affecting business failure have been long-standing topics in the scientific literature (e.g., [1,2,3,4]). The attempt to model and understand real phenomena, such as the one presented, is a timeless and undeniably useful topic that involves articulating statistical, mathematical, and computational methods and procedures. Some studies reported in the scientific literature related to this topic are presented below.
In the Portuguese context, some previous works have focused on studying the survival of new firms [5,6,7]. On the other hand, the survival of new firms based on certain conditions of entry and permanence in the market is addressed by [6] and developed by the same authors in a later work [7]. In addition, [8] proposes an empirical model to understand the impact that the founding conditions of a firm have on its life expectancy; [9] suggests a mortality table for domestic firms. On an international scale, [10] analyzes over 25,000 North American firms from 1950 to 2009 and concludes that the life expectancy of a firm does not depend on its age. The search for company longevity is approached in [11] from a business strategy perspective, in which key indicators that promote the growth of a company’s longevity are presented. According to the S&P 500 index, the average life of companies in 2020 was just over 21 years, whereas in 1965, it was about 32 years [12]. Since this index includes the 500 largest companies and represents almost 80% of the available market capitalization, there is a natural concern about the life span of smaller companies. It is reasonable to assume that fewer companies will survive than their market-leading counterparts. In essence, most studies establish a positive correlation between company size and life expectancy and emphasize industry size as a factor in the health of the organizations within it. Thus, considering the impacts it may have on the economic life of the business fabric or even of a country, it is necessary to understand the lifetime of a set of companies. However, it is important to identify robust and accurate models (e.g., fitted probability distributions) that assist in the modeling process, providing support for compression and forecasting. When studying firm survival, it is crucial to obtain density and survival functions for the whole lifetime of a company. Decision makers can use these tools to assess sectoral resilience and make comparisons before, for example, making a long-term investment decision that may affect their portfolio.
From the pioneering works of [13,14,15] and the analysis of other theoretical works on generalized log-gamma (GLG) [16,17,18], some features are easy to notice. Moreover, approximate conditional inference techniques on the distributions of location and scale parameters, based on normal approximations to the signed square root distributions of likelihood ratio statistics, are examined by [19]. By leveraging these methods, it becomes possible to derive approximate inferences for both the quantiles and the scale parameter of the generalized log-gamma distribution. However, [20] proposed robust estimators for the generalized log-gamma distribution using a (weighted) Q τ estimator, which minimizes a scale τ that measures differences between the empirical and theoretical quantiles. Unfortunately, this estimator does not follow a normal distribution asymptotically but is a practical starting point for a one-step weighted likelihood estimator. As mentioned in [21], the GLG and other particular distributions mentioned in the scientific literature seem applicable in analyzing lifetime and reliability data. This applicability becomes even more relevant when considering the sectors of activity with a business fabric characterized by a strong heterogeneity regarding service life. More specifically, a small number of companies have a very long history of activity (dominating the sector) and a constant emergence of small and medium-sized companies not surviving in the market, failing soon after starting their activity.
Many applications of the generalized log-gamma distribution can be found in the literature, for example, the Coale–McNeil nuptiality model studied by [22]. For example, the author proposes an approach to developing country-specific standard schedules and illustrates its usefulness in a regression analysis. An application to fertility projection is proposed by modeling the fertility calendar by birth order. Moreover, [23] modified the generalized log-gamma regression model to consider the presence of long-term survivors. The authors of [24] analyzed a real data set from the medical domain using a four-parameter extension of the generalized gamma distribution capable of modeling a bathtub-shaped hazard rate function and examined other commonly used models such as log-exponential, log-Weibull, and log-normal regression with cure rates, determining that a generalized log-gamma regression model that incorporates a cure rate is best. The beauty and importance of this distribution lie in its ability to model monotonic and non-monotonic failure rate functions, which are quite common in lifetime and reliability data analysis. The authors of [25] used the generalized Gamma distribution to model traumatic brain injury data, considering different stages of hospitalization related to the survival rates of a limited number of patients after a traffic accident. We propose different estimation methods to obtain improved estimators of the model parameters, which can be recommended for use in small samples. The authors of [26] analyzed the potential of the inverse generalized gamma (IGG) distribution for modeling reliability data with an upside-down bathtub hazard rate function using real environmental data. In this study, Bayesian inferences are presented for the parameters of the IGG distribution using non-informative priors, such as the Jeffreys prior and the reference prior. The authors of [27] analyzed the estimation of the generalized gamma distribution parameters from left-truncated and right-censored data. A stochastic version of the expectation maximization (EM) algorithm is proposed as an alternative method for computing approximate maximum likelihood estimations (MLE). The proposed stochastic EM algorithm is demonstrated to be a useful alternative estimation method for model fitting of the generalized gamma distribution. The authors of [28] studied cell phone ages, considering that they follow a gamma distribution under progressive first-fault (PFF) censoring. The data set of radio transceivers was analyzed as an application. For this distribution, these researchers estimated all the unknown parameters and the Shannon and Rényi entropies using the maximum likelihood (ML) method. The asymptotic confidence intervals of the ML estimators of the target parameters were produced using the normal approximation to ML and log-transformed ML. The authors of [29] proposed the asymmetric LGN distribution as a flexible option for the error term in linear regression models. The maximum likelihood method was implemented to estimate the parameters of the LGN model, and the Fisher information matrix was derived, showing that it is non-singular. Illustrations with real data show that the proposed model can be an effective alternative to existing models, such as normal, normal-power, normal Tobit, and normal-power Tobit.
Furthermore, in the context of the distributions of interest to this research, [30] proposed a four-parameter extended generalized gamma model for modeling lifetime data. According to the authors, the proposed heteroscedastic regression model can be used more effectively in the analysis of survival data since it includes several widely known regression models as special models. Various simulations are performed for different parameter settings, sample sizes, and censoring percentages. The authors of [31] defined and studied a four-parameter model called the generalized odd log-logistic flexible Weibull distribution, providing an extensive study of the quantile function. With an application to real engineering data sets, these authors used the maximum likelihood method for estimating the model parameters and performed various simulations for different parameter settings, sample sizes, and censoring percentages.
Following a methodology based on probability distributions, in [32], a family of generalized gamma distributions, T -gamma family, has been proposed using the T R { Y } framework. The family of distributions is generated using the quantile functions of uniform, exponential, log-logistic, logistic, and extreme value distributions. Based on the theoretical developments proposed by the authors, four data sets with various shapes are fitted using members of the T -gamma family of distributions. Also, an interesting methodological approach is proposed using probability distributions [33]. In this study, a new lifetime distribution is proposed to model the lifetime of the series system, which incorporates most of the composite lifetime distribution. Several mathematical properties obtained for this distribution included the moment-generating function, moments, and order statistics. According to the authors, the proposed distribution has a flexible density function, and the proposed family offers a better fit based on tests on a real data set (the unknown parameters of the proposed generalized family were estimated using the MLE technique) from statistical analysis. Using the distribution employed by [34] via a lifetime application, [35] proposed a new model called the odd log-logistic generalized half-normal distribution for describing fatigue lifetime data, discussing the method of maximum likelihood to fit the model parameters. For different parameter settings and sample sizes, some simulation studies compare the performance of the new lifetime model. The authors of [36] introduced a new generator of probability distribution, the adjusted log-logistic generalized (ALLoG) distribution, and a new extension of the standard one-parameter exponential distribution called the adjusted log-logistic generalized exponential (ALLoGExp) distribution. Using the MLE method to estimate the model parameters, the importance and flexibility of the ALLoGExp distribution were demonstrated with a real and uncensored lifetime data set, and its fit was compared with five other exponential-related distributions.
On this basis, it is considered beneficial to develop research focusing on GLG distribution and its applications. Thus, according to the state of the art described in the previous paragraphs, this work proposes maximum likelihood estimates for the generalized log gamma model parameters and the confidence intervals for survival probabilities. In addition, expressions for the moments and the GLG distribution mode are presented. As an original application, the survival problem of telecommunications companies is analyzed, which, at the time of presenting this paper, had not been addressed using the GLG model and the techniques proposed in this research. From the set of data obtained (despite the limitations that will be discussed at the end of this study), modeling the life expectancy of companies (following a methodology based on probability distributions supported by the scientific literature) constitutes the objective of the empirical part of this research.
Having described multiple applications of the generalized log-gamma distribution, the parameter estimation techniques found in the literature, and the main objective of this present work, each section will be divided as follows: Section 1 provides an overview of the literature related to the application problem proposed in this work, as well as the objectives, research questions, and main contributions of this investigation. Section 2 reviews the GLG distribution: (i) a brief original study on the random variable X ~ G L G ( μ , σ , q ) is presented; (ii) GLG is presented as a general case of some other known statistical distributions; (iii) some original theoretical contributions on GLG are presented. Next, Section 3 presents the data to be used in this study and some considerations on the methodological procedures, where the estimation methodology focused on the maximum likelihood method in the presence of censored data are discussed. Section 4 presents the results of applying the proposed models to a real-world dataset. Finally, Section 5 concludes with a discussion of the results, conclusions, and references to the limitations and future work.

2. Generalized Log-Gamma Distribution: Theoretical Framework and Complements

2.1. Generalized Log-Gamma Density Function

According to [15], the GLG density function is introduced in Definition 1, with parameters μ , σ , and q . The gamma function (denoted as Γ) is used with the argument q 2 , where μ and σ represent the mean and standard deviation, respectively.
Definition 1.
For all  x > 0 , σ > 0 ,   μ R   and  w = x μ ) / σ , let GLG be the density function with parameters μ σ   and  q , defined by
f x | μ , σ , q = q q 2 q 2 x σ   Γ q 2 exp q 2 q w exp q w ; q 0 1 2 1 2 x σ π exp ln x μ 2 2 σ 2   ;                                           q = 0
Note that if a random variable U has a Gamma distribution with shape parameter q 2   ( q 0 ) and scale parameter 1, U ~ G a m m a q 2 , 1 , then,
X = exp μ + σ ln q 2 U q
follows a GLG distribution, with parameters μ , σ , q , and the X support Ω X , for X ~ G L G μ , σ , q , is giving by Ω X = R + . Indeed, suppose that the variable U assumes a particular value u a > 0 . In this case, X will take a particular value x a = e x p μ + σ ln q 2 u a / q , and it can be observed that
lim u a 0 + x a       a n d   lim u a + x a
depend on whether the parameter q is positive or negative. Due to the above, for q 0 , it follows that
lim u a 0 + x a = lim u a 0 + exp μ + σ ln q 2 u a q = 0 +             q > 0   +       q < 0
using properties of logarithms and limits,
lim u a + x a = e x p μ + 2 σ l n q q l i m u a + e x p σ l n u a q = +       q > 0   0 +           q < 0
In addition, the function x a = e x p μ + σ ln q 2 u a / q is continuous for all u a > 0 .
  • Case 1: q > 0 , x a is increasing with u a , and assume all values from lim u a 0 + x a = 0 + to lim u a + x a = +
  • Case 2: q < 0 , x a decreases with u a , and assume all values from lim u a 0 + x a = + to lim u a + x a = 0 +
Both results allow us to conclude that x a takes any positive value, so the support of X is given by Ω X = R + .

2.2. Particular Cases of Generalized Log Gamma Distribution

In this section, four special cases of the GLG distribution are discussed. First, it is shown that the generalized gamma distribution corresponds to a reparameterization of GLG for positive values of q [15]. Then, two brief subsections are presented, where gamma and Weibull are written according to GLG. Finally, a log-normal distribution is referred to as a GLG special case when q 0 .

2.2.1. Generalized Gamma Distribution

The generalized gamma distribution was introduced by [13] and can be seen as the half-part of GLG for q > 0 . When q < 0 , it is not possible to establish a parametric correspondence between the two distributions. Thus, it can be said that GLG is a generalized gamma extension.
A random variable X follows a generalized gamma distribution, with a positive scale parameter β and two positive shape parameters d and n , if the correspondent density function is given via (4) as follows:
f x | β , d , n = n β d x d 1 exp x β n Γ d n
for positive values of x . With γ = d / n , the density function in (4) can be rewritten as
f x | β , γ , n = n β n γ x n γ 1 exp x β n Γ γ
for positive values of x , β ,   γ and n .
Proposition 1.
Consider  X ~ G L G μ , σ , q . If  q > 0 , then  X  also follows a generalized gamma distribution with scale parameter  β  and shape parameters  γ  and  n . The two sets of parameters are related using
μ = ln β + ln γ / n q = 1 / γ                               σ = 1 / n γ                        
and
β = e x p μ + 2 σ / q ln q γ = 1 / q 2                                                               n = q / σ                                                                   .
Proof. 
Note that the generalized gamma density function can be rewritten as Equation (1). Hence, GLG and generalized gamma sets of parameters can match. Rewriting the density function described in (7) yields the following:
f x | β , γ , n = n β n γ x n γ 1 exp x β n Γ γ = n exp n γ s exp n s x   Γ γ
It is known that s = ln x ln β , which means the natural logarithm of the scale parameter β is already isolated, allowing to achieve an equivalency with GLG density. The next step consists of adding and subtracting ln γ / n to the expression of s . The objective is to wrap the two shape parameters γ and n with the scale parameter β . This is related to the introduction of the GLG density, which incorporates three parameters as arguments for an exponential function. Hence, considering s = ln x ln β and t = ln x ln β + ln γ n ,
f x | β , γ , n = n exp n γ ln x ln β ln γ n + ln γ n exp n ln x ln β ln γ n + ln γ n x   Γ γ = n exp ln γ γ exp n γ t γ   e x p n t x   Γ γ
Finally, it follows that:
f x | β , γ , n = n γ γ exp γ n t e x p n t x   Γ γ = n γ γ x   Γ γ exp γ n t e x p n t
Considering γ = q 2 and
q w = q ln x μ / σ =   n   q / σ ln x ln β + ln γ / n μ
We can conclude that the two sets of parameters are related according to a system of equations solvable for the parameters of GLG:
μ = ln β + ln γ / n q 2 = γ q / σ = n q q 2 q 2 σ   Γ q 2 = n γ γ Γ γ   μ = ln β + ln γ / n q = 1 / γ σ = q / n q n q 2 q 2 q   Γ q 2 = n q 2 q 2 Γ q 2 μ = ln β + ln γ / n q   = 1 / γ σ = 1 / n γ q q = 1
Note that n = q / σ is a sufficient condition to prove q > 0 , due to the parameters n and σ being positive. Consequently, q 2 = 1 / γ is equivalent to q = 1 / γ and q = q and the relationship indicated in the equation system (6) is obtained. To achieve the result described in (7), it is sufficient to solve the equation system (6) for the generalized gamma parameters β , γ and n
μ = ln β + ln γ / n q = 1 / γ                             σ = 1 / n γ                       ln β = μ ln γ / n γ = q 2                                   n γ = 1 / σ                         β = e x p μ + 2 σ / q ln q γ = 1 / q 2                                                             n = q / σ                                                                  
Therefore, the generalized gamma is a GLG special case when q > 0 . □
Proposition 1 allows us to frame the gamma and Weibull distributions as two GLG special cases with a positive q .

2.2.2. Gamma Distribution

As mentioned in [37], the gamma distribution was first used by Laplace in 1836 in Laplace’s theory of errors. Two years later, Bienaymé introduced a Bayesian study of linear shapes in multinomial variables. In 1844, Ellis obtained the gamma distribution as a sum of exponential random variables. The authors of [38] mentioned that the gamma distribution naturally appeared in the theory associated with normal distributed variables. It is known that the gamma distribution is a generalized gamma special case with a unitary shape parameter n . Then, replacing n by 1 in (5), the correspondent density function is obtained as follows:
f x | β , γ ) = 1 β γ x γ 1 exp x β Γ γ
where x > 0 . Scale parameter β and shape parameter γ are both positive. Considering that the gamma distribution is a special case of the generalized gamma with a unitary shape parameter n , after replacing this parameter in the system (6), we can obtain the following:
μ = ln β + ln γ q = 1 / γ                   σ = 1 / γ                

2.2.3. Weibull Distribution

The Weibull distribution was introduced by the Physicist Waloddi Weibull in 1939 for modeling material resistance. At the beginning of the 1950s, this parametric model was directed at a wide range of applications. According to [38], Weibull is notable for the attention he has received in the scientific community since the early 1970s. It can be observed that the Weibull distribution is a special case of generalized gamma with a unit shape parameter γ . Replacing this parameter by 1 in Equation (5), we can obtain the density function as follows:
f x | β , n = n β x β n 1 exp x β n
where x > 0 , β is a positive scale parameter, and n is a positive shape parameter.
Considering that the Weibull distribution is a generalized gamma special case with a unitary shape parameter γ , by replacing this parameter with 1 in the system (6), we can obtain the following:
μ = ln β + ln 1 / n q = 1 / 1                             σ = 1 / n 1                       μ = ln β q = 1           σ = 1 / n

2.2.4. Log-Normal Distribution

The pioneers in log-normal distribution development are referred to in [38]: Galton, e McAlister, Kapteyen, Van Uven, and Gibrat. In the economic sciences, this distribution is sometimes called the Cobb–Douglas distribution and has a particular interest in production function representations.
Log-normal can be introduced as a generalized log-gamma special case. In the study developed on the GLG creation, [15] started to refer to the generalized gamma density function described in Equation (5) and converged to the log-normal density function when the shape parameter γ + . Starting from this premise, the author proved that the log-normal is a GLG special case when q 0 . The theoretical contributions of other authors [39,40,41] were also relevant to achieving this result. So, the density function (1) can be obtained, where x and σ assume only positive values and μ R . When it is necessary to model data with a very asymmetrical distribution on the right, the log-normal distribution is, within two-parameter models, one of the most adequate approaches.

2.3. Theoretical Contributions to the Study of the Generalized Log Gamma Distribution

A review of the literature developed over almost five decades on GLG shows that there is still room for theoretical developments on this parametric model. The statistical distribution proposed by [42] is the starting point for defining a wide family of distributions. However, its four parameters made it difficult to handle and apply in practice.
The generalized gamma distribution was introduced by [13] and consists of eliminating the location parameter defined by [42] while keeping the two shape parameters and the scale parameter. Three years later, ref. [14] advanced the methodology for estimating the parameters of the generalized gamma. Later, ref. [15] suggested the GLG, which consists of reparameterization and an extension of the result proposed by Stacy [13,14]. In fact, GLG allows negative values for the shape parameter q, proving to be more flexible than Stacy’s proposal. The main advantage of Prentice’s contribution is the ease of estimation using the maximum likelihood method.
Since [15], theoretical advances have been made, accompanied by relevant practical applications. In the late 1970s, ref. [43] showed that the extension of generalized gamma is linked to a family of parametric models and described likelihood-based methods to evaluate some special cases. In addition, they presented applications to various data sets, notably in the field of medicine. Inference methods directed at GLG made remarkable advances in 1987 [19]. In a short chapter on models with additional shape parameters, ref. [44] discusses GLG in his book on statistical models and methods for lifetime data. More recently, ref. [20] introduced robust GLG estimators.
Thus, building on pioneering works and other later theoretical contributions (such as some of the papers mentioned above), new theoretical contributions are presented in this research. Original expressions for the moments and mode of the distribution are proposed. In addition, some forms of GLG density are analyzed via graphical representations. The aim is to graphically highlight the mode and moments, as well as to show some cases where the absence of these measures appears. Finally, the expected value, variance, and mode are analyzed considering the value of q .

2.3.1. Theoretical Contributions

To obtain the general expression for the GLG distribution moments, see Lemma 1.
Lemma 1
(GLG Distribution Moments). Considering a positive integer number m , if  X ~ G L G μ , σ , q , then
E X m = exp m μ + 2 m σ ln q q Γ q 2 + m σ q Γ q 2
only if q > 1 m σ     q 0 . When m + , E X m only exists for q > 0 .
Proof. 
Denoting f U u | q the density function of U ,
f U u | q = u q 2 1 exp u Γ q 2               u > 0 ,     q 0 ,
where Γ a = 0 + u a 1   exp u d u is the gamma function for a positive number a . To isolate the random variable U , one can rewrite the expression for X in the following form:
X = exp μ + σ ln q 2 U q = exp μ + σ ln q 2 q + σ q ln U = = exp μ + σ ln q 2 q exp σ q ln U = exp μ + σ ln q 2 q U σ q
So, GLG m moment is given via:
E X m = E exp μ + σ ln q 2 q U σ q m = exp m μ + 2 m σ ln q q E U m σ q
The expected value can now be calculated as follows:
E X m = exp m μ + 2 m σ ln q q C 0 + u m σ q f U u | q d = C 0 + u m σ q u q 2 1 exp u Γ q 2 d u = = C Γ q 2 0 + u q 2 + m σ q 1 exp u d u = C Γ q 2 Γ q 2 + m σ q
Finally,
E X m = exp m μ + 2 m σ ln q q Γ q 2 + m σ q Γ q 2 ,
only if q 2 + m σ q > 0     q 0 .
Otherwise, this moment is equal to + , which implies the inexistence of E X m . Simplifying the condition referred to above, with q 0 , yields
q 2 + m σ q > 0   1 q 2 + m σ q q 2 > 0   1 + m σ q q 2 > 0   q 2 > 0 1 + m σ q > 0   m σ q > 1   σ > 0 q > 1 m σ q 0
Since lim m + 1 m σ = 0 , when m is a sufficiently large number, the correspondent moment only exists for q > 0 . □
Regarding the existence of the mode in the GLG distribution and how it provides useful statistical information (especially when the first movement does not exist), see Lemma 2.
Lemma 2
(GLG Distribution Mode). Considering X ~ L G G μ , σ , q . If  q < 1 σ , with  q 0 , then the mode is given via
M o = e x p μ + σ ln ( 1 σ q ) q .
Otherwise, GLG density is a decreasing function.
Proof. 
To q 0 , rewriting the first branch of Equation (1) yields,
f x | μ , σ , q = q q 2 q 2 x σ   Γ q 2 exp q 2 q w exp q w = q q 2 q 2 x σ   Γ q 2 exp q 1 w q 2 e x p q w
Obtaining the first derivative of f respect to x ,
f x = q q 2 q 2 x σ   Γ q 2 exp q 1 w q 2 e x p q w + q q 2 q 2 x σ   Γ q 2 exp q 1 w q 2 e x p q w ´ = q q 2 q 2 x 2 σ   Γ q 2 exp q 1 w q 2 e x p q w + q q 2 q 2 x σ   Γ q 2 exp q 1 w q 2 e x p q w q 1 w x q 2 e x p q w q w x
Changing the terms’ order in the previous expression and considering w = ln x μ / σ ,
f x = x q q 2 q 2 x 2   σ   Γ q 2 exp q 1 w q 2 e x p q w q 1 1 σ x q 2 e x p q w q 1 σ x q q 2 q 2 x 2   σ Γ q 2 exp q 1 w q 2 e x p q w = q q 2 q 2 x 2   σ Γ q 2 > 0 exp q 1 w q 2 e x p q w > 0 q 1 σ q 1 σ e x p q w 1 D
Since q 0 , this derivative is only null when D = 0 . This equation can be solved using the following:
D = 0 q 1 σ exp q ln x μ σ = 1 q 1 σ exp q ln x μ σ = 1 σ q
This equation only has a solution when 1 σ q > 0 , which is equivalent to q < 1 σ . In this case,
q ln x μ σ = ln ( 1 σ q ) ln x = μ + σ ln ( 1 σ q ) q .
Finally, we can obtain,
x = e x p μ + σ ln ( 1 σ q ) q ,   q < 1 σ .
Let us now solve the in equation f / x > 0 , to find the values of x where GLG density is an increasing function.
This relationship is only observed when D > 0 , so
q 1 σ q 1 σ e x p q w 1 D > 0 q 1 σ e x p q w > 1 q 1 σ q 1 exp q w < q 1 σ
The resolution of in Equation (14) depends on q .
  • Case 1: q > 0
If q > 0 , the previous expression (14) is equivalent to
exp q w < 1 σ q .
For 1 σ q 0 q 1 σ , the in Equation (15) is impossible, so f / x 0 .
It was already proved that for the same values of q , f / x 0 , then f / x < 0 . This means GLG density is a decreasing function when q 1 σ .
When 0 < q < 1 σ , in Equation (15) is equivalent to
q w < ln ( 1 σ q ) q ln x μ σ w < ln ( 1 σ q ) ln x < μ + σ ln ( 1 σ q ) q x < e x p μ + σ ln ( 1 σ q ) q .
So, for 0 < q < 1 σ , GLG density increases when x e x p μ + σ ln ( 1 σ q ) q and decreases after that value, confirming that x = e x p μ + σ ln ( 1 σ q ) q is the distribution mode.
  • Case 2: q < 0
When q < 0 , it can be concluded that 1 σ q > 1 . So, in this case, the previous expression (14) is equivalent to
exp q w > 1 σ q > 1 q w > ln ( 1 σ q ) q ln x μ σ w > ln ( 1 σ q ) q ln x μ > σ ln ( 1 σ q ) q < 0 ln x < μ + σ ln ( 1 σ q ) q x < e x p μ + σ ln ( 1 σ q ) q .
So, the same result for the 0 < q < 1 σ case is obtained. The proof is now concluded. □

2.3.2. Density function: Graphical Representations

This section presents graphical representations of the GLG density function, considering three different situations.
  • First situation:  1 σ < q < 1 σ
The first case, when 1 σ < q < 1 σ , corresponds to the case when expected values and modes exist. The two graphical representations in Figure 1 represent two GLG density functions, both considering μ = σ = 1 , where in Figure 1A q = 0.5 and Figure 1B refers to q = 0.5 . Both distributions have a heavy tail on the right (characteristic of the GLG distribution), giving rise to a M o 0.6796 and E X 3.3979 in the first case, and M o 1.2081 and E X 7.2488 in the second case. The mode is higher when q = 1 / 2 and the density maximum value decreases. Furthermore, a negative q is associated with longer and thicker tails, corresponding to higher expected values.
  • Second situation:  q 1 σ
In this case, the GLG density is a decreasing function of x , and the expected value is a finite number. This situation occurs when q 1 σ . Figure 2 represents two GLG density functions, both considering μ = σ = 1 . Case Figure 2A considers q = 1 , which originates E X 2.7183 . Case Figure 2B considers q = 2 and results in E X 1.8375 . From Figure 2, we can see that (considering the same values of μ and σ ), when q increases and its difference from 1 σ becomes larger, the expected value tends to decrease.
  • Third situation:  q 1 σ
Finally, q 1 σ is considered, which corresponds to the case without the expected value and mode equal to e x p μ + σ ln ( 1 σ q ) q . The two graphical representations in Figure 3 represent two GLG density functions, both considering μ = σ = 1 , where in Figure 3A q = 1 , and Figure 3B refers to q = 2 .
Both distributions (without expected value) have longer and thicker tails on the right, giving rise to a M o 1.3591 in the first case and M o 1.5694 in the second case.

2.3.3. Expected Value, Variance, and Mode as Functions of the Shape Parameter

To access some parameters of the GLG distribution (expected value, variance, and mode) as a function of the shape parameter q , three graphical representations are considered in Figure 4. All the measures consider μ = σ = 1 .
Case Figure 4A and Case Figure 4B show that for unitary values of μ and σ , considering the result in Lemma 1, the expected value only exists for q > 1 and, since variance is equal to E X 2   E X , it can only be obtained for q > 1 / 2 . It is possible to observe that the expected value and variance are lower for the highest values of q . Case Figure 4C represents the mode evolution as q varies. Considering the result in Lemma 2, a mode does not exist for q 1 . It can be observed that the lowest values of q are associated with higher mode values. When q , the mode approaches a finite value. Retaking the result in Lemma 2, the following limit can be computed:
lim q M o = lim q e x p μ + σ ln ( 1 σ q ) q = e x p μ exp lim q σ ln ( 1 σ q ) q + / = = C a u c h y s   R u l e e x p μ exp lim q σ σ 1 σ q 1 = e x p μ exp σ 2 + = e x p μ
So, when q , the mode approaches the location parameter exponentially. In Case Figure 4C, the horizontal asymptote is given by the equation y = e .

3. Data and Methods

This section begins by describing the data used in the empirical part of this paper. Next, reference is made to some methodological procedures, where the estimation methodology centered on the maximum likelihood method in the presence of censored data are discussed. As for the theoretical models involved in this research (probability distributions), the considerations and theoretical framework were presented in the previous section. As for the computational implementation, this was performed in the R programming language. The 4.3.1. R software packages used in this study are ggplot2 [45], survival [46], and flexsurv [47]. Briefly, the survival package is used to incorporate the indication of whether it is an observed or censored lifetime into the firm’s uptime. The flexsurv package is used to estimate the parametric models and obtain the point estimates and confidence intervals of the survival probabilities using the methods proposed by [48].

3.1. Data

To address the objective of applying the theoretical models discussed, life expectancy data from companies were used.
The dataset was provided by the Instituto Nacional de Estatística (INE) via the Integrated Business Accounting Systems—SCIE INE [49]. The developed analysis is based on the groups of Section J—Information and Communication; according to NACE Rev. 2 (European Classification of Economic Activities). The sectoral group on which this research focuses is G61—Telecommunications.
This study is based on the period 2004–2018. The fact that some enterprises were born and died during this period allows us to know and assess the survival time of a substantial number of enterprises. However, for the remaining ones, it is not possible to accurately determine their time on the market. Therefore, the presence of censored data are considered.
In Figure 5, four types of censoring are distinguished: (i) case 1 presents companies for which the period of activity is known; (ii) case 2 aggregates companies that emerged in the period 2004–2018 and remain in the market after that period; (iii) in case 3, organizations created before 2004 and activity ceased before 2018 were considered; (iv) case 4 corresponds to companies with activities started before 2004 and remained after 2018. This generic graphical approach was based on [27].
Table 1 presents the number of companies with observed and censored life expectancies (disaggregated by case) among the 1922 companies analyzed. The presented data indicate a censorship rate of around 47.45%. Considering the different types of censorship, most companies belong to Case 2; it is guaranteed that they will remain active after 2018. Companies belonging to Case 4 represent a minority, including the most consolidated companies in the economic sector.
For more detailed information on non-censored firms (only firms with known times of activity are considered, corresponding to Case 1), some descriptive statistics on the number of years of activity are presented in Table 2.
For all individual enterprises considered, these enterprises stay in business between three and four years on average. Half of the companies do not reach 4 years (median = 3), which shows that these sector groups are relatively fragile and not very resilient. In fact, none of these companies fully covered the period 2004–2018. Table 3 specifies the duration of activity of the enterprises that were not fully observed, i.e., the enterprises included in Cases 2, 3, and 4 of Figure 5.
From the data presented, it is possible to notice a high number of enterprises that have only been observed for one or two years. This presents an additional challenge to the estimation process, given the scarcity of information available in these situations.

3.2. Methods: Estimation Methodology with Censored Data

Having presented the data under study in the present section, some considerations are made about the methodological methods and procedures followed in this research. Regarding the models used in this study, four parametric models are applied to a firm’s life expectancy: GLG, gamma, Weibull, and log-normal.
Given the characteristics of the data under study (an asymmetric distribution with a heavy right tail), it is believed that the mentioned family of distributions could be a good option for modeling purposes (more details in Section 4). Before presenting the results, a more detailed description of the estimation methodology with censored data are necessary.
The estimation process only starts after determining the values of the variable c i , which represents the number of activity years for the company i in the period 2004–2018. In Case 1 of Figure 5, c i represents the total lifetime of a ceased enterprise during the analyzed period. In Case 2, the variable c i may be considered the age of the company in 2018, that is, at the end of the period. In Cases 3 and 4, c i is a minorant to companies’ life expectancy. Hence,
c i = F i n a l _ y e a r i S t a r t i n g _ y e a r i + 1
where S t a r t i n g y e a r i and F i n a l _ y e a r i refer, respectively, to the first and last years of the activity of the company i , during the period 2004–2018. Knowing that X i = c i +   l i , with   l i [ 0 , + ] , it can define a status variable δ i , which assumes a value of 1 if X i is an observed value and 0 otherwise. The most suitable estimation methodologies applied to the censored data consider the maximum likelihood method because it adjusts to the occurrence of censorship in each observation. However, the maximum likelihood method does not allow for the consideration of overly complex censorship mechanisms [50].
In the present analysis, the simplification focuses on conceptualizing Cases 2 to 4 of Figure 5 as right-censored observations. Some companies will only be observed until the censorship time c i , and it can be guaranteed that X i > c i . It can observe the concretization of the random variables ( T 1 , T 2 , , T n ) , where
T i = X i ,     i f   X i c i   c i ,     i f   X i > c i .
Considering n companies, with censorship times ( c 1 , c 2 , , c n ) and observed activity times ( x 1 , x 2 , , x n ) , each business individual is associated to the pair ( x i , c i ) , and the lifetime expectation is only observed if x i c i . Assuming the variables X i are independent and identically distributed, with a probability density function f x and a survival function S x , following the definition suggested by [51], data can be represented by n pairs of random variables T i , i , where
T i = m i n X i , c i   and   i = 1 ,       if   X i c i   0 ,       if   X i > c i .
Considering a sample of t i , δ i obtained from the pair of random variables T i , i , we can verify that δ i matches the binary variable status defined previously, that is, δ i assumes the value 1, if the company lifetime expectancy is known, and zero, if that activity time is censored. Consequently, T i = X i for the unitary status value and T i = c i otherwise. On the basis that X i are identically distributed and independent of the censorship, the random vector T i , i is also identically distributed. Hence, it can be proved that the joint probability density function of T , is given via
f T , t , δ = f   x δ S c 1 δ
where f and S are, respectively, the density and the survival functions of the random variable X [51]. If the pairs of vectors T i , i are independent, the likelihood function can be written using
L = i = 1 n f x i δ i S c i 1 δ i
The parameter values that maximize L or ln L are the same. Hence,
ln L = i = 1 n δ i ln f x i + i = 1 n 1 δ i ln S c i
Assuming the random variable X distribution is made up of p parameters θ 1 , θ 2 , ,   θ p , maximum likelihood estimators θ ^ 1 , θ ^ 2 , , θ ^ p are the solutions of the first order conditions, described on the system of Equation (22).
M θ 1 = 0 M θ 2 = 0 M θ p = 0 .
It should also be noted that for those optimal solutions, the second-order conditions must be satisfied, that is, the Hessian Matrix H (23) must be negative definite.
H = 2 M θ 1 2 2 M θ 1 θ 2 2 M θ 2 θ 1 2 M θ 2 2 2 M θ 1 θ p 2 M θ 2 θ p 2 M θ p θ 1 2 M θ p θ 2 2 M θ p 2

4. Results

After the description of the data made in Section 3.1, this section is dedicated to the presentation of the results of the four considered models. To access the four parametric models, the respective parameters are estimated. The results are presented in Table 4.
Figure 6 includes the probability density functions (PDF) resulting from the estimation process via the maximum likelihood method. The variable t refers to the activity time in years.
As shown in Figure 6, the value of the Akaike Information Criterion (AIC) [52] is presented in each graphical representation. This criterion allows us to compare the model’s quality in terms of goodness of fit, and the associated coefficient is defined as
A I C = 2 L θ ^ 1 ,   θ ^ 2 , ,   θ ^ k X 1 ,   X 2 , , X k + 2 k
where, L θ 1 ,   θ 2 , ,   θ k X 1 ,   X 2 , , X k is the likelihood function of the model with k parameters based on a sample of size n and θ ^ 1 ,   θ ^ 2 , ,   θ ^ k refer to the parameter estimators using the MLE method. In this work, the Akaike criterion has been used to evaluate the goodness of fit for each studied model. Regarding the calculated AIC values, the best model for the G61 sector group density function is the generalized Log Gamma. Nevertheless, the data set studied here presents a large amount of censored data (see Table 1). Therefore, in future work, we propose to study, validate, and implement new metrics for goodness of fit when building models with a large amount of censored data.
In addition, for an easier visualization of model contrasts, the PDF and survival function, S ( t ) , are overlappingly displayed in Figure 7.
The GLG is the model that best fits our data. Considering the distributions of the two parameters, log-normal stands out relative to gamma and Weibull. As shown in Figure 7, it can be noted that GLG considers a higher failure probability in the first years of activity, best covering the most fragile parts of the companies. Regarding the long-term survival probabilities (higher than 30 years), GLG admits a more elevated life expectancy than the remaining models, more effectively incorporating the life expectancy of the most resilient companies. Comparing the four models and considering the AIC information, it is possible to order the four models from the best fit to the least fit: GLG, log-normal, gamma, and Weibull. After a detailed examination of gamma and Weibull parameterizations as particular cases of GLG, it can be observed that for these two cases, the parameter q assumes a positive value, while for the GLG estimated model, this parameter is negative. Then, it can be deduced that the condition q > 0 is a central reason for the worst results of the gamma and Weibull models. The most favorable result associated with the log-normal results corresponds to the closest proximity of a negative value for the parameter q , in this case, q = 0 (see Section 2.2).
Once the parametric models have been estimated and their quality assessed, one main application of this study is estimating the survival probabilities of companies. Table 5 contains the estimations and the confidence intervals (95%) of survival probabilities considering the four chosen parametric models.
From the table, we can observe that the survival probability is lower than 50%, starting in the fifth/sixth year. The two parameter models (gamma, Weibull, and log-normal) recorded less optimistic survival probability estimations, which are more pronounced in gamma and Weibull. Furthermore, note that after the eighth year, it can be observed that the estimated probabilities for the GLG distribution are always higher than those of the other distributions (gamma, Weibull, and log-normal), providing more optimal survival probabilities associated, for example, with the strongest companies in the market.

5. Discussion and Conclusions

Business competitiveness is nothing new. However, because of an increasingly global environment, it is becoming more and more evident. This situation, combined with other constraints, contributes to many companies not being able to resist and prematurely closing their operations. The business fabric of the Information and Communication sector is no exception. To better understand this phenomenon (from an analysis and prediction perspective), it is important to identify the mathematical/statistical models that fit this type of data behavior. In light of the above information, when assessing the life span of enterprises, particularly in this business area, the literature points to a marked and unbalanced heterogeneity in terms of proportion.
A considerable number of firms do not exceed four or five years of activity, while a small part remains active for many years. This leads to a strongly skewed probabilistic distribution of lifetime, with a heavy tail on the right. In the scientific literature, several probability distributions with these characteristics can be identified. Among these possibilities is the GLG probability distribution, for which the literature identifies other distributions as possible cases (gamma, Weibull, and log-normal distributions). We believe that by using GLG, we have modeled the resilience of the telecommunications sector in Portugal in a way that has never been seen before. This study employs the Akaike criterion to assess the goodness of fit of each model. However, the dataset under investigation contains a substantial volume of censored data (see Table 1). Therefore, in future work, we suggest exploring, validating, and incorporating new metrics to evaluate the goodness of fit when working with large censored datasets. To apply GLG to a dataset referring to the life span of companies, which is not yet reported in the literature, a theoretical study of this parametric distribution (and consequent family of distributions) was necessary. This work identified some gaps in the literature. In the theoretical developments, expressions for the moments and mode of the GLG distribution are presented. From this study, it is possible to conclude that the existence of moments centered at the origin rarely arises for a negative q . Regarding the mode, it exists whenever q is less than the scale parameter inverse. It was inferred that the interval 1 / σ < q < 1 / σ enables the existence of a mode and an expected value. When the parameters μ and σ are fixed, it can be observed that the GLG expected value, variance, and mode decrease when the shape parameter q is higher. Furthermore, negative values of q are associated with longer and thicker density function tails. This approach using GLG, which allows the q parameter to be less than zero, will give a new impetus to modeling company lifetimes.
To achieve the objectives related to the empirical side of this research, four parametric models were applied to a dataset on the lifetime of firms in the groups of Section J—Information and Communication. For a more robust analysis, given some difficulties in the dataset, a censoring methodology is presented in this research. Based on the state of the art, the originality of the application of GLG models to the life expectancy of companies can be reported. The developed study points out GLG models as a promising solution to model the longevity of firms in the sector. Given that there is a strong presence of censored data, the maximum likelihood method reveals the best estimation alternative. In addition, it should be noted that the model’s complexity, combined with the data treatment exigencies, implies the use of sophisticated optimization algorithms. In the project development, the flexsurvreg function from the library flexsurv of the R statistical software made the estimation process possible using the BFGS (Broyden, Fletcher, Goldfarb, and Shanno) optimization algorithms, which are characterized by being a Quasi-Newton method that numerically calculates the second derivatives of the Hessian Matrix. It can be said that there are no convergency problems in the estimation methodology developed. All the models show probabilities of surviving more than 15 years, far below the censorship rate. This can be assumed to be a positive indicator regarding the estimation process because the censorship rates represent a major factor in the probability of surviving after 15 years. In this case, the analysis period has a fifteen-year duration. However, in the future, this horizon may be extended by including more data, which will represent a valuable addition to the conclusions reported in this study.
For this sectorial group, the estimated parameter q assumed a negative value, which justifies the non-use of generalized gamma, restricted to a positive value of q . Concerning modeling the life expectancy of companies, it is possible to see that GLG is more suitable than its three special cases with two parameters. The complexity and behavior of a lifetime require a three-parameter model that can characterize the high probability of failure in the first years of activity for the most fragile companies and, simultaneously, highlight the long-life expectancy of the most resilient ones. Although the presence of censored data has created new challenges in the estimation process, it was possible to provide a set of strong results, which made those conclusions possible. Considering the GLG model, the probability of a company surviving ten or more years is 31.9%. The same model shows that nearly 50% of companies survive for more than five years. Based on GLG estimates, the probability of surviving fifteen years or more is almost 25%. Considering the above, it is worth mentioning that this study is a starting point for an in-depth understanding of the success and failure records of companies and can be applied to any sectoral group in domestic or non-domestic economies. The consolidation of the conclusions referred to in this study depends on data collection, and it is worth noting the ambition to extend the analysis to a broader period in the future. However, in addition to the theoretical contributions highlighted, the practical parts of this research highlight the following contributions: (i) it allows us to compare the resilience between different sectors of the economy (for example, we could compare the resilience of the telecommunications sector with the press sector); (ii) it allows us to compare the resilience of the same sector in two different time periods: for example, comparing 2004–2018 with 2008–2022 to see if there is a statistically significant difference between these periods.
Thus, trusting that this work will shed light on the subject, some limitations in this present research are recognized: (i) a reductionist view of reality, modeling the expected life of firms without considering factors that may be determinant for their survival (e.g., initial investment, capital stock, number of employees, impact on the economy); (ii) the information used in this study has some limitations, as the database does not include information on possible mergers and reorganizations of companies, which may lead to one-off biases; (iii) making inferences about the suitability of these models since they have only been tested on a specific dataset. In addition, future work should explore improvements in modeling this phenomenon, namely (i) evaluating the possibility of using other models that better fit the data analyzed, (ii) testing these models on another data set, (iii) improving the dataset by including information on firms that can be used as additional variables in studying the life expectancy of firms, and (iv) considering regression models that allow for a more robust evaluation and more accurate forecasting.
In summary, the contributions of this research are acknowledged, although some limitations are noted, some of which are beyond the scope of this study. In addition to the application made to a real data set, the theoretical contributions made provide contributions to the subject and to the scientific debate. If we use this distribution with less censored data and some associated explanatory variables, we may have found the right path to the ultimate goal: a malleable probability distribution with tails that can adapt to life expectancies close to infinity and, at the same time, adapt to very high early death rates.

Author Contributions

All authors (J.H.D.G., J.J.F.G., L.R. and F.R.R.) contributed to the study’s conception. Individual contributions: conceptualization, J.H.D.G. and F.R.R.; literature review, J.H.D.G., L.R. and F.R.R.; theoretical/mathematical frameworks, J.H.D.G., J.J.F.G. and F.R.R.; methodology, J.H.D.G. and J.J.F.G.; formal analysis, L.R. and F.R.R.; computational implementations, J.H.D.G. and J.J.F.G.; data analysis, J.H.D.G., J.J.F.G., L.R. and F.R.R.; writing—original draft preparation, J.H.D.G. and F.R.R.; writing—review and editing, F.R.R.; final validation, F.R.R. All authors have read and agreed to the published version of the manuscript.

Funding

L.R. is supported by the Universidad del Norte. J.J.F.G. and F.R.R. are financed by national funds through FCT—Fundação para a Ciência e a Tecnologia under the project UIDB/00006/2020.

Data Availability Statement

The data supporting the results of this study are available from the National Institute of Statistics (INE, https://www.ine.pt). However, there are restrictions on the availability of these data, which were used under license for this study and are not publicly available. Nevertheless, the data are available to the authors upon reasonable request and with authorization from INE.

Acknowledgments

This work is partially financed by national funds through FCT—Fundação para a Ciência e a Tecnologia under the project UIDB/00006/2020. J.H.D.G dedicates this working paper to her mother, Maria Madalena. The same author would like to thank the employer institution—the Portuguese Regulatory Authority for the Media (ERC)—for their timetable flexibility, which was essential to this scientific project development. The authors would like to thank Miguel Graça Balbina and Beatriz Nogueira Sequeira for their suggestions. The authors express gratitude to Statistics Portugal for making data available from the Integrated Business Accounting Systems (SCIE).

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of this study, in the collection, analysis, or interpretation of data, in the writing of this manuscript, or in the decision to publish these results.

References

  1. Altman, E.I. Why Businesses Fail. J. Bus. Strategy 1983, 3, 15–21. [Google Scholar] [CrossRef]
  2. Baumard, P.; Starbuck, W.H. Learning from Failures: Why It May Not Happen. Long Range Plann 2005, 38, 281–298. [Google Scholar] [CrossRef]
  3. Benedettini, O.; Neely, A.; Swink, M. Why Do Servitized Firms Fail? A Risk-Based Explanation. Int. J. Oper. Prod. Manag. 2015, 35, 946–979. [Google Scholar] [CrossRef]
  4. Mourdoukoutas, P.; Stefanidis, A. Why Companies Succeed or Fail: Corporate Cycles and Firm Function in Tandem. J. Bus. Strategy 2023, 44, 76–86. [Google Scholar] [CrossRef]
  5. Mata, J.; Portugal, P. Life Duration of New Firms. J. Ind. Econ. 1994, 42, 227–245. [Google Scholar] [CrossRef]
  6. Mata, J.; Portugal, P.; Guimarães, P. The Survival of New Plants: Start-up Conditions and Post-Entry Evolution. Int. J. Ind. Organ. 1995, 13, 459–481. [Google Scholar] [CrossRef]
  7. Mata, J.; Portugal, P. Patterns of Entry, Post-Entry Growth and Survival. Small Bus. Econ. 2004, 22, 283–298. [Google Scholar] [CrossRef]
  8. Geroski, P.A.; Mata, J.; Portugal, P. Founding Conditions and the Survival of New Firms. Strateg. Manag. J. 2010, 31, 510–529. [Google Scholar] [CrossRef]
  9. Reis, P.N.; Augusto, M.G. What Is a Firm’s Life Expectancy? Empirical Evidence in the Context of Portuguese Companies. J. Bus. Valuat. Econ. Loss Anal. 2015, 10, 45–75. [Google Scholar] [CrossRef]
  10. Daepp, M.I.G.; Hamilton, M.J.; West, G.B.; Bettencourt, L.M.A. The Mortality of Companies. J. R. Soc. Interface 2015, 12, 20150120. [Google Scholar] [CrossRef]
  11. Sharma, K.; Dixit, M.R. Live Long and Prosper: The Search for Business Longevity. In Strategic Direction; Emerald Group Publishing Ltd.: Bingley, UK, 2017; pp. 1–3. [Google Scholar] [CrossRef]
  12. Clark, D. Average Company Lifespan on Standard&Poor’s 500 Index. Available online: https://www.statista.com/statistics/1259275/average-company-lifespan/ (accessed on 10 June 2021).
  13. Stacy, E.W. A Generalization of the Gamma Distribution. Ann. Math. Stat. 1962, 33, 1187–1192. [Google Scholar] [CrossRef]
  14. Stacy, E.W.; Mihram, G.A. Parameter Estimation for a Generalized Gamma Distribution. Technometrics 1965, 7, 349–358. [Google Scholar] [CrossRef]
  15. Prentice, R.L. A Log Gamma Model and Its Maximum Likelihood Estimation. Biometrika 1974, 61, 539–544. [Google Scholar] [CrossRef]
  16. Bourguignon, M.; Lima, M.d.C.S.; Leão, J.; Nascimento, A.D.C.; Pinho, L.G.B.; Cordeiro, G.M. A New Generalized Gamma Distribution with Applications. Am. J. Math. Manag. Sci. 2015, 34, 309–342. [Google Scholar] [CrossRef]
  17. Lahcene, B. A New Extended-Gamma Family of Distributions: Properties and Applications. J. Appl. Math. Comput. 2021, 5, 9–17. [Google Scholar] [CrossRef]
  18. Lawless, J.F. Inference in the Generalized Gamma and Log Gamma Distributions. Technometrics 1980, 22, 409–419. [Google Scholar] [CrossRef]
  19. DiCiccio, T.J. Approximate Inference for the Generalized Gamma Distribution. Technometrics 1987, 29, 33–40. [Google Scholar] [CrossRef]
  20. Agostinelli, C.; Marazzi, A.; Yohai, V.J. Robust Estimators of the Generalized Log-Gamma Distribution. Technometrics 2014, 56, 92–101. [Google Scholar] [CrossRef]
  21. de Pascoa, M.A.R.; Ortega, E.M.M.; Cordeiro, G.M. The Kumaraswamy Generalized Gamma Distribution with Application in Survival Analysis. Stat. Methodol. 2011, 8, 411–433. [Google Scholar] [CrossRef]
  22. Kaneko, R. Elaboration of the Coale-McNeil Nuptiality Model as The Generalized Log Gamma Distribution: A New Identity and Empirical Enhancements. Demogr. Res. 2003, 9, 223–262. [Google Scholar] [CrossRef]
  23. Ortega, E.M.; Cancho, V.G.; Lachos, V.H. A Generalized Log-Gamma Mixture Model for Cure Rate: Estimation and Sensitivity Analysis. Indian J. Stat. 2009, 71-B, 1–29. [Google Scholar]
  24. Cordeiro, G.M.; Ortega, E.M.M.; Silva, G.O. The Exponentiated Generalized Gamma Distribution with Application to Lifetime Data. J. Stat. Comput. Simul. 2011, 81, 827–842. [Google Scholar] [CrossRef]
  25. Ramos, P.L.; Nascimento, D.C.; Ferreira, P.H.; Weber, K.T.; Santos, T.E.G.; Louzada, F. Modeling Traumatic Brain Injury Lifetime Data: Improved Estimators for the Generalized Gamma Distribution under Small Samples. PLoS ONE 2019, 14, e0221332. [Google Scholar] [CrossRef] [PubMed]
  26. Ramos, P.L.; Mota, A.L.; Ferreira, P.H.; Ramos, E.; Tomazella, V.L.D.; Louzada, F. Bayesian Analysis of the Inverse Generalized Gamma Distribution Using Objective Priors. J. Stat. Comput. Simul. 2021, 91, 786–816. [Google Scholar] [CrossRef]
  27. Shang, X.; Ng, H.K.T. On Parameter Estimation for the Generalized Gamma Distribution Based on Left-truncated and Right-censored Data. Comput. Math. Methods 2021, 3, e1091. [Google Scholar] [CrossRef]
  28. Ahmed, E.A.; El-Morshedy, M.; Al-Essa, L.A.; Eliwa, M.S. Statistical Inference on the Entropy Measures of Gamma Distribution under Progressive Censoring: EM and MCMC Algorithms. Mathematics 2023, 11, 2298. [Google Scholar] [CrossRef]
  29. Tovar-Falón, R.; Martínez-Flórez, G.; Bolfarine, H. Modelling Asymmetric Data by Using the Log-Gamma-Normal Regression Model. Mathematics 2022, 10, 1199. [Google Scholar] [CrossRef]
  30. Prataviera, F.; Ortega, E.M.M.; Cordeiro, G.M.; Braga, A.d.S. The Heteroscedastic Odd Log-Logistic Generalized Gamma Regression Model for Censored Data. Commun. Stat. Simul. Comput. 2019, 48, 1815–1839. [Google Scholar] [CrossRef]
  31. Prataviera, F.; Ortega, E.M.M.; Cordeiro, G.M.; Pescim, R.R.; Verssani, B.A.W. A New Generalized Odd Log-Logistic Flexible Weibull Regression Model with Applications in Repairable Systems. Reliab. Eng. Syst. Saf. 2018, 176, 13–26. [Google Scholar] [CrossRef]
  32. Alzaatreh, A.; Carl, L.; Famoye, F. Family of Generalized Gamma Distributions: Properties and Applications. Hacet. J. Math. Stat. 2016, 45, 869–886. [Google Scholar] [CrossRef]
  33. Rashid, A.; Ahmad, Z.; Jan, T.R. A New Lifetime Distribution for Series System: Model, Properties and Application. J. Mod. Appl. Stat. Methods 2018, 17, 43–67. [Google Scholar] [CrossRef]
  34. Cooray, K.; Ananda, M.M.A. A Generalization of the Half-Normal Distribution with Applications to Lifetime Data. Commun. Stat. Theory Methods 2008, 37, 1323–1337. [Google Scholar] [CrossRef]
  35. Cordeiro, G.M.; Alizadeh, M.; Pescim, R.R.; Ortega, E.M.M. The Odd Log-Logistic Generalized Half-Normal Lifetime Distribution: Properties and Applications. Commun. Stat. Theory Methods 2017, 46, 4195–4214. [Google Scholar] [CrossRef]
  36. Okorie, I.E.; Akpanta, A.C.; Ohakwe, J.; Chikezie, D.C.; Onyemachi, C.U.; Obi, E.O. The Adjusted Log-Logistic Generalized Exponential Distribution with Application to Lifetime Data. Int. J. Stat. Probab. 2017, 5, 1. [Google Scholar] [CrossRef]
  37. Lancaster, H.O. Forerunners of the Pearson Χ2. Aust. N. Z. J. Stat. 1966, 8, 117–126. [Google Scholar] [CrossRef]
  38. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; John Wiley & Sons: Hoboken, NJ, USA, 1995; Volume 2. [Google Scholar]
  39. Abramowitz, M.; Stegun, I.A.; Romer, R.H. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables; American Association of Physics Teachers: College Park, MD, USA, 1988. [Google Scholar]
  40. Bartlett, M.S.; Kendall, D.G. The Statistical Analysis of Variance-Heterogeneity and the Logarithmic Transformation. Suppl. J. R. Stat. Soc. 1946, 8, 128–138. [Google Scholar] [CrossRef]
  41. Hager, H.W.; Bain, L.J. Inferential Procedures for the Generalized Gamma Distribution. J. Am. Stat. Assoc. 1970, 65, 1601–1609. [Google Scholar] [CrossRef]
  42. Amoroso, L. Ricerche Intorno Alla Curva Dei Redditi. Ann. Mat. Pura Appl. 1925, 2, 123–159. [Google Scholar] [CrossRef]
  43. Farewell, V.T.; Prentice, R.L. A Study of Distributional Shape in Life Testing. Technometrics 1977, 19, 69–75. [Google Scholar] [CrossRef]
  44. Lawless, J.F. Statistical Models and Methods for Lifetime Data; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
  45. Wickham, H.; Chang, W.; Wickham, M.H. Package ‘Ggplot2′: Create Elegant Data Visualisations Using the Grammar of Graphics; R Foundation for Statistical Computing: Vienna, Austria, 2016; pp. 1–189. [Google Scholar]
  46. Therneau, T. A Package for Survival Analysis in R; R Foundation for Statistical Computing: Vienna, Austria, 2023. [Google Scholar]
  47. Jackson, C.; Metcalfe, P.; Amdahl, J.; Jackson, M.C. Package ‘Flexsurv’: Flexible Parametric Survival and Multi-State Models; R Foundation for Statistical Computing: Vienna, Austria, 2023. [Google Scholar]
  48. Mandel, M. Simulation-Based Confidence Intervals for Functions with Complicated Derivatives. Am. Stat. 2013, 67, 76–81. [Google Scholar] [CrossRef]
  49. INE. Documento Metodológico—Sistema de Contas Integradas das Empresas. Available online: https://smi.ine.pt/DocumentacaoMetodologica/Detalhes/1211 (accessed on 15 June 2021).
  50. Prinja, S.; Gupta, N.; Verma, R. Censoring in Clinical Trials: Review of Survival Analysis Techniques. Indian J. Community Med. 2010, 35, 217. [Google Scholar] [CrossRef] [PubMed]
  51. Patti, S.; Biganzoli, E.; Boracchi, P. Review of the Maximum Likelihood Functions for Right Censored Data. A New Elementary Derivation; COBRA Preprint Series; The Berkeley Electronic Press: Berkeley, CA, USA, 2007. [Google Scholar]
  52. Akaike, H. A New Look at the Statistical Model Identification. In Selected Papers of Hirotugu Akaike; Parzen, E., Tanabe, K., Kitagawa, G., Eds.; Springer: New York, NY, USA, 1998; pp. 215–222. [Google Scholar] [CrossRef]
Figure 1. GLG density function (expected value and mode both exist): (A) q positive; (B) q negative.
Figure 1. GLG density function (expected value and mode both exist): (A) q positive; (B) q negative.
Mathematics 11 04792 g001
Figure 2. GLG density function (with expected value and without mode): (A) q = 1 ; (B) q = 2 .
Figure 2. GLG density function (with expected value and without mode): (A) q = 1 ; (B) q = 2 .
Mathematics 11 04792 g002
Figure 3. GLG density function (without expected value and with mode): (A) q = 1 ; (B) q = 2 .
Figure 3. GLG density function (without expected value and with mode): (A) q = 1 ; (B) q = 2 .
Mathematics 11 04792 g003
Figure 4. Parameters of the GLG distribution as a function of the shape parameter q : (A) Expected value; (B) Variance; (C) Mode.
Figure 4. Parameters of the GLG distribution as a function of the shape parameter q : (A) Expected value; (B) Variance; (C) Mode.
Mathematics 11 04792 g004
Figure 5. A graphic representation of the censorship typologies present in this study.
Figure 5. A graphic representation of the censorship typologies present in this study.
Mathematics 11 04792 g005
Figure 6. Density functions for the sectoral group G61: (A) Generalized log gamma; (B) Gamma; (C) Weibull; (D) Log normal.
Figure 6. Density functions for the sectoral group G61: (A) Generalized log gamma; (B) Gamma; (C) Weibull; (D) Log normal.
Mathematics 11 04792 g006
Figure 7. Graphical representations for the sectoral group G61: (A) Density function; (B) Survival function.
Figure 7. Graphical representations for the sectoral group G61: (A) Density function; (B) Survival function.
Mathematics 11 04792 g007
Table 1. Number of companies with observed and censored life expectations.
Table 1. Number of companies with observed and censored life expectations.
DesignationNumber
Companies with observed life expectations1010
Companies with censored life expectations912
 Censored life expectations—Case 2(714)
 Censored life expectations—Case 3(150)
 Censored life expectations—Case 4(48)
Table 2. Number of years of activity of non-censored enterprises: descriptive statistics.
Table 2. Number of years of activity of non-censored enterprises: descriptive statistics.
CountMeanQ1MedianQ3Max.
10103.323414
Table 3. Number of companies by censored times of activity.
Table 3. Number of companies by censored times of activity.
Times of Activity (Censored)Number
Case 2Case 3Case 4
1+126260
2+107220
3+8870
4+76240
5+56170
6+5580
7+4130
8+4980
9+24120
10+1770
11+3380
12+1040
13+1030
14+1410
15+8048
Table 4. Estimation of model parameters with 95% confidence intervals (LCL and UCL) and moments.
Table 4. Estimation of model parameters with 95% confidence intervals (LCL and UCL) and moments.
Generalized Log GammaGammaWeibullLog Normal
PARESTLCLUCLPARESTLCLUCLPARESTLCLUCLPARESTLCLUCL
μ 0.920.781.05β5.164.635.74β7.877.458.30meanlog1.651.591.70
σ 0.930.861.00γ1.411.311.52n1.181.121.23sdlog1.020.981.07
q −1.69−2.02−1.36------------
Legend: PAR—Parameters; EST—Estimative; LCL/UCL—Lower/Upper confidence limits.
Table 5. Survival probabilities estimates for the sectoral group G61 and 95% confidence intervals.
Table 5. Survival probabilities estimates for the sectoral group G61 and 95% confidence intervals.
TimeGeneralized Log GammaGammaWeibullLog Normal
LCLESTUCLLCLESTUCLLCLESTUCLLCLESTUCL
10.960.970.970.920.930.940.910.920.920.940.950.95
20.770.790.810.820.830.850.800.820.830.810.820.84
30.630.650.670.720.730.750.710.720.740.690.700.72
40.530.550.580.620.640.660.620.640.660.580.600.62
50.470.490.510.530.550.570.540.560.580.490.520.54
60.410.440.460.450.470.500.460.480.500.420.440.47
70.370.400.420.380.410.430.400.420.440.360.390.41
80.340.370.390.320.350.370.340.360.380.310.340.36
90.310.340.360.270.290.320.290.310.330.270.300.32
100.290.320.340.230.250.270.240.270.290.240.260.28
110.270.300.330.190.210.240.200.230.250.210.230.25
120.260.280.310.160.180.200.170.190.220.190.210.23
130.240.270.300.130.150.170.140.160.190.170.190.21
140.230.260.280.110.130.150.120.140.160.150.170.19
150.220.250.270.090.110.130.100.120.140.130.150.17
Legend: EST—Estimative; LCL/UCL—Lower/Upper confidence limits.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dias Gonçalves, J.H.; Ferreira Gomes, J.J.; Rubio, L.; Ramos, F.R. A Generalized Log Gamma Approach: Theoretical Contributions and an Application to Companies’ Life Expectancy. Mathematics 2023, 11, 4792. https://doi.org/10.3390/math11234792

AMA Style

Dias Gonçalves JH, Ferreira Gomes JJ, Rubio L, Ramos FR. A Generalized Log Gamma Approach: Theoretical Contributions and an Application to Companies’ Life Expectancy. Mathematics. 2023; 11(23):4792. https://doi.org/10.3390/math11234792

Chicago/Turabian Style

Dias Gonçalves, José H., João J. Ferreira Gomes, Lihki Rubio, and Filipe R. Ramos. 2023. "A Generalized Log Gamma Approach: Theoretical Contributions and an Application to Companies’ Life Expectancy" Mathematics 11, no. 23: 4792. https://doi.org/10.3390/math11234792

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop