Next Article in Journal
Properties of q-Symmetric Starlike Functions of Janowski Type
Next Article in Special Issue
Simultaneous Confidence Intervals for All Pairwise Differences between the Coefficients of Variation of Multiple Birnbaum–Saunders Distributions
Previous Article in Journal
Stochastic Model for the LMS Algorithm with Symmetric/Antisymmetric Properties
Previous Article in Special Issue
Confidence Intervals for Comparing the Variances of Two Independent Birnbaum–Saunders Distributions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Interval Estimation for the Two-Parameter Exponential Distribution Based on the Upper Record Values

Department of Statistics, Tamkang University, Tamsui, Taipei 251301, Taiwan
Symmetry 2022, 14(9), 1906; https://doi.org/10.3390/sym14091906
Submission received: 22 August 2022 / Revised: 3 September 2022 / Accepted: 8 September 2022 / Published: 12 September 2022

Abstract

:
Using the data for upper record values, the interval estimation for the scale parameter of two-parameter exponential distribution is presented. In addition, two methods for the joint confidence region of two parameters are proposed. In terms of confidence region area, the simulation comparison of two methods of the confidence region is performed in this paper. The criterion of minimum confidence region area is used to obtain the optimal method of the confidence region. To illustrate our proposed interval estimation methods, one biometrical example is used and the corresponding confidence interval length and confidence region area are also calculated. Our research topic is related to the asymmetrical probability distributions and applications across disciplines.

1. Introduction

In lifetime test experiments and reliability analysis, the exponential distribution is widely used in modelling the lifetime data. Wu [1] proposed a modified one-stage multiple comparison procedure for comparing exponential location parameters with the control when the scale parameters are unequal (under heteroscedasticity). Please see Johnson and Kotz [2], Bain [3], Lawless and Singhal [4] and Zelen [5] for other applications of exponential distributions. Other than exponential distributions, Shafiq et al. [6] identified an effective statistical distribution called the modified Kies Frechet (MKIF) model for examining COVID-19 mortality rates in Canada and the Netherlands. El-Khatib and Hatemi-J [7] suggested formulas to make the valuation of options and the underlying hedging strategies during financial crisis more precise. Alzaatreh et al. [8] presented a new distribution called Weibull–Pareto distribution and the properties of this distribution are obtained and illustrated by some real data sets.
Now we start to define the upper record value data as follows: let X 1 , , X n be a random sample from a two-parameter exponential distribution. We treated X j as an upper record value if X j > X l for all l < j . The observed time point for the first upper record value is denoted as U ( 0 ) = 1 . Then the observed time point for the nth upper record value is denoted as U ( n ) , where U ( n ) = min { k : X k > X U ( n 1 ) } . Then { X U ( 0 ) , X U ( 1 ) , , X U ( n ) } is called the sequence of upper record values of the original sequence { X 1 , , X n } . An analogous definition of lower record values is obtained by changing the sign of ‘>’ to the sign of ‘<’. In this study, we were only concerned with the case of upper record values. However, all results should be applicable to the case of lower record values. Record values data appeared in many real life applications, such as life-tests, weather, sports, economics, and so on. In the last two decades, many authors had studied the statistical inference based on the record values. (See for example, Al-Hussaini and Ahmad [9].) For simplicity, we denote the sequence of upper record values { X U ( 0 ) , X U ( 1 ) , , X U ( n ) } as { R 0 , R 1 , , R n } .
Since some statistical properties of exponential distribution depend on their two parameters, the development of the confidence region for two parameters is necessary. Asgharzadeh et al. [10] proposed interval estimation for the Pareto distribution using record values. For the inverse Weibull distribution, Jana and Bera [11] proposed the interval estimation of multicomponent stress–strength reliability. Wu [12] proposed the Bayesian interval estimation for the two-parameter exponential distribution based on the right type II censored sample. Wu [13] proposed the Bayesian interval estimation for the two-parameter exponential distribution using the progressive type II censored sample. In this research, we would like to propose the confidence interval of the scale parameter for two-parameter exponential distribution using the upper record value data. Two methods of joint confidence region for two parameters are also proposed. The Monte Carlo simulation method is used to assess the performance of all interval estimation results in terms of the length of the confidence interval or the area of the confidence region. At last, we give a biometrical example to illustrate the proposed methods.

2. Interval Estimation and Confidence Region for Two Parameters

At first, we consider a random variable X following a two-parameter exponential distribution with scale parameter θ and location parameter μ . Its probability density function (pdf) is given by:
f ( x ) = 1 θ exp [ ( x μ ) / θ ] ,   x μ , μ > 0 , θ > 0 .
Its cumulative distribution function (cdf) is given by F ( x ) = 1 exp ( x μ θ ) .
Consider the sequence of upper record values { R 0 , R 1 , , R n } following the two-parameter exponential distribution. From Arnold et al. [14], the joint pdf of { R 0 , R 1 , , R n } is:
f { r 0 , r 1 , , r n } = f ( r n ) i = 0 n 1 f ( r i ) 1 F ( r i ) = 1 θ n + 1 exp [ ( r n μ ) / θ ] ,   r 0 μ .
The log-likelihood function can be obtained as:
L ( μ , θ ) = ( n + 1 ) ln θ ( r n μ ) / θ .
Since the log-likelihood function is an increasing function of µ, the maximum likelihood estimator (MLE) for µ is given by μ ^ M = r 0 . Substituting the MLE of µ into the log-likelihood function and taking the derivative of the log-likelihood function with respect to θ , then we have θ L ( r 0 , θ ) = ( n + 1 ) θ + r n r 0 θ 2 .
Equating the above equation to zero, then the MLE of θ can be obtained as:
θ ^ M = r n r 0 n + 1 .
After the transformations of Y i = R i μ θ , i = 0 , 1 , , n , we can obtain a sequence of upper record values Y 0 < Y 1 < < Y n from a standard exponential distribution. Due to the property of being memoryless for exponential distribution, it is apparent that the differences between two successive upper record values will be independently and identically distributed (iid) from standard exponential distribution. That is, Z 0 = Y 0 , Z 1 = Y 1 Y 0 , , Z n = Y n Y n 1 are independent identically distributed from standard exponential distribution.
In order to build the interval estimation for parameters, two sets of pivotal quantities are considered. The first set of pivotal quantities consists of h 1 ( μ , θ ) = 2 Z 0 = 2 ( R 0 μ θ ) and g 1 ( θ ) = 2 i = 1 n Z i = 2 ( Y n Y 0 ) = 2 R n R 0 θ . The distributions for these two pivotal quantities are independently chi-squared distributed with 2 and 2 n degrees of freedom, respectively. We denote the chi-squared distribution with degrees of freedom v as χ 2 ( ν ) and the F distribution with degrees of freedom v1 and v2 as F(v1,v2). Furthermore, the second set of pivotal quantities consists of h 2 ( μ ) = h 1 / 2 g 1 / ( 2 n ) = Z 0 / 2 i = 1 n Z i / ( 2 n ) = n ( R 0 μ R n R 0 ) and g 2 ( μ , θ ) = 2 i = 0 n Z i = 2 Y n = 2 R n μ θ . The distributions of these two pivotal quantities are h 2 ( μ ) ~ F ( 2 , 2 n ) and g 2 ( μ , θ ) ~ χ 2 ( 2 ( n + 1 ) ) . The distributions of all pivotal quantities are not a function of parameters. Utilizing the pivotal quantity g 1 ( θ ) , we can build the confidence interval for the scale parameter θ as follows:
Proposition 1.
Considering a sequence of upper record values denoted by R 0 , R 1 , , R n following the two-parameter exponential distribution with parameters θ and μ , the ( 1 α ) 100% confidence intervals of the scale parameter θ is:
( 2 ( R n R 0 ) χ α 2 2 ( 2 n ) , 2 ( R n R 0 ) χ 1 α 2 2 ( 2 n ) ) ,
where χ α 2 2 ( 2 n ) is the right-tailed α / 2 percentile for chi-squared distribution with 2 n degrees of freedom.
Proof of Proposition 1.
Since the distribution of the pivotal quantity g 1 ( θ ) is g 1 ( θ ) ~ χ 2 ( 2 n ) , we have:
1 α = P ( χ 1 α 2 2 ( 2 n ) < g 1 ( θ ) < χ α 2 2 ( 2 n ) ) = P ( 2 ( R n R 0 ) χ α 2 2 ( 2 n ) < θ < 2 ( R n R 0 ) χ 1 α 2 2 ( 2 n ) ) .
Utilizing the first set of pivotal quantities h 1 ( μ , θ ) and g 1 ( θ ) , we can construct the confidence region of two parameters in Theorem 1 and it is called Method 1. □
Theorem 1.
Considering a sequence of upper record values denoted by R 0 , R 1 , , R n following the two-parameter exponential distribution with parameters θ and μ , the ( 1 α )100% joint confidence region of two parameters θ and μ is given by:
μ ( R 0 χ 1 1 α 2 2 ( 2 ) θ 2 , R 0 χ 1 + 1 α 2 2 ( 2 ) θ 2 ) ,
where θ ( 2 ( R n R 0 ) χ 1 1 α 2 2 ( 2 n ) , 2 ( R n R 0 ) χ 1 + 1 α 2 2 ( 2 n ) ) and χ 1 1 α 2 2 ( ν ) and χ 1 + 1 α 2 2 ( ν ) represent the right-tailed 1 1 α 2 and 1 + 1 α 2 percentile for the chi-squared distribution with ν degrees of freedom.
Proof of Theorem 1.
Since the distributions of the first set of pivotal quantities are independent and h 1 ( μ , θ ) ~ χ 2 ( 2 ) and g 1 ( θ ) ~ χ 2 ( 2 n ) , we have
1 α = 1 α 1 α = P ( χ 1 + 1 α 2 2 ( 2 ) < h 1 ( μ , θ ) < χ 1 1 α 2 2 ( 2 ) ) × P ( χ 1 + 1 α 2 2 ( 2 n ) < g 1 ( θ ) < χ 1 1 α 2 2 ( 2 n ) ) = P ( χ 1 + 1 α 2 2 ( 2 ) < h 1 ( μ , θ ) < χ 1 1 α 2 2 ( 2 ) , χ 1 + 1 α 2 2 ( 2 n ) < g 1 ( θ ) < χ 1 1 α 2 2 ( 2 n ) ) = P ( R 0 χ 1 1 α 2 2 ( 2 ) θ / 2 < μ < R 0 χ 1 + 1 α 2 2 ( 2 ) θ / 2 ) , 2 ( R n R 0 ) χ 1 1 α 2 2 ( 2 n ) < θ < 2 ( R n R 0 ) χ 1 + 1 α 2 2 ( 2 n ) ) .
The area for the confidence region based on Method 1 denoted by Area1 is obtained as follows:
Area 1 = ( χ 1 1 α 2 2 ( 2 ) χ 1 + 1 α 2 2 ( 2 ) ) ( U 1 2 L 1 2 ) 4 = L 1 U 1 ( χ 1 1 α 2 2 ( 2 ) χ 1 + 1 α 2 2 ( 2 ) ) θ 2 d θ ,
where U 1 = 2 ( R n R 0 ) χ 1 + 1 α 2 2 ( 2 n ) and L 1 = 2 ( R n R 0 ) χ 1 1 α 2 2 ( 2 n ) .
Utilizing the second set of pivotal quantities h 2 ( μ ) and g 2 ( μ , θ ) , we can construct the confidence region of two parameters in Theorem 2 and it is called Method 2. □
Theorem 2.
Considering a sequence of upper record values R 0 , R 1 , , R n following the two-parameter exponential distribution with parameters θ and μ , the ( 1 α ) 100% joint confidence region of two parameters θ and μ is given by:
θ ( 2 ( R n μ ) χ 1 1 α 2 2 ( 2 ( n + 1 ) ) , 2 ( R n μ ) χ 1 + 1 α 2 2 ( 2 ( n + 1 ) ) ) ,
where μ ( R 0 F 1 1 α 2 ( 2 , 2 n ) R n R 0 n , R 0 F 1 + 1 α 2 ( 2 , 2 n ) R n R 0 n ) and F 1 1 α 2 ( 2 , 2 n ) and F 1 + 1 α 2 ( 2 , 2 n ) represent the right-tailed 1 1 α 2 and 1 + 1 α 2 percentile for F distribution with 2 and 2n-2 degrees of freedom; χ 1 1 α 2 2 ( 2 ( n + 1 ) ) and χ 1 + 1 α 2 2 ( 2 ( n + 1 ) ) represent the right-tailed 1 1 α 2 and 1 + 1 α 2 percentile for the chi-squared distribution with 2 ( n + 1 ) degrees of freedom.
Proof of Theorem 2.
Since the distributions of the second set of pivotal quantities are independent and h 2 ( μ ) ~ F ( 2 , 2 n ) and g 2 ( μ , θ ) ~ χ 2 ( 2 ( n + 1 ) ) , we have:
1 α = 1 α 1 α = P ( F 1 + 1 α 2 ( 2 , 2 n ) < h 2 ( μ ) < F 1 1 α 2 ( 2 , 2 n ) ) × P ( χ 1 + 1 α 2 2 ( 2 ( n + 1 ) ) < g 2 ( μ , θ ) < χ 1 1 α 2 2 ( 2 ( n + 1 ) ) ) = P ( F 1 + 1 α 2 ( 2 , 2 n ) < h 2 ( μ ) < F 1 1 α 2 ( 2 , 2 n ) ) , χ 1 + 1 α 2 2 ( 2 ( n + 1 ) ) < g 2 ( μ , θ ) < χ 1 1 α 2 2 ( 2 ( n + 1 ) ) ) = P ( R 0 F 1 1 α 2 ( 2 , 2 n ) R n R 0 n < μ < R 0 F 1 + 1 α 2 ( 2 , 2 n ) R n R 0 n , 2 ( R n μ ) χ 1 1 α 2 2 ( 2 ( n + 1 ) ) < θ < 2 ( R n μ ) χ 1 + 1 α 2 2 ( 2 ( n + 1 ) ) ) .
The area for the confidence region based on Method 2 denoted by Area2 is obtained as follows:
Area 2 = L 2 U 2 2 ( R n μ ) ( 1 χ 1 + 1 α 2 2 ( 2 ( n + 1 ) ) 1 χ 1 1 α 2 2 ( 2 ( n + 1 ) ) ) d μ = ( R n μ ) 2 ( 1 χ 1 + 1 α 2 2 ( 2 ( n + 1 ) ) 1 χ 1 1 α 2 2 ( 2 ( n + 1 ) ) ) | U 2 L 2 = ( ( R n L 2 ) 2 ( R n U 2 ) 2 ) ( 1 χ 1 + 1 α 2 2 ( 2 ( n + 1 ) ) 1 χ 1 1 α 2 2 ( 2 ( n + 1 ) ) ) ,
where U 2 = R 0 F 1 + 1 α 2 ( 2 , 2 n ) R n R 0 n and L 2 = R 0 F 1 1 α 2 ( 2 , 2 n ) R n R 0 n . □

3. Simulation Study

Using Proposition 1 and Theorems 1–2 (Methods 1–2), the simulated average confidence length and confidence area with the nominal confidence coefficient 1-α = 0.90, 0.95 for n = 1(1)10(10)30, 60 are obtained in Table 1 based on 5000 simulation runs. From Table 1, we found that the confidence length and confidence area are decreasing when sample size n is increasing for fixed 1-α = 0.90, 0.95 and increasing when confidence level 1-α is increasing for fixed n. As for the confidence region, Method 2 has better performance than Method 1 when n 2 for 1-α = 0.90 and when n 3 for 1-α = 0.95 in terms of shorter confidence area. Therefore, Method 2 is recommended for use for the confidence region of two parameters when n 2 for 1-α = 0.90 and when n 3 for 1-α = 0.95 and Method 1 is recommended for use only when n = 1 for 1-α = 0.90 when n = 1,2 for 1-α = 0.95. We also found that the simulation results indicate that all proposed methods can reach the nominal confidence coefficient.

4. A Biometrical Example

In Proschan [15], the data for times between successive failures of air conditioning equipment in a Boeing 70 airplane are used to illustrate the proposed methods in Proposition 1 and Theorems 1–2. The data are 57,48,74,29,12,70,21,29, 326,59,27,153,26, 386,502. It is assumed that the data fit an exponential distribution. The sequence of upper record values are { R 0 , R 1 , R 2 , R 3 , R 4 } = {57,74,326,386,502}.
By Proposition 1, the 95% confidence interval for θ is obtained as (50.75695, 408.30731) with confidence length 357.5504. By Theorem 1 (Method 1), the 95% joint confidence region for θ and μ is given by:
( 57 0.01274 θ < μ < 57 4.36929 θ , 45.77403 < θ < 503.66646 )
with area 548,019.8.
By Theorem 2 (Method 2), the 95% confidence region is given by ( 824.628 < μ < 55.58029 , 2 ( 504 μ ) 22.52085 < θ < 2 ( 504 μ ) 2.716047 ) with area 505,305.9. The confidence regions for two parameters obtained by Method 1 and Method 2 are plotted in Figure 1.
From Figure 1, the users can see that the area obtained by Method 2 is smaller than the one obtained by Method 1. Therefore, Method 2 is recommended for the construction of the confidence region of two parameters rather than Method 1 for this example.

5. Conclusions

This paper proposed the confidence interval for the scale parameter θ and two methods for the development of the confidence region of θ and μ for the two-parameter exponential distribution based on the upper record value data. From the simulation comparison study, we recommend Method 2 rather than Method 1 for the construction of the confidence region for two parameters under most cases. We also found that all proposed methods can reach the nominal confidence coefficient. At last, we employed one biometrical example to illustrate the application of the proposed interval estimation for two parameters in this paper.

Funding

This research and the APC are funded by [National Science and Technology Council, Taiwan] NSTC 111-2118-M-032-003-MY2.

Data Availability Statement

The data presented in this study are openly available in Proschan [15].

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Wu, S.F. A modified one stage multiple comparison procedure of exponential location parameters with the control under heteroscedasticity. Commun. Stat.-Theory Methods 2021. [Google Scholar] [CrossRef]
  2. Johnson, N.L.; Kotz, S. Continuous Univariate Distributions; John Wiley & Sons, Inc.: New York, NY, USA, 1970. [Google Scholar]
  3. Bain, L.J. Statistical Analysis of Reliability and Life Testing Models; Marcer Dekker: New York, NY, USA, 1978. [Google Scholar]
  4. Lawless, J.F. Statistical Models and Methods for Lifetime Data; John Wiley & Sons, Inc.: New York, NY, USA, 1982. [Google Scholar]
  5. Zelen, M. Application of exponential models to problems in cancer research. J. R. Stat. Soc. 1966, 129, 368–398. [Google Scholar] [CrossRef]
  6. Shafiq, A.; Lone, S.A.; Sindhu, T.N.; Khatib, Y.E.; Al-Mdallal, Q.M.; Muhammad, T. A new modified Kies Fréchet distribution: Applications of mortality rate of COVID-19. Results Physics 2021, 28, 104638. [Google Scholar] [CrossRef] [PubMed]
  7. El-Khatib, Y.; Hatemi-J, A. Option valuation and hedging in markets with a crunch. J. Econ. Stud. 2017, 44, 801–815. [Google Scholar] [CrossRef]
  8. Alzaatreh, A.; Famoye, F.; Lee, C. Weibull-Pareto Distribution and Its Applications. Commun. Stat.-Theory Methods 2013, 42, 1673–1691. [Google Scholar] [CrossRef]
  9. Al-Hussaini, E.K.; Ahmad, A.A. On Bayesian Interval Prediction of Future Records. Test 2003, 12, 79–99. [Google Scholar] [CrossRef]
  10. Asgharzadeh, A.; Abdi, M.; Kuş, C. Interval estimation for the two-parameter Pareto distribution based on record. Selçuk J. Appl. Math. 2011, 149–161. [Google Scholar]
  11. Jana, N.; Bera, S. Interval estimation of multicomponent stress–strength reliability based on inverse Weibull distribution. Math. Comput. Simul. 2022, 191, 95–119. [Google Scholar] [CrossRef]
  12. Wu, S.F. Bayesian interval estimation for the two-parameter exponential distribution based on the right type II censored sample. Symmetry 2022, 14, 352. [Google Scholar] [CrossRef]
  13. Wu, S.F. Interval estimation for the two-parameter exponential distribution under progressive type II censoring on the Bayesian approach. Symmetry 2022, 14, 808. [Google Scholar] [CrossRef]
  14. Arnold, B.C.; Balakrishnan, N.; Nagaraja, H.N. Records; John Wiley & Sons, Inc.: New York, NY, USA, 1998. [Google Scholar]
  15. Proschan, F. Theoretical explanation of observed decreasing failure rate. Technometrics 1963, 3, 375–383. [Google Scholar] [CrossRef]
Figure 1. As shown, 95% confidence region for the example under Method 1 and Method 2.
Figure 1. As shown, 95% confidence region for the example under Method 1 and Method 2.
Symmetry 14 01906 g001
Table 1. The average length and area for the interval estimation of the exponential distribution with ( μ , θ ) = (0,1) under 1-α = 0.90 and 0.95.
Table 1. The average length and area for the interval estimation of the exponential distribution with ( μ , θ ) = (0,1) under 1-α = 0.90 and 0.95.
1-α = 0.901-α = 0.95
nLengthMethod 1Method 2LengthMethod 1Method 2
118.94425170.4635672.39440.106928621.4838499.85
25.2178181.8382167.77427.8828471.5438474.2633
33.214255.478150.04544.4151113.3213106.0673
42.416029.636526.85693.199154.392150.1530
51.995219.861418.15982.600335.361332.5916
61.745415.358514.16632.202824.992823.1242
71.538712.097111.24441.949719.723918.3398
81.417110.44449.77251.753616.268115.2019
91.29458.94088.41281.598913.753312.9117
101.20207.85457.42621.498012.304711.6006
150.93395.31725.1091.14497.93187.5936
200.79424.24134.11220.96256.1535.9454
300.62993.12363.05790.76174.48114.3750
600.43652.02512.00310.52192.82272.7876
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wu, S.-F. Interval Estimation for the Two-Parameter Exponential Distribution Based on the Upper Record Values. Symmetry 2022, 14, 1906. https://doi.org/10.3390/sym14091906

AMA Style

Wu S-F. Interval Estimation for the Two-Parameter Exponential Distribution Based on the Upper Record Values. Symmetry. 2022; 14(9):1906. https://doi.org/10.3390/sym14091906

Chicago/Turabian Style

Wu, Shu-Fei. 2022. "Interval Estimation for the Two-Parameter Exponential Distribution Based on the Upper Record Values" Symmetry 14, no. 9: 1906. https://doi.org/10.3390/sym14091906

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop