Maximum Pseudo-Likelihood Estimation of Copula Models and Moments of Order Statistics

Dias, Alexandra

doi:10.3390/risks12010015

by

Alexandra Dias

School for Business and Society, University of York, York YO10 5DD, UK

Risks 2024, 12(1), 15; https://doi.org/10.3390/risks12010015

Submission received: 20 December 2023 / Revised: 15 January 2024 / Accepted: 16 January 2024 / Published: 18 January 2024

(This article belongs to the Special Issue Interplay between Financial and Actuarial Mathematics II)

Abstract

It has been shown that, despite being consistent and in some cases efficient, maximum pseudo-likelihood (MPL) estimation for copula models overestimates the level of dependence, especially for small samples with a low level of dependence. This is especially relevant in finance and insurance applications when data are scarce. We show that the canonical MPL method uses the mean of order statistics, and we propose to use the median or the mode instead. We show that the MPL estimators proposed are consistent and asymptotically normal. In a simulation study, we compare the finite sample performance of the proposed estimators with that of the original MPL and the inversion method estimators based on Kendall’s tau and Spearman’s rho. In our results, the modified MPL estimators, especially the one based on the mode of the order statistics, have a better finite sample performance both in terms of bias and mean square error. An application to general insurance data shows that the level of dependence estimated between different products can vary substantially with the estimation method used.

Keywords:

copula model; finite sample properties; general insurance; rank statistics; relative efficiency; semiparametric estimation

1. Introduction

Copula models are widely used in insurance and finance for pricing, hedging, and risk management, as well as in health sciences, hydrology, and other applied sciences; see e.g., Chen and Guo (2019); Czado (2019); Joe (2014); Kularatne et al. (2021); McNeil et al. (2015). Such wide applicability has triggered important contributions both in probabilistic and statistical aspects of copula models; see Durante and Sempi (2015); Joe (2014) and references therein. The estimation of copula model parameters from observed data appears, at first, to be a straightforward inference exercise. However, it has, in fact, significant pitfalls. The estimation of copula models without fully understanding the properties of the estimators can have undesirable consequences such as, among others, the overestimation of the dependence in the data; see the discussion in Fermanian and Scaillet (2005). One of the difficulties is that the distribution of the univariate margins is, in principle, unknown. Estimation procedures have been proposed to circumvent this problem, but no estimation procedure seems to be clearly the best. In fact, Kojadinovic and Yan (2010) show that the performance of commonly used estimation methods depends on the size of the sample and the strength of the dependence in the data. In finance, large samples of data are often available but the same does not happen in other applications where data are, by their nature, limited. This is the case, for instance, if the observations naturally occur at a low frequency or the population of interest is, by itself, small. Here, we propose new semiparametric estimators and show that the level of dependence obtained can differ substantially in a general insurance case where the data are available only quarterly.

Sklar’s representation theorem (Sklar 1959) characterizes a so-called copula model for a random vector

X = (X_{1}, \dots, X_{d})

with multivariate distribution H by a copula function, C, and univariate marginal cumulative distribution functions (cdfs)

F_{i} (x_{i}) = P (X_{i} \leq x_{i})

for

i = 1, \dots, d

, as

H (x) = C [F_{1} (x_{1}), \dots, F_{d} (x_{d})], x \in R^{d} .

A copula

C : {[0, 1]}^{d} \to [0, 1]

is then a multivariate distribution with standard uniform univariate margins. If the univariate marginal cdfs are continuous, then the copula is unique. The versatility of copula models is apparent from Sklar’s representation theorem. By combining different distributions for the univariate margins with copula functions, a variety of models can be easily specified. Such flexibility can have a cost when it comes to the task of estimating the copula model from observed data.

Assuming that the univariate margins and copula all belong to absolutely continuous families of distributions, the obvious estimation method is maximum likelihood (ML). By default, the ML estimation of a model’s parameters is performed in one step. However, mainly because of numerical problems that typically arise when optimizing a likelihood function with several parameters and possibly multi-dimensional integrals, a two-step maximum likelihood estimation method has been introduced: the so-called inference functions for margins (IFM) method of Joe and Xu (1996) and Joe (1997). The IFM method consists of first estimating the parameters of each univariate marginal distribution independently, and then estimating the dependence parameters from the multivariate log-likelihood with the marginal parameter estimates held fixed. Although the two-step IFM method can suffer from some loss of efficiency in cases of strong dependence, it still enjoys strong asymptotic efficiency, as shown by Joe (2005). A further advantage of a two-step estimation method is that the estimation of the marginal parameters is not affected by a possible misspecification of the multivariate copula model. A fundamental challenge with ML estimation, whether the one-step or the two-step procedure, is to ensure the correct choice of distributions for the univariate margins. This is especially relevant if we are particularly interested in modelling the dependence structure of the random vector. Through a simulation study, Fermanian and Scaillet (2005) find that misspecification of the margins may translate into a severe positive bias and high mean square errors in the estimation of the copula parameters, leading to an overestimation of the degree of dependence in the data. An extensive simulation study by Kim et al. (2007) shows that the one-step ML and the IFM methods are indeed nonrobust against misspecification of the marginal distributions. Kim et al. (2007) also show that, when the margins are unknown, it is better to use the maximum pseudo-likelihood (MPL) estimation procedure studied in Genest et al. (1995) and Shih and Louis (1995) in order to avoid the consequences of misspecification. Semiparametric estimation in copula models is indeed widely used, even in nonstationary cases such as climate data; see Nasri et al. (2019).

For a random sample

{(X_{1, i}, \dots, X_{d, i}) : i = 1, \dots, n}

from distribution

H (x) = C_{θ} [F_{1} (x_{1}), \dots, F_{d} (x_{d})]

, the MPL is a semiparametric estimation procedure consisting of selecting the parameter

\hat{θ}

that maximizes the log pseudo-likelihood function

\sum_{i = 1}^{n} log c_{θ} [{\hat{F}}_{1, n} (X_{1, i}), \dots, {\hat{F}}_{d, n} (X_{d, i})],

where

c_{θ}

is the probability density function (pdf) of the copula family

{C_{θ}}

, and the estimator of the jth univariate marginal distribution,

{\hat{F}}_{j, n}

, is a rescaled empirical distribution function of the jth variable. Further asymptotic properties of the MPL estimator have been studied in Klaassen and Wellner (1997) and Genest and Werker (2002). The finite sample properties of the MPL estimator have been studied in Kojadinovic and Yan (2010), who compare the MPL estimator with the two method-of-moments (MM) estimators based on the inversion of Spearman’s rho and Kendall’s tau coefficients. The MM estimators have been studied by Genest (1987); Genest and Rivest (1993); Oakes (1982). Kojadinovic and Yan (2010) found that the MPL estimator performs better than the MM estimators in terms of mean squared error, except for small, weakly dependent samples. Using the MM procedure as an alternative to MPL for small weakly dependent samples is not the best solution, as we will demonstrate in this study. Instead, we propose to modify the canonical MPL estimator by using consistent nonparametric estimators of the univariate marginal distributions that differ from the one used since Genest et al. (1995).

After deriving theoretically the consistency and asymptotic normality of the proposed MPL estimators, we study their small sample properties via a simulation study. We compare three alternative MPL estimators with the canonical MPL, and with the MM estimators based on the inversion of Kendall’s tau and Spearman’s rho. We find that changing the nonparametric estimator of the univariate margins indeed improves the finite sample performance of the MPL estimator, in terms of bias and mean squared error, while preserving its asymptotic properties. To confirm the large sample performance of the estimators we evaluate their relative efficiency via simulation.

Instead of proposing to use alternative nonparametric estimators of the univariate margins, another possibility would be to obtain a bias correction function for the canonical MPL estimator. Such a bias correction function would depend not only on the copula parameter and sample size, but also, importantly, on the specific copula itself. The approach that we propose to use here has the advantage of not depending on the specific copula. A bias reduction correction can also have the effect of increasing the variance of the estimator and possibly the mean square error (see, e.g., Søbye et al. 2021). That does not happen with the estimators we propose here.

Another possible approach is to estimate the multivariate model nonparametrically using empirical copulas, the asymptotic properties of which can be found in Genest and Segers (2010) and Segers et al. (2017). See also, e.g., Yang et al. (2020) on the nonparametric estimation of copula regression models. Naturally, empirical copula model estimation requires larger samples. Especially in applications where the size of the sample available is limited, there might be enough data to estimate the univariate marginal distribution functions nonparametrically but not enough data to estimate the empirical copula. That is one of the reasons why the semiparametric method from Genest et al. (1995) has become commonly used in applications.

Although we chose to compare the MPL estimators proposed here with the MM estimators, as in Kojadinovic and Yan (2010), other semiparametric estimators have been introduced in the literature. Tsukahara (2005) studied two semiparametric estimation procedures and concluded that these, overall, have a higher mean squared error when compared with the canonical MPL estimator. Chen et al. (2006) introduced and studied the properties of an MPL estimator where the unknown marginal density functions are approximated by linear combinations of finite-dimensional known basis functions with increasing complexity called sieves. They find that for weak dependence the sieve method performs comparably to the canonical MPL in finite samples. Given these results, comparing the proposed estimators with the canonical MPL and the MM estimators seems an appropriate choice.

In Section 2 of this article, we introduce the canonical MPL estimation procedure and its statistical properties. It is our starting point, as we benchmark the MPL estimators that we propose against the canonical MPL estimator. Section 3 motivates and proposes the new MPL estimators. Section 4 addresses their asymptotic properties. In Section 5, we show that the finite sample properties of the MPL estimators depend on the copula model. Section 6 summarizes the MM estimators used in the simulation study. In Section 7, we report and discuss the results of the simulation study where we compare the small sample performance of the six estimators. We apply our results to a case of general insurance data in Section 8. Section 9 concludes the paper. Proofs and tables with simulation results are given in Appendix A and Appendix B.

2. The Canonical MPL Estimator

Given a multivariate copula model with univariate marginal absolutely continuous distribution functions, the so-called canonical maximum pseudo-likelihood method consists of estimating univariate marginal distributions

{\hat{F}}_{1}, \dots, {\hat{F}}_{d}

from the marginal empirical distributions, as a first step assuming that the univariate variables are independent, and then selecting the copula parameter that maximizes the log pseudo-likelihood function,

\sum_{i = 1}^{n} log c_{θ} [{\hat{F}}_{1} (X_{1, i}), \dots, {\hat{F}}_{d} (X_{d, i})] = \sum_{i = 1}^{n} log c_{θ} ({\hat{U}}_{1, i}, \dots, {\hat{U}}_{d, i}) .

(1)

In the canonical MPL estimation, the so-called pseudo-observations

{\hat{U}}_{i} = ({\hat{U}}_{1, i}, \dots, {\hat{U}}_{d, i})

are obtained from

X_{i} = (X_{1, i}, \dots, X_{d, i})

as

{\hat{U}}_{i} = (\frac{n}{n + 1} F_{1, n} (X_{1, i}), \dots, \frac{n}{n + 1} F_{d, n} (X_{d, i})), \quad i = 1, \dots, n,

(2)

where

F_{j, n}

is the empirical cumulative distribution function

F_{j, n} (x) = \frac{1}{n} \sum_{k = 1}^{n} 1 (X_{j, k} \leq x)

for

j = 1, \dots, d

, and

1 (A)

denotes the indicator function of event A. The rescaling of the empirical distribution function by the factor

n / (n + 1)

in expression (2) is made to avoid computational problems on the boundary of

{[0, 1]}^{d}

. The use of the empirical distribution function to transform the margins to uniform can be traced back to Genest et al. (1995). The large sample properties of the canonical MPL estimator were studied by Genest et al. (1995) and Shih and Louis (1995), who showed that this estimator is consistent and asymptotically normal, and efficient at independence. Later, Genest and Werker (2002) argue that the latter is rather the exception than the rule and identify two cases of semiparametric efficiency. These are the independence and the normal copula, for which the result could already be found in Klaassen and Wellner (1997).
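
As an illustration of the two-step procedure described in this section, the following is a minimal Python sketch of the canonical MPL for a bivariate one-parameter copula. It is not the implementation used in this article. The Clayton family, the golden-section search over a fixed interval, the conditional-inversion sampler, and all function names are assumptions made for this example only.

```python
import math
import random


def pseudo_observations(x):
    """Return n/(n+1) * F_n(x_i), i.e. rank_i/(n+1), assuming no ties."""
    n = len(x)
    order = sorted(range(n), key=lambda i: x[i])
    u = [0.0] * n
    for rank, i in enumerate(order, start=1):
        u[i] = rank / (n + 1)
    return u


def clayton_log_density(theta, u, v):
    """Log density of the bivariate Clayton copula (assumed family, theta > 0)."""
    s = u ** (-theta) + v ** (-theta) - 1.0
    return (math.log1p(theta)
            - (1.0 + theta) * (math.log(u) + math.log(v))
            - (2.0 + 1.0 / theta) * math.log(s))


def log_pseudo_likelihood(theta, u1, u2):
    """Log pseudo-likelihood (1) evaluated at the pseudo-observations."""
    return sum(clayton_log_density(theta, a, b) for a, b in zip(u1, u2))


def golden_section_max(f, lo, hi, tol=1e-6):
    """Maximize a unimodal function f on [lo, hi] by golden-section search."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    fc, fd = f(c), f(d)
    while b - a > tol:
        if fc > fd:
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = f(c)
        else:
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = f(d)
    return 0.5 * (a + b)


def fit_clayton_mpl(x1, x2, lo=0.01, hi=20.0):
    """Canonical MPL estimate of theta; the search interval is an assumption."""
    u1, u2 = pseudo_observations(x1), pseudo_observations(x2)
    return golden_section_max(lambda t: log_pseudo_likelihood(t, u1, u2), lo, hi)


def simulate_clayton(n, theta, seed=0):
    """Draw n pairs with a Clayton copula by conditional inversion."""
    rng = random.Random(seed)
    x1, x2 = [], []
    for _ in range(n):
        u = 1.0 - rng.random()  # uniform on (0, 1]
        w = 1.0 - rng.random()
        v = (u ** (-theta) * (w ** (-theta / (1.0 + theta)) - 1.0) + 1.0) ** (-1.0 / theta)
        # Increasing transforms of the margins leave the ranks unchanged.
        x1.append(math.exp(3.0 * u))
        x2.append(v ** 2)
    return x1, x2


x1, x2 = simulate_clayton(n=500, theta=2.0, seed=1)
print(round(fit_clayton_mpl(x1, x2), 3))
```

Because only the ranks enter the pseudo-observations, the estimate does not change under increasing transformations of either margin, which is why the arbitrary marginal transforms in `simulate_clayton` have no effect on the fit.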

3. Alternative MPL Estimators

The semiparametric canonical MPL estimation procedure hinges on a nonparametric estimator of each marginal univariate distribution

F_{j}

for

j = 1, \dots, d

. As introduced in the previous section, this nonparametric estimator is the rescaled empirical distribution function

n / (n + 1) F_{j, n}

. Here, we motivate and propose the use of alternative nonparametric estimators for the univariate margins in the MPL estimation procedure.

In the implementation of the canonical MPL method, for each univariate margin

j = 1, \dots, d

, the pseudo-observations

{\hat{U}}_{j, 1}, \dots, {\hat{U}}_{j, n}

, defined in (2), are calculated as

{\hat{U}}_{j, i} = \sum_{k = 1}^{n} 1 (X_{j, k} \leq X_{j, i}) / (n + 1) = R_{j, i} / (n + 1),

(3)

where

R_{j, i}

is the rank of

X_{j, i}

among

X_{j, 1}, \dots, X_{j, n}

.

Note that MPL estimation presents no scalability problems. The estimators only require ranking the observations one margin at a time, so they can be easily implemented for high-dimensional data sets.

3.1. Pseudo-Observations and Moments of Order Statistics

To motivate the new estimators, first we show the relation between the pseudo-observations

{\hat{U}}_{j, i} = R_{j, i} / (n + 1)

and order statistics. Assume that

X_{1}, X_{2}, \dots, X_{n}

are n independent and identically distributed (iid) univariate random variables. Arrange these in ascending order of magnitude as

X_{(1)} \leq X_{(2)} \leq \dots \leq X_{(n)}

, and call

X_{(r)}

the rth order statistic, for

r = 1, 2, \dots, n

.

Proposition 1.

Consider a random sample

(X_{1}, X_{2}, \dots, X_{n})

from a univariate distribution with continuous cdf F and the corresponding transformed vector

(U_{1}, U_{2}, \dots, U_{n})

where

U_{i} = F (X_{i})

for

i = 1, \dots, n

. If we define the function

a (r) = E [U_{(r)}]

for

1 \leq r \leq n

, then each pseudo-observation

{\hat{U}}_{i} = \frac{R_{i}}{n + 1}

, for

i = 1, \dots, n

, can be obtained as

{\hat{U}}_{i} = a (R_{i})

, where

R_{i}

is the rank of

X_{i}

among

X_{1}, X_{2}, \dots, X_{n}

.

The proof of Proposition 1 can be found in Appendix A. The conclusion is that the pseudo-observations in (3), proposed by Genest et al. (1995), can be obtained from the expected value of the order statistics defined as a function of the rank of the corresponding sample observations, i.e.,

({\hat{U}}_{1, i}, \dots, {\hat{U}}_{d, i}) = (a (R_{1, i}), \dots, a (R_{d, i})) = (\frac{R_{1, i}}{n + 1}, \dots, \frac{R_{d, i}}{n + 1}) for i = 1, \dots, n,

(4)

where

a (r) = E [U_{(r)}] = E [F {(X)}_{(r)}] = \frac{r}{n + 1}

, for

1 \leq r \leq n

, and

R_{j, i}

is the rank of

X_{j, i}

within

X_{j, 1}, \dots, X_{j, n}

.

Note that, as we are assuming that the random variable X is continuous and its cdf F is an increasing function, we have that the rank of

X_{i}

among

X_{1}, \dots, X_{n}

is the same as the rank of

F (X_{i})

among

F (X_{1}), \dots, F (X_{n})

. We remark here that Clayton and Cuzick (1985) also used expected order statistics from unit exponential distributions in the estimation of the dependence parameter of a bivariate hazards model.
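
The identity a(r) = r / (n + 1) can be checked numerically. The sketch below is a simple Monte Carlo illustration, not part of the proof: it averages the sorted values of repeated standard uniform samples and compares them with r / (n + 1). The sample size, number of replications, seed, and function name are arbitrary choices for this example.

```python
import random


def mean_order_statistics(n, n_rep, seed=0):
    """Monte Carlo estimate of E[U_(r)], r = 1..n, for standard uniform samples."""
    rng = random.Random(seed)
    totals = [0.0] * n
    for _ in range(n_rep):
        u = sorted(rng.random() for _ in range(n))
        for r in range(n):
            totals[r] += u[r]
    return [t / n_rep for t in totals]


n = 10
estimates = mean_order_statistics(n, n_rep=20000)
for r in (1, 5, 10):
    # Simulated mean of the r-th order statistic next to r / (n + 1).
    print(r, round(estimates[r - 1], 3), round(r / (n + 1), 3))
```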

At this point, it is important to recall that our goal is to improve the performance of the canonical MPL which uses the pseudo-observations computed as in (4). With this objective in mind, we explore the properties of the pseudo-observations in (4) inherited from the fact that these are obtained from expected values of order statistics, and how this affects the performance of the canonical MPL estimator.

If the random variable X has cdf F, then the distribution of the order statistics

F {(X)}_{(r)}

is skewed (except for

r = (n + 1) / 2

if n is odd), especially when r is closer to 1 or n. Given that the expected value can be highly influenced by the skewness of the distribution, it is then possible that the properties of the pseudo-observations in (4) are affected by the skewness of

F {(X)}_{(r)}

and consequently also the canonical MPL estimator. Figure 1 displays the pdf of the order statistics

F {(X)}_{(49)}

and

F {(X)}_{(196)}

in

(0.75, 1)

, for samples of size

n = 50

and

n = 200

respectively. The strong skewness of the pdf implies that the mean is further away from the peak of the distribution than the median and, obviously, the mode. The pseudo-observations calculated using the mean of the order statistics might suffer from the skewness of the pdf. Hence, we propose to use the median or the mode of the order statistics, instead of the mean, to compute the pseudo-observations, and we study their effect on the performance of the new MPL estimators obtained from (1). From Figure 1 we can see that the skewness of the order statistics pdf is higher for smaller samples. We will see that, in fact, the smaller the sample, the larger the improvement of the performance of the estimator when using the median or the mode rather than the mean of the order statistics.
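
To make the comparison in Figure 1 concrete, the following sketch uses the standard fact that the rth order statistic of n standard uniform variables follows a Beta(r, n - r + 1) distribution, together with the closed-form mean, mode, and skewness of the beta distribution. The function name is hypothetical, and the mode expression is only meaningful for 1 < r < n.

```python
import math


def order_stat_summary(r, n):
    """Mean, mode and skewness of U_(r) ~ Beta(a, b) with a = r, b = n - r + 1."""
    a, b = r, n - r + 1
    mean = a / (a + b)                # equals r / (n + 1)
    mode = (a - 1) / (a + b - 2)      # equals (r - 1) / (n - 1), for 1 < r < n
    skew = 2.0 * (b - a) * math.sqrt(a + b + 1) / ((a + b + 2) * math.sqrt(a * b))
    return mean, mode, skew


for r, n in ((49, 50), (196, 200)):
    mean, mode, skew = order_stat_summary(r, n)
    print(f"n={n}, r={r}: mean={mean:.4f}, mode={mode:.4f}, skewness={skew:.3f}")
```

In both cases the skewness is negative and the mean lies below the mode, with the stronger skewness in the smaller sample.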

3.2. Pseudo-Observations and the Median of Order Statistics

We first propose to use the median of the rth order statistic as an alternative to using the mean of the order statistic. If the continuous random variable X has cdf F then

F (X)

is drawn from a standard uniform distribution and the median of the order statistic

F {(X)}_{(r)}

is

med (F {(X)}_{(r)}) = I_{1 / 2}^{[- 1]} (r, n - r + 1), for 1 \leq r \leq n,

where

I_{p} (a, b) = \int_{0}^{p} t^{a - 1} {(1 - t)}^{b - 1} d t / B (a, b)

is the regularised incomplete beta function. The computations can be made faster using the approximation (see Hyndman and Fan 1996; Kerman 2011) given by

med (F {(X)}_{(r)}) \approx \frac{r - \frac{1}{3}}{n + \frac{1}{3}}, for 1 \leq r \leq n .

Defining the function

g (r) = \frac{r - 1 / 3}{n + 1 / 3}

, the corresponding pseudo-observations for the estimation of the copula parameter via the pseudo-likelihood method are then

({\bar{U}}_{1, i}, \dots, {\bar{U}}_{d, i}) = (g (R_{1, i}), \dots, g (R_{d, i})) = (\frac{R_{1, i} - \frac{1}{3}}{n + \frac{1}{3}}, \dots, \frac{R_{d, i} - \frac{1}{3}}{n + \frac{1}{3}}), for 1 \leq i \leq n .

(5)

We will refer to the copula parameter estimation procedure consisting of using the pseudo-observations given by (5) in the log pseudo-likelihood function in (1) as the median MPL.
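
The quality of the approximation g(r) can be examined with a short sketch. For integer r and n, the regularised incomplete beta function I_p(r, n - r + 1) equals the probability that at least r of n standard uniform variables fall below p, so the exact median can be found by bisection on a binomial tail sum. The function names and the tolerance are assumptions made for this illustration.

```python
import math


def order_stat_cdf(p, r, n):
    """P(U_(r) <= p) = P(at least r of n uniforms are <= p) = I_p(r, n - r + 1)."""
    return sum(math.comb(n, k) * p ** k * (1.0 - p) ** (n - k)
               for k in range(r, n + 1))


def exact_median(r, n, tol=1e-12):
    """Median of U_(r), found by bisection on the cdf."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if order_stat_cdf(mid, r, n) < 0.5:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def approx_median(r, n):
    """Approximation g(r) = (r - 1/3) / (n + 1/3)."""
    return (r - 1.0 / 3.0) / (n + 1.0 / 3.0)


n = 50
worst = max(abs(exact_median(r, n) - approx_median(r, n)) for r in range(1, n + 1))
print(f"largest absolute error of g(r) for n={n}: {worst:.2e}")
```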

3.3. Pseudo-Observations and the Mode of Order Statistics

The second alternative we explore to compute the pseudo-observations is using the mode of the rth order statistic from a standard uniform distribution, which is given by

mode (F {(X)}_{(r)}) = \frac{r - 1}{n - 1} for 1 \leq r \leq n .

In this case, defining the function

h (r) = \frac{r - 1}{n - 1}

, the pseudo-observations are

(U_{1, i}^{*}, \dots, U_{d, i}^{*}) = (h (R_{1, i}), \dots, h (R_{d, i})) = (\frac{R_{1, i} - 1}{n - 1}, \dots, \frac{R_{d, i} - 1}{n - 1}), for 1 \leq i \leq n .

(6)

We will refer to the copula parameter estimation procedure consisting of using the pseudo-observations given by (6) in the log pseudo-likelihood function as the mode MPL.

For the minimum and the maximum in each margin, i.e., for

X_{j (1)}

and

X_{j (n)}

(

j = 1, \dots, d

), it is not possible to use the mode of the corresponding order statistic as pseudo-observations in the log pseudo-likelihood function because these would be zero and one, respectively, which lie on the boundary of the unit interval. In these cases, we use instead the mean of the order statistics

1 / (n + 1)

and

n / (n + 1)

as in the canonical MPL because this is our benchmark estimator.
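
The mode-based pseudo-observations in (6), with the boundary treatment just described, can be sketched as follows. The function name is hypothetical; the input is a list of ranks in one margin, and n > 1 is assumed.

```python
def mode_pseudo_observations(ranks, n):
    """Map ranks to h(r) = (r - 1) / (n - 1), except at the two extremes.

    For r = 1 and r = n the mode would be 0 and 1, so the mean of the
    order statistic, r / (n + 1), is used instead.
    """
    scores = []
    for r in ranks:
        if r == 1:
            scores.append(1 / (n + 1))
        elif r == n:
            scores.append(n / (n + 1))
        else:
            scores.append((r - 1) / (n - 1))
    return scores


print(mode_pseudo_observations([1, 2, 3, 4, 5], n=5))
```

Since 1 / (n + 1) < 1 / (n - 1) for n > 1, the substituted boundary values keep the pseudo-observations strictly increasing in the rank.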

At this point we would like to remark the following. Instead of calculating the pseudo-observations as the mean, median or mode of the order statistics

F {(X)}_{(r)}

, we could consider using

F (E (X_{(r)}))

,

F (median (X_{(r)}))

or

F (mode (X_{(r)}))

. If F is strictly monotonic then

F (median (X_{(r)})) = median (F {(X)}_{(r)})

, which is one of the proposed estimators above. The pseudo-observations, calculated as

F (E (X_{(r)}))

or

F (mode (X_{(r)}))

, depend on the distribution F. As we want to assume that F is unknown, we do not consider these alternatives.

3.4. Midpoint and Pseudo-Observations

In the canonical MPL estimator, the rescaling of the empirical distribution function by the factor

n / (n + 1)

is motivated (starting with Genest et al. 1995) by the need to keep the pseudo-observations away from the boundary of the interval

(0, 1)

. To that end, the adjustment to the empirical distribution function

F_{j, n}

can rather be carried out using

F_{j, n} (x) - \frac{1}{2 n} = \frac{1}{n} [\sum_{i = 1}^{n} 1 (X_{j, i} \leq x) - \frac{1}{2}] .

Here, the additive term

- 1 / (2 n)

ensures that the pseudo-observations are strictly in the interval

(0, 1)

. This approach, introduced by Hazen (1914), is popular with hydrologists and it is also used by Joe (2014) in the process of converting sample observations to normal scores. We include it in our study as an alternative to calculating the pseudo-observations, which are then given by

({\tilde{U}}_{1, i}, \dots, {\tilde{U}}_{d, i}) = ({\hat{F}}_{1} (X_{1, i}), \dots, {\hat{F}}_{d} (X_{d, i})) = (\frac{R_{1, i} - 1 / 2}{n}, \dots, \frac{R_{d, i} - 1 / 2}{n}), for 1 \leq i \leq n .

(7)

We will refer to the copula parameter estimation procedure consisting of using the pseudo-observations given by (7) in the log pseudo-likelihood function in (1) as the midpoint MPL.
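
A direct transcription of the shifted empirical distribution function behind (7) is sketched below. It counts observations literally rather than sorting, which is quadratic in n but mirrors the formula; the function name is hypothetical and ties are assumed absent.

```python
def midpoint_pseudo_observations(x):
    """Return F_n(x_i) - 1/(2n) = (rank_i - 1/2) / n for each observation."""
    n = len(x)
    return [(sum(1 for xk in x if xk <= xi) - 0.5) / n for xi in x]


print(midpoint_pseudo_observations([3.2, 1.5, 2.7]))
```

The resulting values are equally spaced by 1 / n and symmetric about 1 / 2, with the smallest at 1 / (2n) and the largest at 1 - 1 / (2n).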

4. Large Sample Properties of the MPL Estimators

Before moving on to the small-sample performance simulation study, we consider the consistency and asymptotic normality of the different estimators. As already pointed out by other authors (e.g., Genest et al. 1995; Kojadinovic and Yan 2010), using

{\hat{U}}_{j, i} = R_{j, i} / (n + 1)

as pseudo-observations in the log pseudo-likelihood function in (1) corresponds to multiplying

n / (n + 1)

by the empirical distribution of the univariate jth variable. Each of the estimators of the univariate marginal distribution functions

F_{j}

used above can be written as a function of the empirical distribution estimator

F_{j, n}

for the corresponding variable. Given that the empirical distribution is a consistent estimator, as an immediate consequence of the strong law of large numbers, the consistency of the univariate cdf estimators used follows.

Genest et al. (1995) show the consistency and asymptotic normality of the canonical MPL estimator building on the work of Ruymgaart et al. (1972). In this section, we generalize their result for the median, mode, and midpoint MPL estimators proposed here. For simplicity of exposition, hereafter we consider bivariate distributions

H_{θ} (x_{1}, x_{2})

with copula

C_{θ}

, real parameter

θ

, and continuous univariate cdfs

F_{1}

and

F_{2}

, such that

H_{θ} (x_{1}, x_{2}) = C_{θ} [F_{1} (x_{1}), F_{2} (x_{2})]

,

(x_{1}, x_{2}) \in R^{2}

. The results obtained can be generalised to the multivariate case.

The regularity conditions for the consistency and asymptotic normality of the MPL estimators are similar to those underlying the maximum likelihood estimator. Given a random sample

{(X_{1, i}, X_{2, i}) : i = 1, \dots, n}

from distribution

H_{θ}

, the MPL estimate

{\hat{θ}}_{n}

takes the value that maximizes the log pseudo-likelihood function (1)

L (θ) = \sum_{i = 1}^{n} log c_{θ} [{\hat{F}}_{1} (X_{1, i}), {\hat{F}}_{2} (X_{2, i})] .

Let

l (θ, u_{1}, u_{2}) = log [c_{θ} (u_{1}, u_{2})]

. The semiparametric estimate

{\hat{θ}}_{n}

solves the equation

\frac{\partial}{\partial θ} L (θ) = \sum_{i = 1}^{n} l_{θ} [θ, {\hat{F}}_{1} (X_{1, i}), {\hat{F}}_{2} (X_{2, i})] = 0,

(8)

with

l_{θ}

denoting the partial derivative of l with respect to

θ

. To derive an expression for the semiparametric MPL estimator

{\hat{θ}}_{n}

we follow Genest et al. (1995) and start by expanding (8) in a Taylor series. As a result, we obtain

\frac{1}{n} \frac{\partial}{\partial θ} {L (θ) |}_{θ = {\hat{θ}}_{n}} = 0 \approx A_{n} - ({\hat{θ}}_{n} - θ) B_{n},

where

A_{n} = \frac{1}{n} \sum_{i = 1}^{n} l_{θ} [θ, {\hat{F}}_{1} (X_{1, i}), {\hat{F}}_{2} (X_{2, i})], \quad B_{n} = - \frac{1}{n} \sum_{i = 1}^{n} l_{θ, θ} [θ, {\hat{F}}_{1} (X_{1, i}), {\hat{F}}_{2} (X_{2, i})]

and

l_{θ, θ}

denotes the second derivative of l with respect to

θ

. Hence, a standardised version of

{\hat{θ}}_{n}

is

n^{1 / 2} ({\hat{θ}}_{n} - θ) \approx n^{1 / 2} A_{n} / B_{n},

(9)

whose large sample properties relate to those of multivariate rank statistics of the form

R_{n} = \frac{1}{n} \sum_{k = 1}^{n} J [{\hat{F}}_{1} (X_{1, k}), {\hat{F}}_{2} (X_{2, k})],

under the following assumptions.

Assumption 1.

J (u_{1}, u_{2})

is a continuous function from

{(0, 1)}^{2}

into

R

such that

μ = E [J \{F_{1} (X_{1}), F_{2} (X_{2})\}] = \int J (u_{1}, u_{2}) d C (u_{1}, u_{2})

exists.

Assumption 2.

Define the function

r (u) = u (1 - u)

, on

(0, 1)

, let p and q be positive numbers satisfying

1 / p + 1 / q = 1

and

δ > 0

.

(i): $l_{θ, θ} (u_{1}, u_{2}) \leq M r {(u_{1})}^{a} r {(u_{2})}^{b}$ with M a positive constant, $a = (- 1 + δ) / p$ and $b = (- 1 + δ) / q$ ;
(ii): $l_{θ} (u_{1}, u_{2}) \leq M r {(u_{1})}^{a} r {(u_{2})}^{b}$ with M a positive constant, $a = (- 0.5 + δ) / p$ and $b = (- 0.5 + δ) / q$ , and $l_{θ}$ admits continuous partial derivatives $l_{θ, i} (u_{1}, u_{2}) = \partial l_{θ} (u_{1}, u_{2}) / \partial u_{i}$ on ${(0, 1)}^{2}$ such that $l_{θ, i} (u_{1}, u_{2}) \leq M r {(u_{1})}^{d_{i}} r {(u_{2})}^{d_{3 - i}}$ with $d_{1} = a - 1$ and $d_{2} = b$ .

The limiting behaviour of the MPL estimators can then be summarised as follows.

Proposition 2.

Under assumptions 1 and 2, each of the median, mode and midpoint MPL estimators

{\hat{θ}}_{n}

is consistent and

n^{1 / 2} ({\hat{θ}}_{n} - θ)

is asymptotically normal with variance

\frac{1}{β^{2}} \mathrm{var} \{l_{θ} [θ, F_{1} (X_{1}), F_{2} (X_{2})] + W_{1} (X_{1}) + W_{2} (X_{2})\},

where

β = - E [l_{θ, θ} \{θ, F_{1} (X_{1}), F_{2} (X_{2})\}]

and

W_{i} (X_{i}) = \int 1 [F_{i} (X_{i}) \leq u_{i}] l_{θ, i} (θ, u_{1}, u_{2}) c_{θ} (u_{1}, u_{2}) d u_{1} d u_{2},

with

1 (A)

denoting the indicator of A and

l_{θ, i} (θ, u_{1}, u_{2}) = \partial l_{θ} (θ, u_{1}, u_{2}) / \partial u_{i}

.

The proof of Proposition 2 and the necessary auxiliary results are given in Appendix A. The simulation study in Section 7 illustrates this result. For a multidimensional dependence parameter, or in a multivariate context, the previous results on the consistency and asymptotic normality of the modified MPL estimators can be extended as in Genest et al. (1995) following similar arguments.

5. Finite Sample Properties of the MPL Estimators

Consider the random sample

{(X_{1, i}, X_{2, i}) : i = 1, \dots, n}

of iid pairs from distribution

H_{θ} (x_{1}, x_{2})

. Let

R = (R_{1}, \dots, R_{n})

be the vector of ranks corresponding to

X_{1} = (X_{1, 1}, \dots, X_{1, n})

, and

Q = (Q_{1}, \dots, Q_{n})

the vector of ranks corresponding to

X_{2} = (X_{2, 1}, \dots, X_{2, n})

. The mean square error of

{\hat{θ}}_{n}

can be derived (at least approximated) from the moments of

A_{n}^{'} = n A_{n}

and

B_{n}^{'} = n B_{n}

and relation (9). Hence, we are interested in the properties of statistics

A_{n}^{'}

and

B_{n}^{'}

which are both of the form

J_{n} = \sum_{i = 1}^{n} J (R_{i}, Q_{i})

.

Let

D = (D_{1}, \dots, D_{n})

denote the inverse of

R

in

\mathcal{R}

, the space of all permutations of

e = (1, 2, \dots, n)

. Define

R^{0} = Q \circ D

, where

R^{0} = (R_{1}^{0}, \dots, R_{n}^{0})

. We can then write the statistic

J_{n}

in its dual form

J_{n} = \sum_{i = 1}^{n} J (i, R_{i}^{0}) .

If

X_{1}

and

X_{2}

are independent then

R^{0}

has a uniform distribution in

\mathcal{R}

and the derivation of its moments is straightforward (see, e.g., Hájek 1969). Proposition 3 in Appendix A shows how to obtain the moments of

A_{n}^{'}

and

B_{n}^{'}

given the distribution of

R^{0}

. But if

X_{1}

and

X_{2}

are not independent then the distribution of

R^{0}

, to the best of the author’s knowledge, is unknown. To give an idea of how different the distribution of the

R_{i}^{0}

is from a uniform distribution in the non-independent case, we ran a simulation from a Clayton copula with dependence parameter corresponding to a Kendall’s tau of

τ = 0.4

(see Joe 2014). We simulated

N = 50{,}000

samples, each sample having

n = 50

pairs of observations from the Clayton copula. The histograms of the simulated observations of

R_{i}^{0}

for

i = 1, 10, 20, 30, 40, 50

are displayed in Figure 2. The histograms in Figure 2 show how the distribution of

R_{i}^{0}

can be far from a uniform in the case of dependent samples. Given that the finite sample properties of the MPL estimators depend on the copula family via the unknown distribution of

R_{i}^{0}

, we continue our investigation of the finite sample properties of the MPL estimators with a simulation study.
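
The experiment behind Figure 2 can be sketched along the following lines. This is an illustrative reconstruction, not the code used for the figure: the conditional-inversion sampler, the Clayton relation θ = 2τ / (1 - τ) between the parameter and Kendall’s tau, the reduced number of samples, and all function names are assumptions made for this example.

```python
import random


def simulate_clayton(n, theta, rng):
    """Draw n pairs from a Clayton copula by conditional inversion."""
    pairs = []
    for _ in range(n):
        u = 1.0 - rng.random()  # uniform on (0, 1]
        w = 1.0 - rng.random()
        v = (u ** (-theta) * (w ** (-theta / (1.0 + theta)) - 1.0) + 1.0) ** (-1.0 / theta)
        pairs.append((u, v))
    return pairs


def dual_ranks(pairs):
    """R0_i: second-margin rank of the pair whose first-margin rank is i."""
    n = len(pairs)
    q = [0] * n
    for rank, k in enumerate(sorted(range(n), key=lambda k: pairs[k][1]), start=1):
        q[k] = rank
    # Sorting the indices by the first margin plays the role of the inverse D.
    by_first = sorted(range(n), key=lambda k: pairs[k][0])
    return [q[k] for k in by_first]


def simulate_r0(i, n=50, tau=0.4, n_samples=2000, seed=0):
    """Simulated draws of R0_i under a Clayton copula with the given tau."""
    theta = 2.0 * tau / (1.0 - tau)  # Clayton relation (assumed here)
    rng = random.Random(seed)
    return [dual_ranks(simulate_clayton(n, theta, rng))[i - 1]
            for _ in range(n_samples)]


draws = simulate_r0(i=1)
# Under independence R0_1 would be uniform on 1..50 with mean 25.5.
print(sum(draws) / len(draws))
```

A histogram of `draws` for several values of i gives a picture comparable in spirit to Figure 2, although with fewer samples than the 50,000 used there.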

6. Method-of-Moments Estimators

In our simulation study, we also compare the performance of the four semiparametric MPL estimators with the method-of-moments (MM) estimators obtained from the relation between the copula parameter and the coefficients Kendall’s tau,

τ

, and Spearman’s rho,

ρ

; see Oakes (1982), Genest (1987), Genest and Rivest (1993). Copula parameter estimates obtained from these rank coefficients via the MM can be referred to as inversion-method estimates. We include the two inversion-method estimators, first, because they perform better than the canonical MPL estimator for small weakly dependent samples and, second, to facilitate the comparison of our results with other related studies.

The MM estimation procedure is mostly used in the bivariate one-parameter copula case, although it may be used in the multivariate and/or multiparameter cases, for instance, by imposing conditions on the dependence structure. In our simulation study, we restrict ourselves to one-parameter bivariate copulas, as explained in Section 7. Hence, consider the random sample

X_{1}, \dots, X_{n}

from an absolutely continuous bivariate copula model

C_{θ} (F_{1}, F_{2})

, where

θ

belongs to an open subset of

R

, and

F_{1}

and

F_{2}

are continuous cdfs. Inversion-method estimators rely on a consistent estimator of a copula moment. A consistent estimator of the copula moment Kendall’s tau is given by

τ_{n} = \frac{4}{n (n - 1)} \sum_{i \neq j} 1 (X_{1, i} \leq X_{1, j}) 1 (X_{2, i} \leq X_{2, j}) - 1 .

Given the ranks

R_{1}, \dots, R_{n}

corresponding to

X_{1}, \dots, X_{n}

, where

R_{j, i}

is the rank of

X_{j, i}

among

X_{j, 1}, \dots, X_{j, n}

for

j = 1, 2

, a consistent estimator of the bivariate copula moment Spearman’s rho is

ρ_{n} = \frac{12}{n (n + 1) (n - 1)} \sum_{i = 1}^{n} R_{1, i} R_{2, i} - 3 \frac{n + 1}{n - 1} .

The copula parameter estimate,

\hat{θ}

, is then obtained by inversion from the relation between

θ

and

τ

or

ρ

as

{\hat{θ}}_{τ} = τ^{- 1} (τ_{n})

or as

{\hat{θ}}_{ρ} = ρ^{- 1} (ρ_{n})

, when the functions

τ

and

ρ

are bijections. In those cases where there is no analytic expression for the relation between the copula parameter and

τ

or

ρ

, a numerical approximation must be used. The consistency, asymptotic normality, and variance of

{\hat{θ}}_{τ}

and

{\hat{θ}}_{ρ}

are well documented in the literature and we refrain from repeating them here, directing the reader to Kojadinovic and Yan (2010) and relevant references therein.
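The two rank-based estimators and the inversion step can be sketched as follows. This is an illustrative implementation of the formulas for τ_n and ρ_n above, assuming no ties in the data. The Clayton relation τ = θ / (2 + θ) is used as the example to invert, and the bisection routine stands in for whatever numerical approximation is used when no closed form is available. All function names are hypothetical.

```python
def kendall_tau(x1, x2):
    """Sample Kendall's tau: 4 / (n (n - 1)) * sum over i != j of the indicators, minus 1."""
    n = len(x1)
    s = 0
    for i in range(n):
        for j in range(n):
            if i != j and x1[i] <= x1[j] and x2[i] <= x2[j]:
                s += 1
    return 4.0 * s / (n * (n - 1)) - 1.0


def ranks(x):
    """Ranks 1..n of the entries of x (assumes no ties)."""
    r = [0] * len(x)
    for pos, k in enumerate(sorted(range(len(x)), key=lambda k: x[k]), start=1):
        r[k] = pos
    return r


def spearman_rho(x1, x2):
    """Sample Spearman's rho computed from the two rank vectors."""
    n = len(x1)
    s = sum(a * b for a, b in zip(ranks(x1), ranks(x2)))
    return 12.0 * s / (n * (n + 1) * (n - 1)) - 3.0 * (n + 1) / (n - 1)


def invert_increasing(g, target, lo, hi, tol=1e-10):
    """Solve g(theta) = target by bisection, for g increasing on [lo, hi]."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def clayton_tau(theta):
    """Kendall's tau implied by a Clayton copula with parameter theta."""
    return theta / (2.0 + theta)


x1 = [1.0, 2.0, 3.0, 4.0]
x2 = [10.0, 20.0, 30.0, 40.0]
tau_n = kendall_tau(x1, x2)
rho_n = spearman_rho(x1, x2)
theta_hat = invert_increasing(clayton_tau, 0.4, 1e-8, 100.0)
```

For the Clayton family the inverse is available in closed form, θ = 2τ / (1 − τ), so the bisection result for τ = 0.4 can be checked against 4/3.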

7. The Finite Sample Performance of the Estimators

In this section, we compare the performance of the semiparametric pseudo-likelihood estimator when calculating the pseudo-observations as in (4)–(7), and the MM Kendall’s tau and Spearman’s rho estimators. Recall that we refer to the MPL estimators for the copula model parameters corresponding to (4)–(7) as canonical MPL, median MPL, mode MPL, and midpoint MPL, respectively. To compare the performance of the six estimators, we perform a simulation study. The calculations are performed using R (R Core Team 2020) and the package copula (Hofert et al. 2020).

Given their wide applicability to finance and insurance, we consider the copula families Clayton, Gumbel–Hougaard, Plackett, Normal, and Student-t. The Clayton family was first written in the form of a copula by Kimeldorf and Sampson (1975). Due to its joint lower tail dependence property, this family has been used to model the association between inter-event times, from epidemiology to insurance. The Gumbel (1960) copula can be used to model joint upper tail dependence, for instance, between large losses on financial assets or insurance claims. The bivariate Plackett (1965) family is radially symmetric and has been used as an alternative to the bivariate normal copula; see Nelsen (2006). The Normal and Student-t copulas are often used in classic finance and insurance multivariate models. Details on each of these copula families can be found, e.g., in Joe (2014). Without loss of generality, we consider the case of positive dependence in the simulation study.

We use six different levels of dependence corresponding to Kendall’s tau of 0.1, 0.2, 0.3, 0.4, 0.6, and 0.8, and four sample sizes of 50, 100, 200, and 400. These choices are also informed by the study of Kojadinovic and Yan (2010) to make it possible to benchmark some of our results against theirs. For each level of dependence and sample size, we simulate 5000 samples from all the copula families. Each sample is then used to estimate the copula parameter and standard error.

For clarification, we do not study the effect of the univariate marginal distributions because these play no role in the copula MPL estimation procedure. The pseudo-observations used in (1) to obtain the MPL estimators are adjusted ranks of the marginal observations and do not depend on the particular distribution of each margin. The set of ranks corresponding to an iid random sample

(X_{1}, \dots, X_{n})

from distribution F is a permutation T from the set of all possible permutations of

(1, \dots, n)

. If the observations are independent, then the probability of obtaining permutation T is

1 / n!

, independently of the distribution F (see e.g., Hájek 1969).
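The invariance of the ranks to the marginal distribution can be checked directly: any strictly increasing transformation of a margin leaves the ranks, and hence the pseudo-observations, unchanged. The data values and the two transformations below are arbitrary illustrations, and the sketch assumes no ties.

```python
import math


def ranks(x):
    """Ranks 1..n of the entries of x (assumes no ties)."""
    r = [0] * len(x)
    for pos, k in enumerate(sorted(range(len(x)), key=lambda k: x[k]), start=1):
        r[k] = pos
    return r


x = [0.3, -1.2, 2.5, 0.9, -0.4]

r_original = ranks(x)
r_exp = ranks([math.exp(v) for v in x])       # strictly increasing map
r_cubic = ranks([v ** 3 + 5.0 for v in x])    # another strictly increasing map

print(r_original, r_exp, r_cubic)
```

All three rank vectors coincide, which is why the simulation study does not need to vary the margins.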

7.1. Results

For each copula and degree of dependence considered, we present in Table A1, Table A2, Table A3 and Table A4 (in Appendix B) the results for sample sizes 50, 100, 200, and 400, respectively. In the tables, the different copula models are labelled as: C for the Clayton, G for the Gumbel–Hougaard, P for the Plackett, N for the Normal, and t for Student-t. For the six estimators, we report the percentage relative bias (PRB

_{\hat{θ}} = (\hat{θ} - θ) / θ \times 100

), the empirical standard deviation of the estimates (

s_{\hat{θ}}

), the mean of the estimated standard errors (

{\bar{s e}}_{\hat{θ}}

), and the empirical percentage coverage (PC

_{\hat{θ}}

) of the approximate 95% confidence interval for the dependence parameter calculated as

\hat{θ} \pm 1.96 s e_{\hat{θ}}

. In the tables, we identify the results using a different subscript for each estimator. The notation for the canonical MPL is

{\hat{θ}}^{c}

, for the median MPL is

{\hat{θ}}^{m}

, for the mode MPL is

{\hat{θ}}^{M}

, for the midpoint MPL is

{\hat{θ}}^{*}

, for the MM Kendall’s tau inversion is

τ

, and for the MM Spearman’s rho inversion is

ρ

.
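The four summary measures reported in the tables can be computed from the simulated estimates as in the sketch below. It assumes that the PRB is evaluated at the mean of the estimates over the replications and that the empirical standard deviation uses the n − 1 divisor. The function name and the toy inputs are hypothetical and are not taken from the tables.

```python
import statistics


def summarise(estimates, std_errors, theta_true):
    """PRB, empirical SD, mean estimated SE and coverage of the 95% interval."""
    mean_est = statistics.fmean(estimates)
    prb = 100.0 * (mean_est - theta_true) / theta_true
    s = statistics.stdev(estimates)
    mean_se = statistics.fmean(std_errors)
    covered = sum(
        1
        for est, se in zip(estimates, std_errors)
        if est - 1.96 * se <= theta_true <= est + 1.96 * se
    )
    pc = 100.0 * covered / len(estimates)
    return {"PRB": prb, "s": s, "mean_se": mean_se, "PC": pc}


# Toy inputs for illustration only.
result = summarise([1.0, 1.2, 0.8, 1.4], [0.2, 0.2, 0.2, 0.2], theta_true=1.0)
print(result)
```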

The results for the percentage relative bias can be visualised in Figure 3, where we plot the PRB for

n = 50

. As already observed by Kojadinovic and Yan (2010), the MM estimators have a smaller relative bias than the canonical MPL for small weakly dependent samples, except for the Student-t copula, where the Spearman’s rho inversion method performs quite poorly. However, the relative advantage of the MM estimators over the canonical MPL diminishes as the sample size increases (see Table A2, Table A3 and Table A4 in Appendix B). For dependence levels

τ \geq 0.4

the MM estimators can actually have a much larger PRB, as is the case for the Plackett copula. The newly considered median, mode and midpoint MPL estimators have smaller bias than the canonical MPL for weakly dependent samples (

τ \leq 0.4

) across all sample sizes. The mode and the midpoint MPL estimators have lower bias than the MM estimators for weakly dependent samples especially for smaller samples. The differences between the estimators in terms of bias reduce as the sample size increases. The median MPL performs remarkably well, in terms of bias, for the Normal and Student-t copulas across all levels of dependence.

The values for the empirical standard deviation of the estimates are very close to the mean of the estimated standard errors. This supports the assumptions underlying the estimator for the asymptotic variance. The empirical percentage coverage (PC) does not seem very different across the six estimators either. We can see that the PC tends to be larger than the 95% level for weaker dependence (

τ = 0.1

) and smaller than the 95% level for stronger dependence. From the results for the standard errors and percentage coverage obtained from the simulations, we find no evidence to contradict the asymptotic normality of the estimators. Overall, the results are consistent across the different copula families and sample sizes considered here.

Table A5 contains the estimated root mean square error (RMSE) for the canonical MPL estimator obtained for each sample size, copula, and level of dependence considered. The RMSE increases with the level of dependence, except for the Normal and Student-t, and decreases as the sample becomes larger. Hence, the higher RMSE for the canonical MPL estimator is observed for small strongly dependent samples and the lower RMSE is obtained from weakly dependent large samples. The increase in the RMSE with the level of dependence is supported by the fact that the estimated standard errors also increase with the strength of dependence, as shown in the PRB tables. For the Normal and Student-t copulas the estimated standard errors and RMSE of the canonical MPL decrease with the strength of the dependence and sample size.

In Table A5, we also report the percentage relative efficiency (PRE) calculated as 100 times the estimated RMSE of the canonical MPL divided by the estimated RMSE of each of the other five estimators. We plot the PRE values in Figure 4 of the five estimators in relation to the canonical MPL for sample size

n = 50

. We observe that the MM Kendall’s tau- and Spearman’s rho-based estimators outperform the canonical MPL estimator for small weakly dependent samples but this advantage vanishes when the level of dependence becomes stronger or the sample size increases. These results are perfectly in line with the results from Kojadinovic and Yan (2010). The three semiparametric MPL estimators proposed here outperform, in terms of MSE, both MM estimators for all levels of dependence and sample size. Consequently, the three estimators introduced also outperform the canonical MPL for low dependence small samples. For stronger levels of dependence,

τ \geq 0.6

, and samples larger than 100 the canonical MPL has the smallest MSE for the sample sizes considered. It is worth noting that, in the simulations, the proposed estimators substantially outperform the canonical MPL for weak dependence while for stronger dependence, the outperformance of the canonical MPL is modest. It is interesting that the MM estimators can have a quite poor performance in terms of MSE for stronger dependence in relation to the MPL estimators. Between the three MPL estimators introduced here, the mode MPL is overall the best for weakly dependent samples. This is particularly clear in Figure 4.
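The RMSE and the percentage relative efficiency follow directly from their definitions. In the sketch below the function names and the toy estimates are hypothetical. A PRE above 100 means that the competing estimator has a smaller RMSE than the canonical MPL.

```python
import math


def rmse(estimates, theta_true):
    """Root mean square error of a list of estimates around the true parameter."""
    return math.sqrt(sum((e - theta_true) ** 2 for e in estimates) / len(estimates))


def pre(rmse_canonical, rmse_other):
    """Percentage relative efficiency of another estimator against the canonical MPL."""
    return 100.0 * rmse_canonical / rmse_other


# Toy estimates for illustration only.
rmse_canonical = rmse([1.3, 0.9, 1.2, 1.4], theta_true=1.0)
rmse_other = rmse([1.1, 0.9, 1.0, 1.2], theta_true=1.0)
print(pre(rmse_canonical, rmse_other))
```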

Finally, we estimate the asymptotic relative efficiency of the median MPL, mode MPL and midpoint MPL in relation to the canonical MPL estimator. The asymptotic percentage relative efficiency for each estimator is calculated as the estimated variance of the canonical MPL estimate, divided by the estimated variance of the MPL estimate given by the method being compared with, multiplied by 100. The estimates are obtained from a pseudo-randomly generated sample of size

n =

100,000. The results, presented in Table A6, confirm that the three proposed MPL estimators and the canonical MPL estimator are asymptotically equally efficient.

8. Application to General Insurance Loss Ratios

In our application, we show the impact of using different MPL estimators while modelling the dependence between general insurance business classes, which is relevant for pricing, reserving and regulatory capital. We apply our results to loss ratios net of reinsurance from three insurance classes: houseowners/householders, domestic motor vehicles, and commercial motor vehicles. The data have been downloaded from the Australian Prudential Regulation Authority (APRA) (https://www.apra.gov.au/, accessed on 20 December 2023) general insurance statistics website. The historical loss ratios are available only from September 2010 until March 2023, comprising a sample of

n = 51

quarterly observations per insurance class.

Common factors underlying the risks covered under these three insurance classes, like weather conditions for instance, suggest the presence of dependence between the loss ratios. The Pearson’s linear correlation between houseowners/householders (house) and domestic motor vehicle (dom-motor) loss ratios is

0.318

, between house and commercial motor vehicle (com-motor) is

0.172

and between dom-motor and com-motor is

0.696

. To select a copula model, we use the goodness-of-fit test from Genest et al. (2009) implemented in the R package copula. Although net of reinsurance, there might still be signs of upper joint tail dependence in the loss ratios. Indeed, a 180° rotated Clayton copula fitted to the loss ratios of house and dom-motor gives a p-value of

51 %

, compared with

23 %

from fitting a Gumbel copula,

10 %

from a Student-t copula and

9 %

from a normal copula. In panel A of Table 1, we list the estimates obtained from fitting a rotated Clayton model to house and dom-motor using the different MPL and MM estimators. For benchmarking, we also list the Kendall’s tau,

τ = θ / (2 + θ)

, and upper tail dependence,

λ_{U} = 2^{- 1 / θ}

, implied by the copula parameter estimates from the different methods. The mode MPL estimation produces the lowest copula parameter estimate and the lowest standard error, while the corresponding canonical MPL estimates are the largest among the MPL estimators. The MM estimators produce the largest copula estimates and standard errors. This agrees with the results we obtained for the finite sample performance of the estimators in Section 7.1. For a Clayton copula with

τ = 0.2

, we observed that the mode MPL has the lowest PRB and standard error indicating that the mode MPL should give the least upward biased estimate of dependence. Comparing the results from the different estimators, note that the Kendall’s tau implied by the copula estimates ranges from

18 %

to

34 %

while the upper tail dependence ranges from

20 %

to

50 %

. Depending on the volume of earned premiums of these insurance classes in a particular insurance company, such variability will potentially have a significant financial impact on the calculation of reserves and regulatory capital of the firm.

For the case of house and com-motor loss ratios, with an even lower linear correlation of

0.172

, the goodness-of-fit test from Genest et al. (2009) ranks first the 180° rotated Clayton copula model with a p-value of

75 %

, followed by a Gumbel copula with

9.4 %

, a normal copula with

3.6 %

and a Student-t copula with a

2.5 %

p-value. The results are consistent with the previous observations; see panel B in Table 1. The mode MPL produces the lowest overall estimate for the copula parameter and standard error, and the canonical MPL gives the highest estimates among the MPL estimators. The MM estimates are the highest across all the estimation methods. The implied Kendall’s tau varies now between

8.7 %

and

16.2 %

, while the upper tail dependence parameter ranges from

2.6 %

to

16.7 %

.

Finally, we consider the pair with the highest linear correlation among the named insurance classes reported in the APRA data: domestic motor vehicle and commercial motor vehicle. These two insurance classes have a sample linear correlation of

69.6 %

between the corresponding loss ratios. In this case, the Gumbel copula model ranks first with a goodness-of-fit test p-value of

98 %

. It is not surprising that a textbook three-dimensional model, such as a multivariate Gumbel or Clayton copula, does not have enough flexibility to accommodate real data, as is the case here. For the Gumbel copula model with parameter

θ

, Kendall’s tau is given by

τ = 1 - 1 / θ

and the upper tail dependence parameter is

λ_{U} = 2 - 2^{1 / θ}

; see Joe (2014). The results from fitting a Gumbel copula model to com-motor and dom-motor, reported in panel C of Table 1, are coherent with those obtained for the previous two pairs of loss ratios. The mode MPL parameter estimate and standard error are the lowest across all the estimation methods, the canonical MPL has the largest estimates among the MPL estimators and the MM estimates are the largest overall. Nevertheless, the differences between the estimates are much smaller than in the previous two lower dependence cases, as we can see from the implied Kendall’s tau and upper tail coefficient estimates. The Kendall’s tau ranges between

36.2 %

and

47.8 %

and the upper tail coefficient varies from

44.4 %

to

56.4 %

.
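The implied quantities quoted for the three pairs follow from the closed forms given above for the rotated Clayton and Gumbel copulas. The sketch below starts from the rounded Kendall’s tau ranges quoted in the text, not from the parameter estimates in Table 1, and recovers the matching upper tail dependence values. The function names are hypothetical.

```python
def clayton_theta_from_tau(tau):
    """Invert tau = theta / (2 + theta)."""
    return 2.0 * tau / (1.0 - tau)


def rotated_clayton_upper_tail(theta):
    """Upper tail dependence of the 180-degree rotated Clayton: 2^(-1/theta)."""
    return 2.0 ** (-1.0 / theta)


def gumbel_theta_from_tau(tau):
    """Invert tau = 1 - 1/theta."""
    return 1.0 / (1.0 - tau)


def gumbel_upper_tail(theta):
    """Upper tail dependence of the Gumbel copula: 2 - 2^(1/theta)."""
    return 2.0 - 2.0 ** (1.0 / theta)


# Kendall's tau ranges quoted in the text (rounded values).
for label, taus in [("house / dom-motor", (0.18, 0.34)), ("house / com-motor", (0.087, 0.162))]:
    lams = [rotated_clayton_upper_tail(clayton_theta_from_tau(t)) for t in taus]
    print(label, [round(v, 3) for v in lams])

gumbel_lams = [gumbel_upper_tail(gumbel_theta_from_tau(t)) for t in (0.362, 0.478)]
print("dom-motor / com-motor", [round(v, 3) for v in gumbel_lams])
```

Because the inputs are rounded, the first pair reproduces the quoted 20% and 50% only approximately.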

From the three cases considered in this application, we observe that the variation of the estimates from the different MPL estimators increases as the dependence level decreases. The mode MPL has consistently the lowest parameter estimate and standard error. At a lower dependence level, the implied Kendall’s tau obtained by the MM estimators is almost double that obtained from the mode MPL. This study based on empirical data confirms what we would expect to observe according to the results we obtained in Section 7.1 for the finite sample performance of the estimators.

9. Conclusions

Kim et al. (2007) and Fermanian and Scaillet (2005) found that misspecification of the margins leads to non-robust estimation of the dependence structure in a copula model with overestimation of the degree of dependence. Later, Kojadinovic and Yan (2010) found that overestimation of the degree of dependence can happen even when the unknown margins are estimated non-parametrically, especially for small weakly dependent samples.

We show here that the pseudo-observations used in the canonical MPL estimation method (Genest et al. 1995) can be seen as expected values of the order statistics and propose new estimators based on the corresponding median and mode instead. We derive the theoretical asymptotic properties of the new MPL estimators. Our simulation study shows that using the mode of the order statistics instead of the mean when calculating the pseudo-observations can reduce the overestimation of the level of dependence, outperforming the canonical MPL and the inversion methods’ Kendall’s tau and Spearman’s rho in terms of mean squared error for weakly dependent samples. For larger, strongly dependent samples, the canonical MPL still outperforms the proposed modified MPL estimators. Hence, within the conditions considered, our study shows that it is preferable to use the MPL estimator where the pseudo-observations are calculated as the mode of the order statistics rather than the mean.

In applications, data might naturally only be available in small samples. This is the case in our empirical study of quarterly general insurance loss ratios. Our application illustrates that the mode MPL estimator gives lower levels of dependence with smaller standard errors. The Kendall’s tau coefficient implied from the different estimators varies by up to more than

80 %

, showing the importance of understanding well the performance of the different estimators.

Funding

This research received no external funding.

Data Availability Statement

The data used in this study are publicly available and can be found here: (https://www.apra.gov.au/, accessed on 20 December 2023).

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Proofs

Appendix A.1. Proof of Proposition 1

Proof.

Let

F_{(r)} (x)

, for

r = 1, 2, \dots, n

, denote the cdf of the rth order statistic

X_{(r)}

. It is well known (see e.g., David and Nagaraja 2003) that

F_{(r)} (x) = \sum_{i = r}^{n} \binom{n}{i} {[F (x)]}^{i} {[1 - F (x)]}^{n - i} .

If we assume that

X_{i}

is continuous, denoting the pdf of

X_{(r)}

by

f_{(r)} (x)

we have that

f_{(r)} (x) = \frac{1}{B (r, n - r + 1)} F {(x)}^{r - 1} {[1 - F (x)]}^{n - r} f (x),

where

f (x) = F^{'} (x)

is the pdf of

X_{i}

and

B (a, b) = \int_{0}^{1} t^{a - 1} {(1 - t)}^{b - 1} d t

, for

a > 0

and

b > 0

, is the beta function.

Given that the cdf of

X_{i}

, for

i = 1, \dots, n

, is F, the random sample

(F (X_{1}), F (X_{2}), \dots, F (X_{n}))

= (U_{1}, \dots, U_{n})

is drawn from a standard uniform distribution

U (0, 1)

. Hence, the pdf of the rth order statistic

F {(X)}_{(r)} = U_{(r)}

has the expression

f_{(r)} (u) = \frac{1}{B (r, n - r + 1)} u^{r - 1} {(1 - u)}^{n - r}, \quad u \in (0, 1),

and belongs to the family of beta distributions. The mean of the rth order statistic for a random sample from a standard uniform

U (0, 1)

distribution is then

E [F {(X)}_{(r)}] = E [U_{(r)}] = \frac{r}{r + (n - r + 1)} = \frac{r}{n + 1} .

Defining the function

a (r) = E [U_{(r)}]

, for

1 \leq r \leq n

, gives the result. □
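The result can be checked numerically. The sketch below simulates the rth order statistic of a standard uniform sample and compares its Monte Carlo mean with r / (n + 1). The choices n = 5, r = 2, the number of replications and the seed are arbitrary, and the function name is hypothetical.

```python
import random


def mean_uniform_order_stat(r, n, n_rep=20000, seed=0):
    """Monte Carlo mean of the r-th order statistic of n iid U(0, 1) draws."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_rep):
        u = sorted(rng.random() for _ in range(n))
        total += u[r - 1]  # r-th smallest value
    return total / n_rep


simulated = mean_uniform_order_stat(r=2, n=5)
exact = 2 / (5 + 1)
print(simulated, exact)
```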

Appendix A.2. Proof of Proposition 2

Consider a continuous bivariate distribution

F (x_{1}, x_{2})

with copula

C (u_{1}, u_{2})

and marginals

F_{1} (x_{1})

and

F_{2} (x_{2})

such that

F (x_{1}, x_{2}) = C [F_{1} (x_{1}), F_{2} (x_{2})]

.

Definition A1.

For a random sample

\{(X_{1, k}, X_{2, k}) : k = 1, \dots, n\}

, define the following rescaled versions of the empirical distribution for

j = 1, 2

:

(a): ${\hat{F}}_{j, n} (x) = {(n + 1)}^{- 1} \sum_{k = 1}^{n} 1 (X_{j, k} \leq x)$ for $x \in R$ ,
(b): ${\bar{F}}_{j, n} (x) = {(n + 1 / 3)}^{- 1} [\sum_{k = 1}^{n} 1 (X_{j, k} \leq x) - 1 / 3]$ for $x \in R$ ,
(c): $F_{j, n}^{*} (x) = {(n - 1)}^{- 1} [\sum_{k = 1}^{n} 1 (X_{j, k} \leq x) - 1]$ for $x \in (a, b)$ and $F_{j, n}^{*} (x) = {\hat{F}}_{j, n} (x)$ for $x \in R \setminus (a, b)$ , with $a = \min \{X_{j, 1}, \dots, X_{j, n}\}$ and $b = \max \{X_{j, 1}, \dots, X_{j, n}\}$ ,
(d): ${\tilde{F}}_{j, n} (x) = n^{- 1} [\sum_{k = 1}^{n} 1 (X_{j, k} \leq x) - 1 / 2]$ for $x \in R$ .
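Evaluated at an observation with rank r (so that the sum of indicators equals r), the four rescaled empirical distributions reduce to simple functions of r and n. The sketch below assumes no ties. The labels "canonical", "median", "mode" and "midpoint" for (a) to (d) are an assumption based on the order in which the four MPL estimators are named in Section 7.

```python
def pseudo_obs(rank, n, kind="canonical"):
    """Pseudo-observation for an observation of given rank in a sample of size n."""
    if kind == "canonical":   # (a): r / (n + 1)
        return rank / (n + 1)
    if kind == "median":      # (b): (r - 1/3) / (n + 1/3)
        return (rank - 1.0 / 3.0) / (n + 1.0 / 3.0)
    if kind == "mode":        # (c): (r - 1) / (n - 1) strictly inside the sample range
        if 1 < rank < n:
            return (rank - 1) / (n - 1)
        return rank / (n + 1)  # at the minimum and maximum, fall back to (a)
    if kind == "midpoint":    # (d): (r - 1/2) / n
        return (rank - 0.5) / n
    raise ValueError("unknown kind: " + kind)


n = 5
for kind in ("canonical", "median", "mode", "midpoint"):
    print(kind, [round(pseudo_obs(r, n, kind), 4) for r in range(1, n + 1)])
```

The four versions agree at the middle rank and differ most at the extremes of the sample. The fallback in (c) keeps the smallest and largest pseudo-observations strictly inside (0, 1).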

Let

{\hat{F}}_{j} (x_{j})

for

j = 1, 2

denote any of the rescaled empirical distributions in Definition A1. In order to prove Proposition 2, we first introduce two results concerning the asymptotic behaviour of statistics of the form

R_{n} = n^{- 1} \sum_{k = 1}^{n} J [{\hat{F}}_{1} (X_{1, k}), {\hat{F}}_{2} (X_{2, k})],

where

J (u_{1}, u_{2})

is a continuous function from

{(0, 1)}^{2}

into

R

such that

μ = E [J \{F_{1} (X_{1}), F_{2} (X_{2})\}] = \int J (u_{1}, u_{2}) d C (u_{1}, u_{2})

exists. Define the function

r (u) = u (1 - u)

, on

(0, 1)

, let p and q be positive numbers satisfying

1 / p + 1 / q = 1

and

δ > 0

.

Proposition A1.

If

J (u_{1}, u_{2}) \leq M r {(u_{1})}^{a} r {(u_{2})}^{b}

with M a positive constant,

a = (- 1 + δ) / p

and

b = (- 1 + δ) / q

, then

R_{n} \to μ

almost surely.

First, note that all the rescaled empirical distribution functions (b), (c), and (d) in Definition A1 are functions of the rescaled empirical distribution (a). The proof of Proposition A1 can then be obtained following an argument similar to the one used by Genest et al. (1995) to prove their Proposition A1.

Proposition A2.

If

J (u_{1}, u_{2}) \leq M r {(u_{1})}^{a} r {(u_{2})}^{b}

with M a positive constant,

a = (- 0.5 + δ) / p

and

b = (- 0.5 + δ) / q

, and if J admits continuous partial derivatives

J_{i} (u_{1}, u_{2}) = \partial J (u_{1}, u_{2}) / \partial u_{i}

on

{(0, 1)}^{2}

such that

J_{i} (u_{1}, u_{2}) \leq M r {(u_{1})}^{d_{i}} r {(u_{2})}^{d_{3 - i}}

with

d_{1} = a - 1

and

d_{2} = b

, then

n^{1 / 2} (R_{n} - μ) \to_{d} N (0, σ^{2})

, with

σ^{2} = var \{J [F_{1} (X_{1}), F_{2} (X_{2})] + \sum_{i = 1}^{2} \int 1 (X_{i} \leq x_{i}) J_{i} [F_{1} (x_{1}), F_{2} (x_{2})] d F (x_{1}, x_{2})\} .

The proof of Proposition A2 can be obtained by using the rescaled empirical distributions (b), (c), and (d) of Definition A1 in the proof of Theorem 2.1 of Ruymgaart et al. (1972).

Now, to prove Proposition 2, we apply Proposition A1 taking J equal to

l_{θ, θ}

to obtain that

B_{n} \to β

almost surely. Then, taking J equal to

l_{θ}

in Proposition A2, we obtain that

n^{1 / 2} A_{n} \to_{d} N (0, σ^{2})

with

σ^{2} = var \{l_{θ} [θ, F_{1} (X_{1}), F_{2} (X_{2})] + \sum_{i = 1}^{2} W_{i} (X_{i})\},

giving the asymptotic results of Proposition 2.

Appendix A.3. Proposition 3

Proposition 3.

Consider the set of iid pairs of continuous random variables

\{(X_{1, k}, X_{2, k}) : k = 1, \dots, n\}

from distribution

H_{θ}

and the function f(r) from

\{1, \dots, n\}

to

(0, 1)

. Then,

\begin{aligned}
E (A_{n}^{'}) & = \sum_{i = 1}^{n} \sum_{k = 1}^{n} l_{θ} (θ, f (i), f (k)) P (R_{i}^{0} = k), \\
var (A_{n}^{'}) & = \sum_{i = 1}^{n} \sum_{k = 1}^{n} {[l_{θ} (θ, f (i), f (k)) - {\bar{l}}_{θ} (i)]}^{2} P (R_{i}^{0} = k) \\
& \quad + \underset{i \neq j}{\sum \sum} \underset{k \neq h}{\sum \sum} [l_{θ} (θ, f (i), f (k)) - {\bar{l}}_{θ} (i)] [l_{θ} (θ, f (j), f (h)) - {\bar{l}}_{θ} (j)] P (R_{i}^{0} = k, R_{j}^{0} = h), \\
cov (A_{n}^{'}, B_{n}^{'}) & = - \sum_{i = 1}^{n} \sum_{j = 1}^{n} \underset{k \neq h}{\sum \sum} [l_{θ} (θ, f (i), f (k)) - {\bar{l}}_{θ} (i)] [l_{θ, θ} (θ, f (j), f (h)) - {\bar{l}}_{θ, θ} (j)] \\
& \quad \times P (R_{i}^{0} = k, R_{j}^{0} = h),
\end{aligned}

with

{\bar{l}}_{θ} (i) = \sum_{k = 1}^{n} l_{θ} (θ, f (i), f (k)) P (R_{i}^{0} = k)

and

{\bar{l}}_{θ, θ} (j) = \sum_{k = 1}^{n} l_{θ, θ} (θ, f (j), f (k)) P (R_{j}^{0} = k)

.

The expected value and variance of

B_{n}^{'}

are obtained in a similar way by replacing

l_{θ}

and

{\bar{l}}_{θ}

with

l_{θ, θ}

and

{\bar{l}}_{θ, θ}

, respectively. The proof can be easily obtained from the definitions of expected value, variance, and covariance of a discrete random variable.

Appendix B. Simulation Results

Table A1. Percentage relative bias (PRB), empirical standard deviation of the estimates (s), mean of the estimated standard errors (

\bar{s e}