An Optimal Three-Way Stable and Monotonic Spectrum of Bounds on Quantiles: A Spectrum of Coherent Measures of Financial Risk and Economic Inequality

Pinelis, Iosif

doi:10.3390/risks2030349

Open AccessArticle

An Optimal Three-Way Stable and Monotonic Spectrum of Bounds on Quantiles: A Spectrum of Coherent Measures of Financial Risk and Economic Inequality

by

Iosif Pinelis

Department of Mathematical Sciences, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA

Risks 2014, 2(3), 349-392; https://doi.org/10.3390/risks2030349

Submission received: 15 June 2014 / Revised: 21 August 2014 / Accepted: 10 September 2014 / Published: 23 September 2014

(This article belongs to the Special Issue Selected Papers from the Sixth International Conference on Mathematical and Statistical Methods for Actuarial Sciences and Finance)

Download

Browse Figures

Versions Notes

Abstract

:

A spectrum of upper bounds

{(Q_{α} (X; p))}_{α \in [0, \infty]}

on the (largest)

(1 - p)

-quantile

Q (X; p)

of an arbitrary random variable X is introduced and shown to be stable and monotonic in α, p, and X, with

Q_{0} (X; p) = Q (X; p)

. If p is small enough and the distribution of X is regular enough, then

Q_{α} (X; p)

is rather close to

Q (X; p)

. Moreover, these quantile bounds are coherent measures of risk. Furthermore,

Q_{α} (X; p)

is the optimal value in a certain minimization problem, the minimizers in which are described in detail. This allows of a comparatively easy incorporation of these bounds into more specialized optimization problems. In finance,

Q_{0} (X; p)

and

Q_{1} (X; p)

are known as the value at risk (VaR) and the conditional value at risk (CVaR). The bounds

Q_{α} (X; p)

can also be used as measures of economic inequality. The spectrum parameter α plays the role of an index of sensitivity to risk. The problems of the effective computation of the bounds are considered. Various other related results are obtained.

Keywords:

quantile bounds; coherent measures of risk; sensitivity to risk; measures of economic inequality; value at risk (VaR); conditional value at risk (CVaR); stochastic dominance; stochastic orders

Graphical Abstract

1. Introduction

The most common measure of risk is apparently the value at risk,

{VaR}_{p} (X)

, defined as the largest

(1 - p)

-quantile of (the distribution of) a random variable (r.v.) X, which represents an uncertain future loss on an investment portfolio. Whereas very simple conceptually, the risk measure

{VaR}_{p}

is not subadditive and, hence, is not coherent, in the sense established by Artzner et al. [1] and widely accepted afterwards. Other flaws of the value at risk are also well known; quoting Rockafellar and Uryasev [2]:

A very serious shortcoming of VaR, in addition, is that it provides no handle on the extent of the losses that might be suffered beyond the threshold amount indicated by this measure. It is incapable of distinguishing between situations where losses that are worse may be deemed only a little bit worse, and those where they could well be overwhelming. Indeed, it merely provides a lowest bound for losses in the tail of the loss distribution and has a bias toward optimism instead of the conservatism that ought to prevail in risk management.

In other words, the

VaR

is not sensitive to the amount of risk beyond the threshold. Moreover, as is also discussed in [2],

{VaR}_{p} (X)

is unstable in p and unstable in (the distribution of) X: arbitrarily small changes of the confidence level

1 - p

or of the composition of the portfolio may effect arbitrarily large changes of the value of

{VaR}_{p} (X)

. Closely related to these two kinds of instability is the inherent instability in the computation of

{VaR}_{p} (X)

.

To address these deficiencies of the

VaR

, Rockafellar and Uryasev [2,3] proposed an alternative risk measure,

CVaR

, which stands for the conditional value at risk. In the case when (the distribution of) the r.v. X is continuous,

{CVaR}_{p} (X)

can be defined as

E (X | X ⩾ {VaR}_{p} (X))

, the conditional expectation of the loss given that the loss X exceeds the threshold

{VaR}_{p} (X)

. This alternative risk measure,

{CVaR}_{p} (X)

, is coherent and stable in p and in X; it also has a certain, fixed sensitivity to the losses beyond the threshold.

However,

{CVaR}_{p} (X)

provides no handle on the degree of sensitivity to risk. In particular, as will be demonstrated in Section 5.2, one can easily construct two portfolios with the same value of

{CVaR}_{p}

, such that one of the portfolios is clearly riskier than the other. Such indifference may generally be considered “an unwanted characteristic”; see e.g. comments on pages 36 and 48 in [4].

The main objective of the present paper is to remedy this indifference and provide the mentioned missing handle on the degree of sensitivity to risk, while retaining the coherence and stability properties. Indeed, we shall present a spectrum of risk measures

{(Q_{α} (X; p))}_{α \in [0, \infty]}

, where the spectrum parameter α may be considered the degree of sensitivity to risk: the greater the value of α, the greater the sensitivity to risk; see Section 5.2 for details. In particular,

α = \infty

corresponds to an “exponentially” high degree of risk sensitivity. Moreover, the proposed spectrum of risk measures possesses the following properties:

(I): The common risk measures $VaR$ and $CVaR$ are in the spectrum: $Q_{0} (X; p) = {VaR}_{p} (X)$ and $Q_{1} (X; p) = {CVaR}_{p} (X)$ ; thus, $Q_{α} (X; p)$ interpolates between ${VaR}_{p} (X)$ and ${CVaR}_{p} (X)$ for $α \in (0, 1)$ and extrapolates from ${VaR}_{p} (X)$ and ${CVaR}_{p} (X)$ on towards higher degrees of risk sensitivity for $α \in (1, \infty]$ . Details on this can be found in Section 5.1.
(II): The risk measure $Q_{α} (\cdot; p)$ is coherent for each $α \in [1, \infty]$ and each $p \in (0, 1)$ , but it is not coherent for any $α \in [0, 1)$ and any $p \in (0, 1)$ . Thus, $α = 1$ is the smallest value of the sensitivity index for which the risk measure $Q_{α} (X; p)$ is coherent. One may also say that for $α \in [1, \infty]$ the risk measure $Q_{α} (\cdot; p)$ inherits the coherence of ${CVaR}_{p} = Q_{1} (\cdot; p)$ , and for $α \in [0, 1)$ it inherits the lack of coherence of ${VaR}_{p} = Q_{0} (\cdot; p)$ . For details, see Section 5.3.
(III): $Q_{α} (X; p)$ is three-way stable and monotonic: in $α \in (0, \infty]$ , in $p \in (0, 1)$ , and in X. Moreover, as stated in Theorem 3.4 and Proposition 3.5, $Q_{α} (X; p)$ is nondecreasing in X with respect to the stochastic dominance of any order $γ \in [1, α + 1]$ ; but, this monotonicity property breaks down for the stochastic dominance of any order $γ \in (α + 1, \infty]$ . Thus, the sensitivity index α is in a one-to-one correspondence with the highest order of the stochastic dominance respected by $Q_{α} (X; p)$ .

Rockafellar and Uryasev [2] also wrote: “Most importantly for applications, however, CVaR can be expressed by a remarkable minimization formula.” It will be shown (in Theorem 3.3) that our risk measures

Q_{α} (X; p)

possess quite a similar variational representation for each

α \in (0, \infty]

, which in fact generalizes the minimization formula for

CVaR

. This representation allows of a comparatively easy incorporation of the risk measures

Q_{α} (X; p)

into more specialized optimization problems, with additional restrictions on the r.v. X; see Section 4.3 for details.

The spectrum of risk measures

{(Q_{α} (X; p))}_{α \in [0, \infty]}

is naturally based on a previously developed spectrum

{(P_{α} (X; x))}_{α \in [0, \infty]}

of upper bounds on the tail probability

P (X ⩾ x)

for

x \in R

, with

P_{0} (X; x) = P (X ⩾ x)

and

P_{\infty} (X; x)

being the best possible exponential upper bound on

P (X ⩾ x)

; see, e.g., [5,6] and bibliography therein; a shorter version of [6] appeared as [7]. The spectrum

{(P_{α} (X; x))}_{α \in [0, \infty]}

is shown in the present paper to be stable and monotonic in α, x, and X. The bounds

P_{α} (X; x)

are optimal values in certain minimization problems. It is shown that the mentioned minimization problems for which

P_{α} (X; x)

and

Q_{α} (X; p)

are the optimal values are in a certain sense dual to each other; in the special case

α = \infty

, this corresponds to the bilinear Legendre–Fenchel duality.

A few related results are obtained as well. In particular, a generalization of the Cillo–Delquie necessary and sufficient condition for the so-called mean-risk (M-R) to be nondecreasing with respect to the stochastic dominance of order 1 is presented, with a short proof. Moreover, a necessary and sufficient condition for the M-R measure to be coherent is given.

It is also shown that the quantile bounds

Q_{α} (X; p)

can be used as measures of economic inequality, and then the spectrum parameter α may be considered an index of sensitivity to inequality: the greater is the value of α, the greater is the sensitivity of the function

Q_{α} (\cdot; p)

to inequality.

In addition, it is demonstrated that

P_{α} (X; x)

and

Q_{α} (X; p)

can be effectively computed.

The paper is structured as follows.

*: In Section 2, the three-way stability and monotonicity, as well as other useful properties, of the spectrum ${(P_{α} (X; x))}_{α \in [0, \infty]}$ of upper bounds on tail probabilities are established.
*: In Section 3, the corresponding properties of the spectrum ${(Q_{α} (X; p))}_{α \in [0, \infty]}$ of risk measures are presented, as well as other useful properties.
*: The matters of effective computation of $P_{α} (X; x)$ and $Q_{α} (X; p)$ , as well as optimization of $Q_{α} (X; p)$ with respect to X, are considered in Section 4.
*: An extensive discussion of results is presented in Section 5, particularly in relation with existing literature.
*: Concluding remarks are collected in Section 6.
*: The necessary proofs are given in Appendix A.

Further details can be found in the arXiv version of this paper [8].

2. An Optimal Three-Way Stable and Three-Way Monotonic Spectrum of Upper Bounds on Tail Probabilities

Consider the family

{(h_{α})}_{α \in [0, \infty]}

of functions

h_{α} : R \to R

given by the formula

h_{α} (u) : = \{\begin{matrix} I {u ⩾ 0} & if α = 0, \\ {(1 + u / α)}_{+}^{α} & if 0 < α < \infty, \\ e^{u} & if α = \infty \end{matrix}

(2.1)

for all

u \in R

. Here, as usual,

I {\cdot}

denotes the indicator function,

u_{+} : = 0 \lor u

and

u_{+}^{α} : = {(u_{+})}^{α}

for all real u.

Obviously, the function

h_{α}

is nonnegative and nondecreasing for each

α \in [0, \infty]

, and it is also continuous for each

α \in (0, \infty]

. Moreover, it is easy to see that, for each

u \in R

,

h_{α} (u) is nondecreasing and continuous in α \in [0, \infty] .

(2.2)

Next, let us use the functions

h_{α}

as generalized moment functions and thus introduce the generalized moments

A_{α} (X; x) (λ) : = E h_{α} (λ (X - x)) .

(2.3)

Here and in what follows, unless otherwise specified, X is any random variable (r.v.),

x \in R

,

α \in [0, \infty]

, and

λ \in (0, \infty)

. Since

h_{α} ⩾ 0

, the expectation in formula (2.3) is always defined, but may take the value ∞. It may be noted that in the particular case

α = 0

, one has

A_{0} (X; x) (λ) = P (X ⩾ x),

(2.4)

which does not actually depend on

λ \in (0, \infty)

.

Now one can introduce the expressions

P_{α} (X; x) : = inf_{λ \in (0, \infty)} A_{α} (X; x) (λ) = \{\begin{matrix} P (X ⩾ x) & if α = 0, \\ inf_{λ \in (0, \infty)} {E (1 + λ (X - x) / α)}_{+}^{α} & if 0 < α < \infty, \\ inf_{λ \in (0, \infty)} E e^{λ (X - x)} & if α = \infty . \end{matrix}

(2.5)

By the property stated in (2.2),

A_{α} (X; x) (λ)

and

P_{α} (X; x)

are nondecreasing in

α \in [0, \infty]

. In particular,

P_{0} (X; x) = P (X ⩾ x) ⩽ P_{α} (X; x) .

(2.6)

It will be shown later (see Proposition 2.3) that

P_{α} (X; x)

also largely inherits the property of

h_{α} (u)

of being continuous in

α \in [0, \infty]

.

The definition (2.5) can be rewritten as

P_{α} (X; x) = inf_{t \in T_{α}} {\tilde{A}}_{α} (X; x) (t)

(2.7)

where

\begin{matrix} T_{α} & : = \{\begin{matrix} R & if α \in [0, \infty), \\ (0, \infty) & if α = \infty \end{matrix} \end{matrix}

(2.8)

and

\begin{matrix} {\tilde{A}}_{α} (X; x) (t) & : = \{\begin{matrix} \frac{E {(X - t)}_{+}^{α}}{{(x - t)}_{+}^{α}} & if α \in [0, \infty), \\ E e^{(X - x) / t} & if α = \infty . \end{matrix} \end{matrix}

(2.9)

Here and subsequently, we also use the conventions

0^{0} : = 0

and

\frac{a}{0} : = \infty

for all

a \in [0, \infty]

. The alternative representation (2.7) of

P_{α} (X; x)

follows because (i)

A_{α} (X; x) (λ) = {\tilde{A}}_{α} (X; x) (x - \frac{α}{λ})

for

α \in (0, \infty)

; (ii)

A_{\infty} (X; x) (λ) = {\tilde{A}}_{\infty} (X; x) (\frac{1}{λ})

; and (iii)

P_{0} (X; x) = P (X ⩾ x) = {inf}_{t \in (- \infty, x)} P (X > t) = {inf}_{t \in (- \infty, x)} {\tilde{A}}_{0} (X; x) (t)

.

In view of Formula (2.7), one can see (cf. Corollary 2.3 in [5]) that, for each

α \in [0, \infty]

,

P_{α} (X; x)

is the optimal (that is, least possible) upper bound on the tail probability

P (X ⩾ x)

given the generalized moments

E g_{α; t} (X)

for all

t \in T_{α}

, where:

g_{α; t} (u) : = \{\begin{matrix} {(u - t)}_{+}^{α} & if α \in [0, \infty), \\ e^{u / t} & if α = \infty . \end{matrix}

(2.10)

In fact (cf. e.g. Proposition 3.3 in [6]), the bound

P_{α} (X; x)

remains optimal given the larger class of generalized moments

E g (X)

for all functions

g \in H^{α}

, where

H^{α} : = \{g \in R^{R} : g (u) = \int_{R} g_{α; t} (u) μ (d t) for some μ \in M_{α} and all u \in R\},

(2.11)

M_{α}

denotes the set of all nonnegative Borel measures on

T_{α}

, and, as usual,

R^{R}

stands for the set of all real-valued functions on

R

. By Proposition 1(ii) in [9] and Proposition 3.4 in [6],

0 ⩽ α < β ⩽ \infty implies H^{α} \supseteq H^{β} .

(2.12)

This provides the other way to come to the mentioned conclusion that

P_{α} (X; x) is nondecreasing in α \in [0, \infty] .

(2.13)

By Proposition 1.1 in [10], the class

H^{α}

of generalized moment functions can be characterized as follows in the case when α is a natural number: for any

g \in R^{R}

, one has

g \in H^{α}

if and only if g has finite derivatives

g^{(0)} : = g, g^{(1)} : = g^{'}, \dots, g^{(α - 1)}

on

R

, such that

g^{(α - 1)}

is convex on

R

and

{lim}_{x \to - \infty} g^{(j)} (x) = 0

for

j = 0, 1, \dots, α - 1

. Moreover, by Proposition 3.4 in [6],

g \in H^{\infty}

if and only if g is infinitely differentiable on

R

, and

g^{(j)} ⩾ 0

on

R

and

{lim}_{x \to - \infty} g^{(j)} (x) = 0

for all

j = 0, 1, \dots

.

Thus, the greater the value of α, the narrower and easier to deal with is the class

H^{α}

and the smoother are the functions comprising

H^{α}

. However, the greater the value of α, the farther away is the bound

P_{α} (X; x)

from the true tail probability

P (X ⩾ x)

.

Of the bounds

P_{α} (X; x)

, the loosest and easiest one to get is

P_{\infty} (X; x)

, the so-called exponential upper bound on the tail probability

P (X ⩾ x)

. It is used very widely, in particular when X is the sum of independent r.v.’s

X_{i}

, in which case one can rely on the factorization

A_{α} (X; x) (λ) = e^{- λ x} \prod_{i} E e^{λ X_{i}}

. A bound very similar to

P_{3} (X; x)

was introduced in [11] in the case when X the sum of independent bounded r.v.’s; see also [12,13,14]. For any

α \in (0, \infty)

, the bound

P_{α} (X; x)

is a special case of a more general bound given in Corollary 2.3 in [5]; see also Theorem 2.5 in [5]. For some of the further developments in this direction, see [7] and the bibliography therein. The papers mentioned in this paragraph used the representation (2.7) of

P_{α} (X; x)

, rather than the new representation (2.5). The new representation appears, not only of more unifying form, but also more convenient as far as such properties of

P_{α} (X; x)

as the monotonicity in α and the continuity in α and in X are concerned; cf. (2.2) and the proofs of Propositions 2.3 and 2.4; those proofs, as well as the proofs of most of the other statements in this paper, are given in Appendix A. Yet another advantage of the representation (2.5) is that, for

α \in [1, \infty)

, the function

A_{α} (X; x) (\cdot)

inherits the convexity property of

h_{α}

, which facilitates the minimization of

A_{α} (X; x) (λ)

in

λ

, as needed to find

P_{α} (X; x)

by Formula (2.5); relevant details on the remaining “difficult case”

α \in (0, 1)

can be found in Section 4.1.

On the other hand, the “old” representation (2.7) of

P_{α} (X; x)

is more instrumental in establishing the mentioned connection with the classes

H^{α}

of generalized moment functions; in proving Part (iii) of Proposition 2.2; and in discovering and proving Theorem 3.3.

***

Some of the more elementary properties of

P_{α} (X; x)

are presented in

Proposition 2.1.

(i): $P_{α} (X; x)$ is nonincreasing in $x \in R$ .
(ii): If $α \in (0, \infty)$ and $E X_{+}^{α} = \infty$ , then $P_{α} (X; x) = \infty$ for all $x \in R$ .
(iii): If $α = \infty$ and $E e^{λ X} = \infty$ for all real $λ > 0$ , then $P_{\infty} (X; x) = \infty$ for all $x \in R$ .
(iv): If $α \in (0, \infty)$ and $E X_{+}^{α} < \infty$ , then $P_{α} (X; x) \to 1$ as $x \to - \infty$ and $P_{α} (X; x) \to 0$ as $x \to \infty$ , so that $0 ⩽ P_{α} (X; x) ⩽ 1$ for all $x \in R$ .
(v): If $α = \infty$ and $E e^{λ_{0} X} < \infty$ for some real $λ_{0} > 0$ , then $P_{α} (X; x) \to 1$ as $x \to - \infty$ and $P_{α} (X; x) \to 0$ as $x \to \infty$ , so that $0 ⩽ P_{α} (X; x) ⩽ 1$ for all $x \in R$ .

In view of Proposition 2.1, it will be henceforth assumed by default that the tail bounds

P_{α} (X; x)

– as well as the quantile bounds

Q_{α} (X; p)

, to be introduced in Section 3, and also the corresponding expressions

A_{α} (X; x) (λ)

,

{\tilde{A}}_{α} (X; x) (t)

, and

B_{α} (X; p) (t)

, as in Formulas (2.3), (2.9), and (3.9)) are defined and considered only for r.v.’s

X \in X_{α}

(unless indicated otherwise), where

X_{α} : = \{\begin{matrix} X & if α = 0, \\ \{X \in X : E X_{+}^{α} < \infty\} & if α \in (0, \infty), \\ \{X \in X : Λ_{X} \neq Ø\} & if α = \infty, \end{matrix}

(2.14)

X

is the set of all real-valued r.v.’s on a given probability space (implicit in this paper), and

Λ_{X} : = \{λ \in (0, \infty) : E e^{λ X} < \infty\} .

(2.15)

Observe that the set

X_{α}

is a convex cone containing all real constants; for details on this, one may see comments in the paragraph containing Formula (1.14) in [8].

As usual, we let

{∥ Z ∥}_{α} : = {(E | Z |}^{α})^{1 / α}

, the

L^{α}

-norm of a r.v. Z, which is actually a norm if and only if

α ⩾ 1

.

It follows from Proposition 2.1 and Formula (2.6) that

P_{α} (X; x) is nonincreasing in x \in R, with P_{α} (X; (- \infty) +) = 1 and P_{α} (X; \infty -) = 0 .

(2.16)

Here, as usual,

f (a +)

and

f (a -)

denote the right and left limits of f at a.

One can say more in this respect. To do that, introduce

x_{*} : = x_{*, X} : = sup supp X and p_{*} : = p_{*, X} : = P (X = x_{*}) .

(2.17)

Here, as usual,

supp X

denotes the support set of (the distribution of the r.v.) X; speaking somewhat loosely,

x_{*}

is the maximum value taken by the r.v. X, and

p_{*}

is the probability with which this value is taken. It is of course possible that

x_{*} = \infty

, in which case necessarily

p_{*} = 0

, since the r.v. X was assumed to be real-valued.

Introduce also

x_{α} : = x_{α, X} : = inf E_{α} (1),

(2.18)

where

E_{α} (p) : = E_{α, X} (p) : = {x \in R : P_{α} (X; x) < p} .

(2.19)

Recall that, according to the standard convention, for any subset E of

R

,

inf E = \infty

if and only if

E = Ø

. Now, one can state

Proposition 2.2.

(i): For all $x \in [x_{*}, \infty)$ , one has $P_{α} (X; x) = P_{0} (X; x) = P (X ⩾ x) = P (X = x) = p_{*} I {x = x_{*}} .$
(ii): For all $x \in (- \infty, x_{*})$ , one has $P_{α} (X; x) > 0$ .
(iii): The function $(- \infty, x_{*}] \cap R ∋ x \mapsto P_{α} {(X; x)}^{- 1 / α}$ is continuous and convex if $α \in (0, \infty)$ ; we use the conventions $0^{- a} : = \infty$ and $\infty^{- a} : = 0$ for all real $a > 0$ ; concerning the continuity of functions with values in the set $[0, \infty]$ , we use the natural topology on this set. Also, the function $(- \infty, x_{*}] \cap R ∋ x \mapsto - ln P_{\infty} (X; x)$ is continuous and convex, with the convention $ln 0 : = - \infty$ .
(iv): If $α \in (0, \infty]$ , then the function $(- \infty, x_{*}] \cap R ∋ x \mapsto P_{α} (X; x)$ is continuous.
(v): The function $R ∋ x \mapsto P_{α} (X; x)$ is left-continuous.
(vi): $x_{α}$ is nondecreasing in $α \in [0, \infty]$ , and $x_{α} < \infty$ for all $α \in [0, \infty]$ .
(vii): If $α \in [1, \infty]$ , then $x_{α} = E X$ ; even for $X \in X_{α}$ , it is of course possible that $E X = - \infty$ , in which case $P_{α} (X; x) < 1$ for all real x.
(viii): $x_{α} ⩽ x_{*}$ , and $x_{α} = x_{*}$ if and only if $p_{*} = 1$ .
(ix): $E_{α} (1) = (x_{α}, \infty) \neq Ø$ .
(x): $P_{α} (X; x) = 1$ for all $x \in (- \infty, x_{α}]$ .
(xi): If $α \in (0, \infty]$ , then $P_{α} (X; x)$ is strictly decreasing in $x \in [x_{α}, x_{*}] \cap R$ .

This proposition will be useful when establishing continuity properties of the quantile bounds considered in Section 3 and the matters of effective computation addressed in Section 4. Moreover, Proposition 2.2 will be heavily used in the proof of Proposition 3.1 to establish basic properties of the risk measures

Q_{α} (X; p)

.

For

α \in (1, \infty)

, Parts (i), (iv), (vii), (x), and (xi) of Proposition 2.2 are contained in [6], Proposition 3.2.

One may also note here that, by (2.16) and Part (v) of Proposition 2.2, the function

P_{α} (X; \cdot)

may be regarded as the tail function of some r.v.

Z_{α}

:

P_{α} (X; u) = P (Z_{α} ⩾ u)

for all real u.

Some parts of Propositions 2.1 and 2.2 are illustrated in Example 1.3 in [8] and in the corresponding figure there.

Proposition 2.3.

P_{α} (X; x)

is continuous in

α \in [0, \infty]

in the following sense: Suppose that

(α_{n})

is any sequence in

[0, \infty)

converging to

α \in [0, \infty]

, with

β : = {sup}_{n} α_{n}

and

X \in X_{β}

; then

P_{α_{n}} (X; x) \to P_{α} (X; x)

.

In view of Parts (ii) and (iii) of Proposition 2.1, the condition

X \in X_{β}

in Proposition 2.3 is essential.

Let us now turn to the question of stability of

P_{α} (X; x)

with respect to (the distribution of) X. First here, recall that one of a number of mutually equivalent definitions of the convergence in distribution,

X_{n} \underset{n \to \infty}{\overset{D}{⟶}} X

, of a sequence of r.v.’s

X_{n}

to an r.v. X is the following:

P (X_{n} ⩾ x) \underset{n \to \infty}{⟶} P (X ⩾ x)

for all real x such that

P (X = x) = 0

; cf.; cf. e.g. [15, §4 and Theorem 2.1].

We shall also need the following uniform integrability condition:

\begin{matrix} sup_{n} E {(X_{n})}_{+}^{α} I {X_{n} > N} \underset{N \to \infty}{⟶} 0 if α \in (0, \infty), \end{matrix}

(2.20)

\begin{matrix} sup_{n} E e^{λ X_{n}} I {X_{n} > N} \underset{N \to \infty}{⟶} 0 for each λ \in Λ_{X} if α = \infty . \end{matrix}

(2.21)

Proposition 2.4. Suppose that

α \in (0, \infty]

. Then

P_{α} (X; x)

is continuous in X in the following sense. Take any sequence

{(X_{n})}_{n \in N}

of real-valued r.v.’s such that

X_{n} \underset{n \to \infty}{\overset{D}{⟶}} X

and the uniform integrability condition (2.20)- (2.21) is satisfied. Then one has the following.

(i): The convergence

$P_{α} (X_{n}; x) \underset{n \to \infty}{⟶} P_{α} (X; x)$

(2.22)

takes place for all real $x \neq x_{*}$ , where $x_{*} = x_{*, X}$ as in (2.17); thus, by Parts (i) and (iv) of Proposition 2.2, (2.22) holds for all real x that are points of continuity of the function $P_{α} (X; \cdot)$ .
(ii): The convergence (2.22) holds for $x = x_{*}$ as well, provided that $P (X_{n} = x_{*}) \underset{n \to \infty}{⟶} P (X = x_{*})$ . In particular, (2.22) holds for $x = x_{*}$ if $P (X = x_{*}) = 0$ .

Note that in the case

α = 0

the convergence (2.22) may fail to hold, not only for

x = x_{*}

, but for all real x such that

P (X = x) > 0

.

***

Let us now discuss matters of monotonicity of

P_{α} (X; x)

in X, with respect to various orders on the mentioned set

X

of all real-valued r.v.’s X. Using the family of function classes

H^{α}

, defined by (2.11), one can introduce a family of stochastic orders, say

\overset{α + 1}{⩽}

, on the set

X

by the formula

X \overset{α + 1}{⩽} Y \overset{def}{\Leftrightarrow} E g (X) ⩽ E g (Y) for all g \in H^{α},

where

α \in [0, \infty]

and X and Y are in

X

. To avoid using the term “order” with two different meanings in one phrase, let us refer to the relation

\overset{α + 1}{⩽}

as the stochastic dominance of order

α + 1

, rather than the stochastic order of order

α + 1

. In view of (2.11), it is clear that

X \overset{α + 1}{⩽} Y \Leftrightarrow E g_{α; t} (X) ⩽ E g_{α; t} (Y) for all t \in T_{α},

(2.23)

so that, in the case when

α = m - 1

for some natural number m, the order

\overset{α + 1}{⩽}

coincides with the “m-increasing-convex” order

⩽_{m - icx}

as defined e.g. on page 206 in [16]. In particular,

\begin{matrix} X \overset{1}{⩽} Y & \Leftrightarrow P (X > t) ⩽ P (Y > t) for all t \in R \\ \Leftrightarrow P (X ⩾ t) ⩽ P (Y ⩾ t) for all t \in R \Leftrightarrow X \overset{st}{⩽} Y, \end{matrix}

(2.24)

where

\overset{st}{⩽}

denotes the usual stochastic dominance of order 1, and:

\begin{matrix} X \overset{2}{⩽} Y & \Leftrightarrow E {(X - t)}_{+} ⩽ E {(Y - t)}_{+} for all t \in R, \end{matrix}

(2.25)

so that

\overset{2}{⩽}

coincides with the usual stochastic dominance of order 2. Also,

X \overset{st}{⩽} Y iff for some r.v.’s X_{1} and Y_{1} one has X_{1} ⩽ Y_{1}, X_{1} \overset{D}{=} X, and Y_{1} \overset{D}{=} Y,

(2.26)

where

\overset{D}{=}

denotes the equality in distribution.

By (2.12), the orders

\overset{α + 1}{⩽}

are graded in the sense that

if X \overset{α + 1}{⩽} Y for some α \in [0, \infty], then X \overset{β + 1}{⩽} Y for all β \in [α, \infty] .

(2.27)

A stochastic order, which is a “mirror image” of the order

\overset{α + 1}{⩽}

, but only for nonnegative r.v.’s, was presented by Fishburn in [17]; note Theorem 2 in [17] on the relation with a “bounded” version of this order, previously introduced and studied in [18]. Denoting the corresponding Fishburn [17] order by

⩽_{α + 1}

, one has

X ⩽_{α + 1} Y \Leftrightarrow (- Y) \overset{α + 1}{⩽} (- X),

(2.28)

for nonnegative r.v.’s X and Y. However, as shown in this paper (recall Proposition 2.1), the condition of the nonnegativity of the r.v.’s is not essential; without it, one can either deal with infinite expected values or, alternatively, require that they be finite. The case when α is an integer was considered, in a different form, in [19].

One may also consider the order

⩽_{α}^{- 1}

defined by the condition that

X ⩽_{α}^{- 1} Y

if and only if X and Y are nonnegative r.v.’s and

F_{X}^{(- α)} (p) ⩽ F_{Y}^{(- α)} (p)

for all

p \in (0, 1)

, where

α \in (0, \infty)

,

\begin{matrix} F_{X}^{(- α)} (p) & : = \frac{1}{Γ (α)} \int_{[0, p)} {(p - u)}^{α - 1} d F_{X}^{- 1} (u), \end{matrix}

(2.29)

\begin{matrix} F_{X}^{- 1} (p) & : = inf {x \in [0, \infty) : P (X ⩽ x) ⩾ p} = - Q (- X; p) \end{matrix}

(2.30)

with

Q (\cdot; \cdot)

as in (3.3), and the integral in (2.29) is understood as the Lebesgue integral with respect to the nonnegative Borel measure

μ_{X}^{- 1}

on

[0, 1)

defined by the condition that

μ_{X}^{- 1} ([0, p)) = F_{X}^{- 1} (p)

for all

p \in (0, 1)

; cf. [20,21]. Note that

F_{X}^{(- 1)} (p) = F_{X}^{- 1} (p)

. For nonnegative r.v.’s, the order

⩽_{α + 1}^{- 1}

coincides with the order

⩽_{α + 1}

if

α \in {0, 1}

; again see [20,21]. Even for nonnegative r.v.’s, it seems unclear how the orders

⩽_{α + 1}

and

⩽_{α + 1}^{- 1}

relate to each other for positive real

α \neq 1

; see e.g. the discussion following Proposition 1 in [20] and Note

^{1}

on page 100 in [22].

The following theorem summarizes some of the properties of the tail probability bounds

P_{α} (X; x)

established above and also adds a few simple properties of these bounds.

Theorem 2.5. The following properties of the tail probability bounds

P_{α} (X; x)

are valid.

Model-independence:: $P_{α} (X; x)$ depends on the r.v. X only through the distribution of X.
Monotonicity in X:: $P_{α} (\cdot; x)$ is nondecreasing with respect to the stochastic dominance of order $α + 1$ : for any r.v. Y such that $X \overset{α + 1}{⩽} Y$ , one has $P_{α} (X; x) ⩽ P_{α} (Y; x)$ . Therefore, $P_{α} (\cdot; x)$ is nondecreasing with respect to the stochastic dominance of any order $γ \in [1, α + 1]$ ; in particular, for any r.v. Y such that $X ⩽ Y$ , one has $P_{α} (X; x) ⩽ P_{α} (Y; x)$ .
Monotonicity in α:: $P_{α} (X; x)$ is nondecreasing in $α \in [0, \infty]$ .
Monotonicity in x:: $P_{α} (X; x)$ is nonincreasing in $x \in R$ .
Values:: $P_{α} (X; x)$ takes only values in the interval $[0, 1]$ .
α-concavity in x:: $P_{α} {(X; x)}^{- 1 / α}$ is convex in x if $α \in (0, \infty)$ , and $ln P_{α} (X; x)$ is concave in x if $α = \infty$ .
Stability in x:: $P_{α} (X; x)$ is continuous in x at any point $x \in R$ – except the point $x = x_{*}$ when $p_{*} > 0$ .
Stability in α:: Suppose that a sequence $(α_{n})$ is as in Proposition 2.3. Then $P_{α_{n}} (X; x) \to P_{α} (X; x)$ .
Stability in X:: Suppose that $α \in (0, \infty]$ and a sequence $(X_{n})$ is as in Proposition 2.4. Then $P_{α} (X_{n}; x) \to P_{α} (X; x)$ .
Translation invariance:: $P_{α} (X + c; x + c) = P_{α} (X; x)$ for all real c.
Consistency:: $P_{α} (c; x) = P_{0} (c; x) = I {c ⩾ x}$ for all real c; that is, if the r.v. X is the constant c, then all the tail probability bounds $P_{α} (X; x)$ precisely equal the true tail probability $P (X ⩾ x)$ .
Positive homogeneity:: $P_{α} (κ X; κ x) = P_{α} (X; x)$ for all real $κ > 0$ .

3. An Optimal Three-Way Stable and Three-Way Monotonic Spectrum of Upper Bounds on Quantiles

Take any

p \in (0, 1)

(3.1)

and introduce the generalized inverse (with respect to x) of the bound

P_{α} (X; x)

by the formula

Q_{α} (X; p) : = inf E_{α, X} (p) = inf \{x \in R : P_{α} (X; x) < p\},

(3.2)

where

E_{α, X} (p)

is as in (2.19). In particular, in view of the equality in (2.6),

Q (X; p) : = Q_{0} (X; p) = inf \{x \in R : P (X ⩾ x) < p\} = inf \{x \in R : P (X > x) < p\},

(3.3)

which is a

(1 - p)

-quantile of (the distribution of) the r.v. X; actually,

Q (X; p)

is the largest one in the set of all

(1 - p)

-quantiles of X.

It follows immediately from (3.2), (2.13), and (3.3) that

\begin{matrix} Q_{α} (X; p) is an upper bound on the quantile Q (X; p), and \\ Q_{α} (X; p) is nondecreasing in α \in [0, \infty] . \end{matrix}

(3.4)

Thus, one has a monotonic spectrum of upper bounds,

Q_{α} (X; p)

, on the quantile

Q (X; p)

, ranging from the tightest bound,

Q_{0} (X; p) = Q (X; p)

, to the loosest one,

Q_{\infty} (X; p)

, which latter is based on the exponential bound

P_{\infty} (X; x) = {inf}_{λ > 0} E e^{λ (X - x)}

on

P (X ⩾ x)

.

Also, it is obvious from (3.2) that

Q_{α} (X; p) is nonincreasing in p \in (0, 1) .

(3.5)

Basic properties of

Q_{α} (X; p)

are collected in

Proposition 3.1. Recall the definitions of

x_{*}

and

x_{α}

in (2.17) and (2.18). The following statements are true.

(i): $Q_{α} (X; p) \in R$ .
(ii): If $p \in (0, p_{*}] \cap (0, 1)$ then $Q_{α} (X; p) = x_{*}$ .
(iii): $Q_{α} (X; p) ⩽ x_{*}$ .
(iv): $Q_{α} (X; p) \underset{p ↓ 0}{⟶} x_{*}$ .

Figure 1. Illustration of Proposition 3.1

(v): If $α \in (0, \infty]$ , then the function

$(p_{*}, 1) ∋ p \mapsto Q_{α} (X; p) \in (x_{α}, x_{*})$

(3.6)

is the unique inverse to the continuous strictly decreasing function

$(x_{α}, x_{*}) ∋ x \mapsto P_{α} (X; x) \in (p_{*}, 1) .$

(3.7)

Therefore, the function (3.6), too, is continuous and strictly decreasing.
(vi): If $α \in (0, \infty]$ , then for any $y \in (- \infty, Q_{α} (X; p))$ , one has $P_{α} (X; y) > p$ .
(vii): If $α \in [1, \infty]$ , then $Q_{α} (X; p) > E X$ .

Example 3.2. Some parts of Proposition 3.1 are illustrated in Figure 1, with graphs

{(p, Q_{α} (X; p)) : 0 < p < 1}

in the important case when the r.v. X takes only two values. Then, by the translation invariance property stated below in Theorem 2.5, without loss of generality (w.l.o.g.)

E X = 0

. Thus,

X = X_{a, b}

, where a and b are positive real numbers and

X_{a, b}

is a r.v. with the uniquely determined zero-mean distribution on the set

{- a, b}

. Let us take

a = 1

and

b = 3

, with the values of α equal 0 (black),

\frac{1}{2}

(blue), 1 (green), 2 (orange), and ∞ (red). One may compare this picture with the one for

P_{α} (X; x)

in Example 1.3 in [8] (where the same values of a, b, and α were used), having in mind that the function

Q_{α} (X; \cdot)

is a generalized inverse to the function

P_{α} (X; \cdot)

.

The definition (3.2) of

Q_{α} (X; p)

is rather complicated, in view of the definition (2.5) of

P_{α} (X; x)

. So, the following theorem will be useful, as it provides a more direct expression of

Q_{α} (X; p)

; at that, one may again recall (3.3), concerning the case

α = 0

.

Theorem 3.3. For all

α \in (0, \infty]

Q_{α} (X; p) = inf_{t \in T_{α}} B_{α} (X; p) (t),

(3.8)

where

T_{α}

is as in (2.8) and

B_{α} (X; p) (t) : = \{\begin{matrix} t + \frac{{∥ (X - t)}_{+} ∥_{α}}{p^{1 / α}} & for α \in (0, \infty), \\ t ln \frac{E e^{X / t}}{p} & for α = \infty . \end{matrix}

(3.9)

Proof of Theorem 3.3. The proof is based on the simple observation, following immediately from the definitions (2.9) and (3.9), that the dual level sets for the functions

{\tilde{A}}_{α} (X; x)

and

B_{α} (X; p)

are the same:

T_{{\tilde{A}}_{α} (X; x)} (p) = T_{B_{α} (X; p)} (x)

(3.10)

for all

α \in (0, \infty]

,

x \in R

, and

p \in (0, 1)

, where

\begin{matrix} T_{{\tilde{A}}_{α} (X; x)} (p) & : = {t \in T_{α} : {\tilde{A}}_{α} (X; x) (t) < p} and \\ T_{B_{α} (X; p)} (x) & : = {t \in T_{α} : B_{α} (X; p) (t) < x} . \end{matrix}

Indeed, by (2.7) and (3.10),

\begin{matrix} P_{α} (X; x) < p \Leftrightarrow inf_{t \in T_{α}} {\tilde{A}}_{α} (X; x) (t) < p \\ \Leftrightarrow T_{{\tilde{A}}_{α} (X; x)} (p) \neq Ø \Leftrightarrow T_{B_{α} (X; p)} (x) \neq Ø & \Leftrightarrow x > inf_{t \in T_{α}} B_{α} (X; p) (t) . \end{matrix}

Now, (3.8) follows immediately by (3.2). ☐

Note that the case

α = \infty

of Theorem 3.3 is a special case of Proposition 1.5 in [23], and the above proof of Theorem 3.3 is similar to that of Proposition 1.5 in [23]. Correspondingly, the duality presented in the above proof of Theorem 3.3 is a generalization of the bilinear Legendre–Fenchel duality considered in [23].

The following theorem presents the most important properties of the quantile bounds

Q_{α} (X; p)

, in addition to the variational representation of

Q_{α} (X; p)

given by Theorem 3.3.

Theorem 3.4. The following properties of the quantile bounds

Q_{α} (X; p)

are valid.

Model-independence:: $Q_{α} (X; p)$ depends on the r.v. X only through the distribution of X.
Monotonicity in X:: $Q_{α} (\cdot; p)$ is nondecreasing with respect to the stochastic dominance of order $α + 1$ : for any r.v. Y such that $X \overset{α + 1}{⩽} Y$ , one has $Q_{α} (X; p) ⩽ Q_{α} (Y; p)$ . Therefore, $Q_{α} (\cdot; p)$ is nondecreasing with respect to the stochastic dominance of any order $γ \in [1, α + 1]$ ; in particular, for any r.v. Y such that $X ⩽ Y$ , one has $Q_{α} (X; p) ⩽ Q_{α} (Y; p)$ .
Monotonicity in α:: $Q_{α} (X; p)$ is nondecreasing in $α \in [0, \infty]$ .
Monotonicity in p:: $Q_{α} (X; p)$ is nonincreasing in $p \in (0, 1)$ , and $Q_{α} (X; p)$ is strictly decreasing in $p \in [p_{*}, 1) \cap (0, 1)$ if $α \in (0, \infty]$ .
Finiteness:: $Q_{α} (X; p)$ takes only (finite) real values.
Concavity in $p^{- 1 / α}$ or in $ln \frac{1}{p}$ :: $Q_{α} (X; p)$ is concave in $p^{- 1 / α}$ if $α \in (0, \infty)$ , and $Q_{\infty} (X; p)$ is concave in $ln \frac{1}{p}$ .
Stability in p:: $Q_{α} (X; p)$ is continuous in $p \in (0, 1)$ if $α \in (0, \infty]$ .
Stability in X:: Suppose that $α \in (0, \infty]$ and a sequence $(X_{n})$ is as in Proposition 2.4. Then $Q_{α} (X_{n}; p) \to Q_{α} (X; p)$ .
Stability in α:: Suppose that $α \in (0, \infty]$ and a sequence $(α_{n})$ is as in Proposition 2.3. Then $Q_{α_{n}} (X; p) \to Q_{α} (X; p)$ .
Translation invariance:: $Q_{α} (X + c; p) = Q_{α} (X; p) + c$ for all real c.
Consistency:: $Q_{α} (c; p) = c$ for all real c; that is, if the r.v. X is the constant c, then all of the quantile bounds $Q_{α} (X; p)$ equal c.
Positive sensitivity:: Suppose here that $X ⩾ 0$ . If at that $P (X > 0) > 0$ , then $Q_{α} (X; p) > 0$ for all $α \in (0, \infty]$ ; if, moreover, $P (X > 0) > p$ , then $Q_{0} (X; p) > 0$ .
Positive homogeneity:: $Q_{α} (κ X; p) = κ Q_{α} (X; p)$ for all real $κ ⩾ 0$ .
Subadditivity:: $Q_{α} (X; p)$ is subadditive in X if $α \in [1, \infty]$ ; that is, for any other r.v. Y (defined on the same probability space as X) one has:

$Q_{α} (X + Y; p) ⩽ Q_{α} (X; p) + Q_{α} (Y; p) .$
Convexity:: $Q_{α} (X; p)$ is convex in X if $α \in [1, \infty]$ ; that is, for any other r.v. Y (defined on the same probability space as X) and any $t \in (0, 1)$ one has

$Q_{α} ((1 - t) X + t Y; p) ⩽ (1 - t) Q_{α} (X; p) + t Q_{α} (Y; p)$

The inequality

Q_{1} (X; p) ⩽ Q_{\infty} (X; p)

, in other notations, was mentioned (without proof) in [24]; of course, this inequality is a particular, and important, case of the monotonicity of

Q_{α} (X; p)

in

α \in [0, \infty]

. That

Q_{α} (\cdot; p)

is nondecreasing with respect to the stochastic dominance of order

α + 1

was shown (using other notations) in [25] in the case

α = 1

.

The following two propositions complement the monotonicity property of

Q_{α} (X; p)

in X stated in Theorem 3.4.

Proposition 3.5. The upper bound

α + 1

on γ in the statement of the monotonicity of

Q_{α} (X; p)

in X in Theorem 3.4 is exact in the following rather strong sense. For any

α \in [0, \infty)

, there exist r.v.’s X and Y in

X_{α}

such that

X \overset{γ}{⩽} Y

for all

γ \in (α + 1, \infty]

, whereas

Q_{α} (X; p) > Q_{α} (Y; p)

.

Proposition 3.6. Suppose that an r.v. Y is stochastically strictly greater than X (which may be written as

X \overset{st}{<} Y

; cf., (2.24)) in the sense that

X \overset{st}{⩽} Y

and for any

v \in R

there is some

u \in (v, \infty)

such that

P (X ⩾ u) < P (Y ⩾ u)

. Then

Q_{α} (X; p) < Q_{α} (Y; p)

if

α \in (0, \infty]

.

The latter proposition will be useful in the proof of Proposition 3.7 below.

Given the positive homogeneity, it is clear that the subadditivity and convexity properties of

Q_{α} (X; p)

easily follow from each other. In the statements in Theorem 3.4 on these two mutually equivalent properties, it was assumed that

α \in [1, \infty]

. One may ask whether this restriction is essential. The answer to this question is “yes”:

Proposition 3.7. There are r.v.’s X and Y such that for all

α \in [0, 1)

and all

p \in (0, 1)

one has

Q_{α} (X + Y; p) > Q_{α} (X; p) + Q_{α} (Y; p)

, so that the function

Q_{α} (\cdot; p)

is not subadditive (and, equivalently, not convex).

It is well known (see e.g. [1,2,26]) that

Q (X; p) = Q_{0} (X; p)

is not subadditive in X; it could therefore have been expected that

Q_{α} (X; p)

will not be subadditive in X if α is close enough to 0. In quite a strong and specific sense, Proposition 3.7 justifies such expectations.

***

Consider briefly the rather important case when the distribution of X belongs to a location-scale family; that is, when (the distribution of) the r.v. X has a probability density function (pdf) of the form

f_{μ, σ} (x) = \frac{1}{σ} f (\frac{x - μ}{σ})

(3.11)

for all real x, where f is a pdf,

μ \in R

(is the “location” parameter), and

σ \in (0, \infty)

(is the “scale” parameter). Then f may be referred to as the “standard” pdf of this family. Perhaps the most common example of a location-scale family is the normal distribution family, for which f is the standard normal pdf, and μ and σ are the mean and the standard deviation of the distribution.

Proposition 3.8. If the r.v. X has a pdf of the form (3.11), then

Q_{α} (X; p) = μ + σ Q_{α} (Z; p),

(3.12)

where Z stands for any r.v. with the “standard” pdf f.

This follows immediately by the translation invariance, positive homogeneity, and model-independence properties stated in Theorem 3.4. Note that, given any location-scale family,

Q_{α} (Z; p)

depends only on α and p.

Remark 3.9. It is shown in [8] that for small enough values of p the quantile bounds

Q_{α} (X; p)

are close enough to the true quantiles

Q_{0} (X; p) = {VaR}_{p} (X)

provided that the right tail of the distribution of X is light enough and regular enough, depending on α see Proposition 2.7 in [8].

For instance, if the r.v. X has the normal distribution with mean μ and standard deviation σ, then, by (3.12) and the monotonicity of

Q_{α} (X; p)

in α,

μ + σ Q_{0} (Z; p) = Q_{0} (X; p) ⩽ Q_{α} (X; p) ⩽ Q_{\infty} (X; p) = μ + σ Q_{\infty} (Z; p) .

(3.13)

Next, obviously

Q_{0} (Z; p) = Φ^{- 1} (1 - p)

, where

Φ^{- 1}

is the inverse to the standard normal distribution function Φ, and

Q_{\infty} (Z; p) = \sqrt{2 ln \frac{1}{p}}

. Also,

1 - Φ (u) = exp {- u^{2} / (2 + o (1))}

as

u \to \infty

. Therefore,

Q_{0} (Z; p) = Φ^{- 1} (1 - p) \underset{p ↓ 0}{\sim} \sqrt{2 ln \frac{1}{p}} = Q_{\infty} (Z; p)

. Here, as usual,

a \sim b

means

a / b \to 1

. Hence, by (3.13),

Q_{α} (X; p) \approx Q_{0} (X; p) = {VaR}_{p} (X)

for small

p > 0

and all

α \in (0, \infty]

.

Another easy to consider case, also illustrating Remark 3.9, is that of the exponential location-scale family, with the “standard” pdf f given by the formula

f (x) = e^{- x} I {x > 0}

.

Let then the r.v. X have the corresponding pdf

f_{μ, σ}

, so that

f_{μ, σ} (x) = \frac{1}{σ} exp \{- \frac{x - μ}{σ}\} I {x > μ}

. Let Z be any r.v. with the “standard” exponential pdf f. Then, obviously,

Q_{0} (Z; p) = ln \frac{1}{p}

. Also, it is not hard to see that here

Q_{\infty} (Z; p) = - W_{- 1} (- p / e)

, where

W_{- 1}

is the

(- 1)

-branch of the Lambert function [27, pages 3 and 16]; that is,

- Q_{\infty} (Z; p)

is the only root

u \in (- \infty, - 1]

of the equation

u e^{u} = - p / e

. Note that

u e^{u} = exp {(1 + o (1)) u}

as

u \to - \infty

. Therefore,

Q_{\infty} (Z; p) = - W_{- 1} (- p / e) \underset{p ↓ 0}{\sim} ln \frac{1}{p} = Q_{0} (Z; p)

. Hence, by the monotonicity in α, one has

Q_{α} (Z; p) \underset{p ↓ 0}{\sim} Q_{0} (Z; p)

uniformly in

α \in [0, \infty]

. Hence, again by (3.13),

Q_{α} (X; p) \approx Q_{0} (X; p) = {VaR}_{p} (X)

for small

p > 0

and all

α \in (0, \infty]

.

For

α \in [1, \infty)

and a r.v. Z as in the above paragraph, one has

B^{'} (0) = 1 - \frac{p_{α}}{p} ⩽ 0

if

0 < p ⩽ p_{α}

, where

B (t) : = B_{α} (Z; p) (t)

and

p_{α} : = \frac{Γ (α + 1)}{α^{α}}

; then, in view of Part (i) of Proposition 4.4, the infimum in (3.8) is attained at some point

t_{α} \in [0, \infty)

; in fact,

t_{α} = ln \frac{p_{α}}{p}

. It follows that

Q_{α} (Z; p) = α + ln \frac{p_{α}}{p}

for all

α \in [1, \infty)

and

p \in (0, p_{α})

; so, one can now establish directly that

Q_{α} (Z; p) \underset{p ↓ 0}{\sim} ln \frac{1}{p} = Q_{0} (Z; p)

for each

α \in [1, \infty)

.

4. Computation of the Tail Probability and Quantile Bounds

4.1. Computation of $P_{α} (X; x)$

The computation of

P_{α} (X; x)

in the case

α = 0

is straightforward, in view of the equality in (2.6). If

x \in [x_{*}, \infty)

, then the value of

P_{α} (X; x)

is easily found by Part (i) of Proposition 2.2. Therefore, in the rest of this subsection it may be assumed that

α \in (0, \infty]

and

x \in (- \infty, x_{*})

.

In the case when

α \in (0, \infty)

, using (2.5), the inequality

{(1 + λ (X - x) / α)}_{+}^{α} ⩽ 2^{{(α - 1)}_{+}} (λ^{α} X_{+}^{α} + {(α - λ x)}_{+}^{α}) / α^{α},

(4.1)

the condition

X \in X_{α}

, and dominated convergence, one sees that

A_{α} (X; x) (λ)

is continuous in

λ \in (0, \infty)

and right-continuous in

λ

at

λ = 0

(assuming the definition (2.3) for

λ = 0

as well), and hence

P_{α} (X; x) = inf_{λ \in [0, \infty)} A_{α} (X; x) (λ) .

(4.2)

Similarly, using in place of (4.1) the inequality

e^{λ X} ⩽ 1 + e^{λ_{0} X}

whenever

0 ⩽ λ ⩽ λ_{0}

, one can show that

A_{\infty} (X; x) (λ)

is continuous in

λ \in Λ_{X}

(recall (2.15)) and right-continuous in

λ

at

λ = 0

, so that (4.2) holds for

α = \infty

as well – provided that

X \in X_{\infty}

. Moreover, by the Fatou lemma for the convergence in distribution (see e.g. Theorem 5.3 in [15]),

A_{\infty} (X; x) (λ)

is lower-semicontinuous in

λ

at

λ = λ_{*} : = sup Λ_{X}

even if

λ_{*} \in R \ Λ_{X}

. It then follows by the convexity of

A_{\infty} (X; x) (λ)

in

λ

that

A_{\infty} (X; x) (λ)

is left-continuous in

λ

at

λ = λ_{*}

whenever

λ_{*} \in R

; at that, the natural topology on the set

[0, \infty]

is used, as it is of course possible that

A_{\infty} (X; x) (λ_{*}) = \infty

.

Since

x \in (- \infty, x_{*})

, one can find some

y \in (x, \infty)

such that

P (X ⩾ y) > 0

(of course, necessarily

y \in (x, x_{*}]

); so, one can introduce

λ_{max} : = λ_{max, α} : = λ_{max, α, X} : = \{\begin{matrix} \frac{α}{y - x} (\frac{1}{P {(X ⩾ y)}^{1 / α}} - 1) & if α \in (0, \infty), \\ \frac{1}{y - x} ln \frac{1}{P (X ⩾ y)} & if α = \infty . \end{matrix}

(4.3)

Then, by (2.3),

A_{α} (X; x) (λ) ⩾ E {(1 + λ (X - x) / α)}_{+}^{α} I {X ⩾ y} ⩾ {(1 + λ (y - x) / α)}^{α} P (X ⩾ y) > 1

if

α \in (0, \infty)

and

λ \in (λ_{max, α}, \infty)

, and

A_{\infty} (X; x) (λ) ⩾ E e^{λ (X - x)} I {X ⩾ y} ⩾ e^{λ (y - x)} P (X ⩾ y) > 1

if

λ \in (λ_{max, \infty}, \infty)

. Therefore, for all

α \in (0, \infty]

one has

A_{α} (X; x) (λ) > 1 ⩾ P_{α} (X; x) = {inf}_{λ \in (0, \infty)} A_{α} (X; x) (λ)

provided that

λ \in (λ_{max, \infty}, \infty)

, and hence

P_{α} (X; x) = inf_{λ \in [0, λ_{max, α}]} A_{α} (X; x) (λ), if α \in (0, \infty] and x \in (- \infty, x_{*}) .

(4.4)

Therefore and because

λ_{max, α} < \infty

, the minimization of

A_{α} (X; x) (λ)

in

λ

in (4.4) in order to compute the value of

P_{α} (X; x)

can be done effectively if

α \in [1, \infty]

, because in this case

A_{α} (X; x) (λ)

is convex in

λ

. At that, the positive-part moments

{E (1 + λ (X - x) / α)}_{+}^{α}

, which express

A_{α} (X; x) (λ)

for

α \in (0, \infty)

in accordance with (2.3), can be efficiently computed using formulas in [28]; cf. e.g. Section 3.2.3 in [6]. Of course, for specific kinds of distributions of the r.v. X, more explicit expressions for the positive-part moments can be used.

In the remaining case, when

α \in (0, 1)

, the function

λ \mapsto A_{α} (X; x) (λ)

cannot in general be “convexified” by any monotonic transformations in the domain and/or range of this function, and the set of minimizing values of

λ

does not even have to be connected, in the following rather strong sense:

Proposition 4.1. For any

α \in (0, 1)

,

p \in (0, 1)

, and

x \in R

, there is a r.v. X (taking three distinct values) such that

P_{α} (X; x) = p

and the infimum

{inf}_{λ \in (0, \infty)} A_{α} (X; x) (λ)

in (2.5) is attained at precisely two distinct values of

λ \in (0, \infty)

.

Proposition 4.1 is illustrated by

Example 4.2. Let X be a r.v. taking values

- \frac{27}{11}, - 1, 2

with probabilities

\frac{1}{4}, \frac{1}{4}, \frac{1}{2}

; then

x_{*} = 2

. Also let

α = \frac{1}{2}

and

x = 0

, so that

x \in (- \infty, x_{*})

, and then let

λ_{max}

be as in (4.3) with

y = x_{*} = 2

, so that here

λ_{max} = \frac{3}{4}

. Then the minimum of

A_{α} (X; 0) (λ)

over all real

λ ⩾ 0

equals

\frac{\sqrt{3}}{2}

and is attained at each of the two points,

λ = \frac{11}{54}

and

λ = \frac{1}{2}

, and only at these two points. The graph

\{(λ, A_{1 / 2} (X; 0) (λ)) : 0 ⩽ λ ⩽ λ_{max}\}

is shown here in Figure 2.

Figure 2. Illustration of Example 4.2

Nonetheless, effective minimization of

A_{α} (X; x) (λ)

in

λ

in (4.4) is possible even in the case

α \in (0, 1)

, say by the interval method. Indeed, take any

α \in (0, 1)

and write

A_{α} (X; x) (λ) = A_{α}^{+} (X; x) (λ) + A_{α}^{-} (X; x) (λ),

where (cf. (2.3))

A_{α}^{+} {(X; x) (λ) : = E (1 + λ (X - x) / α)}_{+}^{α} I {X ⩾ x}

and

A_{α}^{-} {(X; x) (λ) : = E (1 + λ (X - x) / α)}_{+}^{α} I {X < x}

. Just as

A_{α} (X; x) (λ)

is continuous in

λ \in [0, \infty)

, so are

A_{α}^{+} (X; x) (λ)

and

A_{α}^{-} (X; x) (λ)

. It is also clear that

A_{α}^{+} (X; x) (λ)

is nondecreasing and

A_{α}^{-} (X; x) (λ)

is nonincreasing in

λ \in [0, \infty)

.

So, as soon as the minimizing values of

λ

are bracketed as in (4.4), one can partition the finite interval

[0, λ_{max, α}]

into a large number of small subintervals

[a, b]

with

0 ⩽ a < b ⩽ λ_{max, α}

. For each such subinterval,

\begin{matrix} M_{a, b} : = max_{λ \in [a, b]} A_{α} (X; x) (λ) ⩽ A_{α}^{+} (X; x) (b) + A_{α}^{-} (X; x) (a), \\ m_{a, b} : = min_{λ \in [a, b]} A_{α} (X; x) (λ) ⩾ A_{α}^{+} (X; x) (a) + A_{α}^{-} (X; x) (b), \end{matrix}

so that, by the continuity of

A_{α}^{\pm} (X; x) (λ)

in

λ

,

M_{a, b} - m_{a, b} ⩽ A_{α}^{+} (X; x) (b) - A_{α}^{+} (X; x) (a) + A_{α}^{-} (X; x) (a) - A_{α}^{-} (X; x) (b) ⟶ 0

as

b - a \to 0

, uniformly over all subintervals

[a, b]

of the interval

[0, λ_{max, α}]

. Thus, one can effectively bracket the value

P_{α} (X; x) = {inf}_{λ \in [0, λ_{max, α}]} A_{α} (X; x) (λ)

with any degree of accuracy; this same approach will work, and perhaps may be sometimes useful, for

α \in [1, \infty)

as well.

4.2. Computation of $Q_{α} (X; p)$

Proposition 4.3. (Quantile bounds: Attainment and bracketing).

(i): If $α \in (0, \infty)$ , then ${inf}_{t \in T_{α}} B_{α} (X; p) (t) = {inf}_{t \in R} B_{α} (X; p) (t)$ in (3.8) is attained at some $t_{opt} \in R$ and hence

$Q_{α} (X; p) = min_{t \in R} B_{α} (X; p) (t) = B_{α} (X; p) (t_{opt});$

(4.5)

moreover, for any

$s \in R and \tilde{p} \in (p, 1),$

necessarily

$t_{opt} \in [t_{min}, t_{max}],$

(4.6)

where

$t_{max} : = B_{α} (X; p) (s), t_{min} : = t_{0, min} \land t_{1, min},$

(4.7)

$t_{0, min} : = Q_{0} (X; \tilde{p}), t_{1, min} : = \frac{{(\tilde{p} / p)}^{1 / α} t_{0, min} - t_{max}}{{(\tilde{p} / p)}^{1 / α} - 1} .$

(4.8)
(ii): Suppose now that $α = \infty$ . Then ${inf}_{t \in T_{α}} B_{α} (X; p) (t) = {inf}_{t \in (0, \infty)} B_{α} (X; p) (t)$ in (3.8) is attained, and hence

$Q_{\infty} (X; p) = min_{t \in (0, \infty)} B_{\infty} (X; p) (t)$

unless

$x_{*} < \infty and p ⩽ p_{*},$

(4.9)

where $x_{*}$ and $p_{*}$ are as in (2.17). On the other hand, if conditions (4.9) hold, then $B_{\infty} (X; p) (t)$ is strictly increasing in $t > 0$ and hence ${inf}_{t \in T_{α}} B_{α} (X; p) (t) = {inf}_{t \in (0, \infty)} B_{α} (X; p) (t)$ in (3.8) is not attained; rather,

$Q_{\infty} (X; p) = inf_{t > 0} B_{\infty} (X; p) (t) = B_{\infty} (X; p) (0 +) = x_{*} .$

For instance, in the case when

α = 0.5

,

p = 0.05

, and X has the Gamma distribution with the shape and scale parameters equal to

2.5

and 1, respectively, Proposition 4.3 yields

t_{min} > 4.01

(using

\tilde{p} = 0.095

) and

t_{max} < 6.45

.

When

α = 0

, the quantile bound

Q_{α} (X; p)

is simply the quantile

Q (X; p)

, which can be effectively computed by Formula (3.3), since the tail probability

P (X > x)

is monotone in x. Next, as was noted in the proof of Theorem 3.4,

B_{α} (X; p) (t)

is convex in t when

α \in [1, \infty]

, which provides for an effective computation of

Q_{α} (X; p)

by Formula (3.8).

Therefore, it remains to consider the computation – again by Formula (3.8) – of

Q_{α} (X; p)

for

α \in (0, 1)

. In such a case, as in Section 4.1, one can use an interval method. As soon as the minimizing values of t are bracketed as in (4.6), one can partition the finite interval

[t_{min}, t_{max}]

into a large number of small subintervals

[a, b]

with

t_{min} ⩽ a < b ⩽ t_{max}

. For each such subinterval,

\begin{matrix} M_{a, b} : = max_{t \in [a, b]} B_{α} (X; p) (t) ⩽ b + p^{- 1 / α} {∥ {(X - a)}_{+} ∥}_{α}, \\ m_{a, b} : = min_{t \in [a, b]} B_{α} (X; p) (t) ⩾ a + p^{- 1 / α} {∥ {(X - b)}_{+} ∥}_{α}, \end{matrix}

so that, by the continuity of

{∥ (X - t)}_{+} ∥_{α}

in t,

M_{a, b} - m_{a, b} ⩽ b - a + p^{- 1 / α} {(∥ (X - a)}_{+} ∥_{α} {- ∥ (X - b)}_{+} ∥_{α}) ⟶ 0

as

b - a \to 0

, uniformly over all subintervals

[a, b]

of the interval

[t_{min}, t_{max}]

. Thus, one can effectively bracket the value

Q_{α} (X; p) = {inf}_{t \in R} B_{α} (X; p) (t)

; this same approach will work, and perhaps may be useful, for

α \in [1, \infty)

as well.

In accordance with Proposition 3.2 in [6], consider

x_{* *} : = x_{* *, X} : = sup ((supp X) \ {x_{*}}) \in [- \infty, x_{*}] \subseteq [- \infty, \infty] .

(4.10)

The following proposition will be useful.

Proposition 4.4.

(i): If $α \in [1, \infty]$ , then $B_{α} (X; p) (t)$ is convex in the pair $(X, t) \in X_{α} \times T_{α}$ .
(ii): If $α \in (1, \infty)$ , then $B_{α} (X; p) (t)$ is strictly convex in $t \in (- \infty, x_{* *}] \cap R$ .
(iii): $B_{\infty} (X; p) (t)$ is strictly convex in $t \in {s \in (0, \infty) : E e^{X / s} < \infty}$ , unless $P (X = c) = 1$ for some $c \in R$ .

If

α \in (1, \infty)

then, by Part (ii) of Proposition 4.4 and Part (i) of Proposition 4.3, the set

\underset{t \in R}{argmin} B_{α} (X; p) (t)

is a singleton one; that is, there is exactly one minimizer

t \in R

of

B_{α} (X; p) (t)

. If

α = 1

, then

B_{α} (X; p) (t) = B_{1} (X; p) (t)

is convex, but not strictly convex, in t, and the set

\underset{t \in R}{argmin} B_{α} (X; p) (t)

of all minimizers of

B_{α} (X; p) (t)

in t coincides with the set of all

(1 - p)

-quantiles of X, as mentioned at the conclusion of the derivation of the identity (5.10). Thus, if

α = 1

, then the set

\underset{t \in R}{argmin} B_{α} (X; p) (t)

may in general be, depending on p and the distribution of X, a nonzero-length closed interval. Finally, if

α \in (0, 1)

then, in general, the set

\underset{t \in R}{argmin} B_{α} (X; p) (t)

does not have to be connected:

Proposition 4.5. For any

α \in (0, 1)

,

p \in (0, 1)

, and

x \in R

, there is a r.v. X (taking three distinct values) such that

Q_{α} (X; p) = x

and the infimum

{inf}_{t \in T_{α}} B_{α} (X; p) (t) = {inf}_{t \in R} B_{α} (X; p) (t)

in (3.8) is attained at precisely two distinct values of t.

Proposition 4.5 follows immediately from Proposition 4.1, by the duality (3.10) and the change-of-variables identity

A_{α} (X; x) (λ) = {\tilde{A}}_{α} (X; x) (x - α / λ)

for

α \in (0, \infty)

, used to establish (2.7)–(2.9). At that,

λ \in (0, \infty)

is one of the two minimizers of

A_{α} (X; x) (λ)

in Proposition 4.1 if and only if

t : = x - α / λ

is one of the two minimizers of

B_{α} (X; p) (t)

in Proposition 4.5.

Proposition 4.1 is illustrated by the following example, which is obtained from Example 4.2 by the same duality (3.10).

Example 4.6.

As in Example 4.2, let

α = \frac{1}{2}

, and let X be a r.v. taking values

- \frac{27}{11}, - 1, 2

with probabilities

\frac{1}{4}, \frac{1}{4}, \frac{1}{2}

. Also let

p = \frac{\sqrt{3}}{2}

. Then the minimum of

B_{α} (X; p) (t)

over all real t equals zero and is attained at each of the two points,

t = - \frac{27}{11}

and

t = - 1

, and only at these two points. The graph

\{(t, B_{1 / 2} (X; \frac{\sqrt{3}}{2}) (t)) : - 3 ⩽ t ⩽ 3\}

is shown in Figure 3. The minimizing values of t here,

- \frac{27}{11}

and

- 1

, are related with the minimizing values of λ in Example 4.2,

\frac{11}{54}

and

\frac{1}{2}

, by the mentioned formula

t = x - α / λ

(here, with

x = 0

and

α = \frac{1}{2}

).

Figure 3. Illustration of Example 4.6

4.3. Optimization of the Risk Measures $Q_{α} (X; p)$ with Respect to X

As was pointed out, the variational representation of

Q_{α} (X; p)

given in (3.8) allows for a comparatively easy incorporation of these risk measures into more specialized optimization problems, with restrictions on the r.v. X. Indeed, (3.8) immediately yields the following generalization of Theorem 14 of Rockafellar and Uryasev [2]:

Theorem 4.7. (Optimization shortcut.) Take any

α \in (0, \infty]

and any

p \in (0, 1)

. Let

Y_{α}

be any subset of the set

X_{α}

of r.v.’s defined by Formula (2.14). Then, for any

α \in (0, \infty]

and any

p \in (0, 1)

, the minimization of the risk measure

Q_{α} (X; p)

in

X \in Y_{α}

is equivalent to the minimization of

B_{α} (X; p) (t)

in

(t, X) \in (T_{α}, Y_{α})

, in the sense that

inf_{X \in Y_{α}} Q_{α} (X; p) = inf_{(t, X) \in (T_{α}, Y_{α})} B_{α} (X; p) (t) .

(4.11)

The mentioned Theorem 14 in [2] is the special case of Theorem 4.7 corresponding to

α = 1

; recall that in this case, according to (5.1),

Q_{α} (X; p)

coincides with

{CVaR}_{p} (X)

.

Suppose that

α \in [1, \infty]

and the set

Y_{α}

is convex. Then, in view of Part (i) of Proposition 4.4, computing the infimum on the right-hand side of (4.11) is a problem of convex optimization, for which there are very effective algorithms.

In view of the variational representations of

P_{α} (X; x)

given in (2.5) and (2.7), the result similar to Theorem 4.7 obviously holds for

P_{α} (X; x)

as well.

When the uncertain potential losses on the assets under consideration are modeled as jointly normal r.v.’s, the optimization can be further simplified. Indeed, suppose that the column matrix

X = {[X_{1}, \dots, X_{n}]}^{T}

of the uncertain losses

X_{1}, \dots, X_{n}

on assets

1, \dots, n

is multivariate normal with mean vector

μ = {[μ_{1}, \dots, μ_{n}]}^{T}

and

n \times n

covariance matrix Σ; here, as usual,

^{T}

denotes the matrix transposition. Let

w = {[w_{1}, \dots, w_{n}]}^{T}

be the column matrix of the weights of the assets

1, \dots, n

in the considered investment portfolio, so that the potential loss on the portfolio is

X : = w \cdot X : = w^{T} X = w_{1} X_{1} + \dots + w_{n} X_{n}

, which is normally distributed with mean

μ = w \cdot μ

and standard deviation

σ = \sqrt{w^{T} Σ w}

. Thus, in view of Proposition 3.8, the investor is now in the Markowitz mean-variance risk-assessment framework. For instance, the problem of minimizing the risk measure

Q_{α} (X; p)

given the mean loss μ (which, it is hoped, is negative) is equivalent to the quadratic optimization problem of minimizing the value of the quadratic form

w^{T} Σ w

over all weight “vectors”

w

satisfying the restrictions (say)

w \cdot μ = μ

,

w \cdot 1 = 1

, and

K w ⩾ 0

, where

1 : = {[{\underset{︸}{1, \dots, 1}}_{n}]}^{T}

, K is a rectangular real matrix,

0

is the the zero column matrix of the appropriate height, and the inequality

K w ⩾ 0

is considered component-wise, so that the latter inequality requires some or all of the weights

w_{1}, \dots, w_{n}

(or some of their linear combinations) to be nonnegative.

4.4. Additional Remarks on the Computation and Optimization

As demonstrated in Propositions 4.1 and 4.5, the computation of

P_{α} (X; x)

and

Q_{α} (X; p)

in the case

α \in (0, 1)

inherits some of the difficulties known for the case

α = 0

, when

Q_{α} (X; p)

coincides with

{VaR}_{p} (X)

.

One may also note that – even when a minimizing value of

λ

or t in Formulas (2.5) – (2.7), or (3.8) is not identified quite perfectly – one still obtains, by those formulas, an upper bound on

P_{α} (X; x)

or

Q_{α} (X; p)

and hence on the true tail probability

P (X ⩾ x)

or the true quantile

Q (X; p)

, respectively. A similar remark is valid concerning the optimization shortcut (4.11).

Using variational formulas – of which Formulas (2.5), (2.7), and (3.8) are examples – to define or compute measures of risk is not peculiar to the present paper. Indeed, as mentioned previously, the special case of (3.8) with

α = 1

is the well-known variational representation (5.10) of

CVaR

, obtained in [2,3,26]. The risk measure given by the the Securities and Exchange Commission (SEC) rules Subsection 3.2 in [1] is another example where the calculations are done, in effect, according to a certain minimization formula, which is somewhat implicit and complicated in that case.

5. Implications for Risk Assessment in Finance and Inequality Modeling in Economics

5.1. The Spectrum ${(Q_{α} (X; p))}_{α \in [0, \infty]}$ Contains $VaR$ and $CVaR$ .

In financial literature (see, e.g., [2,26,29]), the quantile bounds

Q_{0} (X; p)

and

Q_{1} (X; p)

are known as the value at risk and conditional value at risk, denoted as

{VaR}_{p} (X)

and

{CVaR}_{p} (X)

, respectively:

Q_{0} (X; p) = {VaR}_{p} (X) and Q_{1} (X; p) = {CVaR}_{p} (X);

(5.1)

here, X is interpreted as a priori uncertain potential loss. The value of

Q_{1} (X; p)

is also known as the expected shortfall (ES) [30], average value at risk (AVaR) [31] and expected tail loss (ETL) [32]. As indicated in [2], at least in the case when there is no atom at the quantile point

Q (X; p)

, the quantile bound

Q_{1} (X; p)

is also called the “mean shortfall” [33], whereas the difference

Q_{1} (X; p) - Q (X; p)

is referred to as “mean excess loss” [34,35].

5.2. The Spectrum Parameter α as a Risk Sensitivity Index

Greater values of the spectrum parameter α correspond to greater sensitivity to risk; cf., e.g., [36]. This is manifested, first of all, by the monotonicity of

Q_{α} (X; p)

in α, as stated in Theorem 3.4). (In the normal-distribution realm, this monotonicity is expressed as the growing (with α) weight of the standard deviation σ of the loss X in its linear combination with the mean μ in (3.12).)

Moreover, in view of the monotonicity in X (also stated in Theorem 3.4) and Proposition 3.5, the sensitivity index α is in a one-to-one correspondence with the highest order of the stochastic dominance respected by

Q_{α} (X; p)

.

As pointed out in the Introduction, the most popular coherent risk measure

CVaR

has a fixed and rather limited sensitivity to risk and thus allows of no variation in the degree of such sensitivity. In fact, one can easily construct two investment portfolios such that

(i): one of the portfolios is clearly riskier than the other;
(ii): this distinction is sensed (to varying degrees, depending on α) by all the risk measures $Q_{α} (X; p)$ with $α \in (1, \infty)$ ;
(iii): yet, the values of ${CVaR}_{p} = Q_{1} (X; p)$ are the same for both portfolios.

For instance, let X and Y denote the potential losses corresponding to two different investments portfolios. Suppose that there are mutually exclusive events

E_{1}

and

E_{2}

and real numbers

p_{*} \in (0, 1)

and

δ \in (0, 1)

such that (i)

P (E_{1}) = P (E_{2}) = p_{*} / 2

; (ii) the loss of either portfolio is 0 if the event

E_{1} \cup E_{2}

does not occur; (iii) the loss of the X-portfolio is 1 if the event

E_{1} \cup E_{2}

occurs; and (iv) the loss of the Y-portfolio is

1 - δ

if the event

E_{1}

occurs, and it is

1 + δ

if the event

E_{2}

occurs. Thus, the r.v. X takes values 0 and 1 with probabilities

1 - p_{*}

and

p_{*}

, and the r.v. Y takes values 0,

1 - δ

, and

1 + δ

with probabilities

1 - p_{*}

,

p_{*} / 2

, and

p_{*} / 2

, respectively. Hence,

E X = E Y

, that is, the expected losses of the two portfolios are the same. Clearly, the distribution of X is less dispersed than that of Y, both intuitively and also in the formal sense that

X \overset{α + 1}{⩽} Y

for all

α \in [1, \infty]

. So, everyone will probably say that the Y-portfolio is riskier than the X-portfolio. However, for any

p \in (p_{*}, 1)

it is easy to see, by (3.3), that

Q_{0} (X; p) = 0 = Q_{0} (Y; p)

, and hence, in view of (5.10),

Q_{1} (Y; p) = \frac{1}{p} E Y = \frac{p_{*}}{p} = \frac{1}{p} E X = Q_{1} (X; p)

. Using also the continuity of

Q_{α} (\cdot; p)

in p, as stated in Theorem 3.4, one concludes that the

Q_{1} (\cdot; p) = {CVaR}_{p} (\cdot)

risk value of the riskier Y-portfolio is the same as that of the less risky X-portfolio for all

p \in [p_{*}, 1)

. Such indifference (which may also be referred to as insufficient sensitivity to risk) may generally be considered “an unwanted characteristic” – see e.g. pages 36 and 48 in [4]. One can also perceive the exhibited here lack of dependence of

CVaR

on δ as a certain “flatness” of this measure of risk.

Let us now show that, in contrast with the risk measure

Q_{1} (\cdot; p) = {CVaR}_{p} (\cdot)

, the value of

Q_{α} (\cdot; p)

is sensitive to risk for all

α \in (1, \infty)

and all

p \in (0, 1)

; that is, for all such α and p and for the losses X and Y as above,

Q_{α} (Y; p) > Q_{α} (X; p)

. Indeed, take any

α \in (1, \infty)

. By (2.17) and (4.10),

x_{*, X} = 1

,

p_{*, X} = p_{*}

,

x_{*, Y} = 1 + δ

,

x_{* *, Y} = 1 - δ

, and

p_{*, Y} = p_{*} / 2

. If

p \in (0, p_{*} / 2]

then, by Part (ii) of Proposition 3.1,

Q_{α} (Y; p) = x_{*, Y} = 1 + δ > 1 = x_{*, X} = Q_{α} (X; p)

. If now

p \in (p_{*} / 2, 1)

, then, by Formula (3.20) in [8,

t_{Y} : =_{α - 1} Q (Y; p) \in (- \infty, x_{* *, Y}) = (- \infty, 1 - δ)

. Also, by a strict version of Jensen’s inequality and the strict convexity of

u^{α}

in

u \in [0, \infty)

,

B_{α} (X; p) (t) = t + p^{- 1 / α} {∥ X - t ∥}_{α} < t + p^{- 1 / α} {∥ Y - t ∥}_{α} = B_{α} (Y; p) (t)

for all

t \in (- \infty, 1 - δ]

. Therefore, by Formula (3.18) in [8] and Formula (3.8) in the present paper,

Q_{α} (Y; p) = B_{α} (Y; p) (t_{Y}) > B_{α} (X; p) (t_{Y}) ⩾ Q_{α} (X; p)

. Thus, it is checked that

Q_{α} (Y; p) > Q_{α} (X; p)

for all

α \in (1, \infty)

and all

p \in (0, 1)

.

The above example is illustrated in Figure 4, for

p_{*} = 0.1

and

δ = 0.6

. It is seen that the sensitivity of the measure

Q_{α} (\cdot; p)

to risk (reflected especially by the gap between the red and blue lines for

p \in [p_{*}, 1) = [0.1, 1)

) increases from the zero sensitivity when

α = 1

to an everywhere positive sensitivity when

α = 2

to an everywhere greater positive sensitivity when

α = 5

.

Figure 4. Sensitivity of

Q_{α} (\cdot; p)

to risk, depending on the value of α: graphs

\{(p, Q_{α} (X; p)) : 0 < p < 1\}

(blue) and

\{(p, Q_{α} (Y; p)) : 0 < p < 1\}

(red) for

α = 1

(left);

α = 2

(middle); and

α = 5

(right).

Figure 4. Sensitivity of

Q_{α} (\cdot; p)

to risk, depending on the value of α: graphs

\{(p, Q_{α} (X; p)) : 0 < p < 1\}

(blue) and

\{(p, Q_{α} (Y; p)) : 0 < p < 1\}

(red) for

α = 1

(left);

α = 2

(middle); and

α = 5

(right).

That

{CVaR}_{p} = Q_{1} (\cdot; p)

is flat – in contrast to

Q_{α} (\cdot; p)

with

α \in (1, \infty)

– is of course rooted in the fact that

u^{α}

is strictly convex in

u \in [0, \infty)

only for

α \in (1, \infty)

, but not for

α = 1

; cf. e.g. [37], where it is shown that the normed space

L^{α}

is uniformly convex for

α \in (1, \infty)

(but of course not for

α = 1

).

5.3. Coherent and Non-Coherent Measures of Risk

Based on an extensive and penetrating discussion of methods of measurement of market and non-market risks, Artzner et al. [1] concluded that, for a risk measure to be effective in risk regulation and management, it has to be coherent, in the sense that it possess the translation invariance, subadditivity, positive homogeneity, and monotonicity properties. In general, a risk measure, say

\hat{ρ}

, is a mapping of a linear space of real-valued r.v.’s on a given probability space into

R

. The probability space (say Ω) was assumed to be finite in [1]. More generally, one could allow Ω to be infinite, and then it is natural to allow

\hat{ρ}

to take values

\pm \infty

as well. In [1], the r.v.’s (say Y) in the argument of the risk measure were called risks but at the same time interpreted as “the investor’s future net worth”. Then the translation invariance was defined in [1] as the identity

\hat{ρ} (Y + r t) = \hat{ρ} (Y) - t

for all r.v.’s Y and real numbers t, where r is a positive real number, interpreted as the rate of return. We shall, however, follow Pflug [26] (among other authors), who considers a risk measure (say ρ) as a function of the potential cost/loss, say X, and then defines the translation invariance of ρ, quite conventionally, as the identity

ρ (X + c) = ρ (X) + c

for all r.v.’s X and real numbers c. The approaches in [1,26] are equivalent to each other, and the correspondence between them can be given by the formulas

ρ (X) = r \hat{ρ} (Y) = r \hat{ρ} (- X)

,

X = - Y

, and

c = - r t

. The positive homogeneity, as defined in [1], can be stated as the identity

ρ (λ X) = λ ρ (X)

for all r.v.’s X and real numbers

λ ⩾ 0

.

Corollary 5.1. For each

α \in [1, \infty]

and each

p \in (0, 1)

, the quantile bound

Q_{α} (\cdot; p)

is a coherent risk measure, and it is not coherent for any pair

(α, p) \in [0, 1) \times (0, 1)

.

This follows immediately from Theorem 3.4 and Proposition 3.7.

The usually least trivial of the four properties characterizing the coherence is the subadditivity of a risk measure – which, in the presence of the positive homogeneity, is equivalent to the convexity, as was pointed out earlier in this paper. As is well known and also discussed above, the value at risk measure

{VaR}_{p} (X)

is translation invariant, positive homogeneous, and monotone (in X), but it fails to be subadditive. Quoting from page 1458 in [2]: “The coherence of [

{CVaR}_{p} (X)

] is a formidable advantage not shared by any other widely applicable measure of risk yet proposed.”

Corollary 5.1 above addresses this problem by providing an entire infinite family of coherent risk measures, indexed by

α \in [1, \infty]

, including

{CVaR}_{p} = Q_{1} (\cdot; p)

just as one member of the family. Moreover,

{CVaR}_{p}

can now be seen as only “barely”, borderline coherent – because (

{CVaR}_{p} = Q_{1} (\cdot; p)

and)

α = 1

is the smallest value of the sensitivity index for which the risk measure

Q_{α} (\cdot; p)

is coherent. One can also say that the coherence of

CVaR

is unstable with respect to the sensitivity index α:

{CVaR}_{p}

is coherent, but the risk measure

Q_{α} (\cdot; p)

(which is arbitrarily close to

{CVaR}_{p}

when α is close enough to 1) is not coherent if

α \in [0, 1)

. Here one may also recall the discussion in Section 5.2 on

CVaR

’s “flatness” and indifference to risk.

5.4. Other Terminology Used in the Literature for Some of the Listed Properties of $Q_{α} (\cdot; p)$

Theorem 3.4 provides a number of useful properties of the spectrum of risk measures

Q_{α} (\cdot; p)

. The terminology we use to name some of these properties differs from the corresponding terminology used elsewhere.

In particular, what we refer to as the “positive sensitivity” in Theorem 3.4 corresponds to the “relevance” in [1].

Next, in the present paper the “model-independence” means that the risk measure depends on the potential loss only through the distribution of the loss, rather than on the way to model the “states of nature”, on which the loss may depend. In contrast, in [1] a measure of risk is considered “model-free” if it does not depend, not only on modeling the “states of nature”, but, to a possibly large extent, on the distribution of the loss. An example of such a “model-free” risk measure is given by the SEC rules mentioned in Section 4.4; this measure of risk depends only on the set of all possible representations of the investment portfolio in question as a portfolio of long call spreads, that is, pairs of the form (a long call, a short call). If a measure of risk is not “model-free”, then it is called “model-dependent” in [1]. The “model-independence” property is called “law-invariance” in Section 12.1.2 of [38], and a similar property is called “neutrality” on page 97 in [39].

Also in [38], the consistency property is referred to as “constancy”.

5.5. Gini-Type Mean Differences and Related Risk Measures

Yitzhaki [40] utilized the Gini mean difference – which had prior to that been mainly used as a measure of economic inequality – to construct, somewhat implicitly, a measure of risk; this approach was further developed in [41,42]. If (say) a r.v. X is thought of as the income of a randomly selected person in a certain state, then the Gini mean difference can be defined by the formula

G_{H} (X) : = E H (| X - \tilde{X} |),

where

\tilde{X}

is an independent copy of X and

H : [0, \infty) \to R

is a measurable function, usually assumed to be nonnegative and such that

H (0) = 0

; clearly, given the function H, the Gini mean difference

G_{H} (X)

depends only on the distribution of the r.v. X. Therefore, if

H (u)

is considered, for any

u \in [0, \infty)

, as the measure of inequality between two individuals with incomes x and y such that

| x - y | = u

, then the Gini mean difference

E H (| X - \tilde{X} |)

is the mean H-inequality in income between two individuals selected at random (and with replacement, thus independently of each other). The most standard choice for H is the identity function

id

, so that

H (u) = id (u) = u

for all

u \in [0, \infty)

. Based on the measure-of-inequality

G_{H}

, one can define the risk measure

R_{H} (X) : = E X + G_{H} (X) = E X + E H (| X - \tilde{X} |),

(5.2)

where now the r.v. X is interpreted as the uncertain loss on a given investment, with the term

G_{H} (X) = E H (| X - \tilde{X} |)

then possibly interpreted as a measure of the uncertainty. Clearly, when there is no uncertainty, so that the loss X is in fact a nonrandom real constant, then the measure

G_{H} (X)

of the uncertainty is 0, assuming that

H (0) = 0

. If

X \sim N (μ, σ^{2})

(that is, X is normally distributed with mean μ and standard deviation

σ > 0

) and

H = κ id

for some positive constant κ, then

R_{H} (X) = μ + \frac{2 κ}{\sqrt{π}} σ

, a linear combination of the mean and the standard deviation, so that in such a case we find ourselves in the realm of the Markowitz mean-variance risk-assessment framework; cf. (3.12).

It is assumed that

R_{H} (X)

is defined when both expected values in the last expression in (5.2) are defined and are not infinite values of opposite signs – so that these two expected values could be added, as needed in (5.2).

It is clear that

R_{H} (X)

is translation-invariant. Moreover,

R_{H} (X)

is convex in X if the function H is convex and nondecreasing. Further, if

H = κ id

for some positive constant κ, then

R_{H} (X)

is also positive-homogeneous.

It was shown in [40], under an additional technical condition, that

R_{H} (X)

is nondecreasing in X with respect to the stochastic dominance of order 1 if

H = \frac{1}{2} id

. Namely, the result obtained in [40] is that if

X \overset{st}{⩽} Y

and the distribution functions F and G of X and Y are such that

F - G

changes sign only finitely many times on

R

, then

R_{\frac{1}{2} id} (X) ⩽ R_{\frac{1}{2} id} (Y)

. A more general result was obtained in [42], which can be stated as follows: in the case when the function H is differentiable,

R_{H} (X)

is nondecreasing in X with respect to the stochastic dominance of order one if and only if

| H^{'} | ⩽ \frac{1}{2}

. Cf. also [41]. The proof in [42] was rather long and involved; in addition, it used a previously obtained result of [43]. Here we are going to give (in Appendix A) a very short, direct, and simple proof of the more general

Proposition 5.2. The risk measure

R_{H} (X)

is nondecreasing in X with respect to the stochastic dominance of order 1 if and only if the function H is

\frac{1}{2}

-Lipschitz:

| H (x) - H (y) | ⩽ \frac{1}{2} | x - y |

for all x and y in

[0, \infty)

.

In Proposition 5.2, it is not assumed that

H ⩾ 0

or that

H (0) = 0

. Of course, if H is differentiable, then the

\frac{1}{2}

-Lipschitz condition is equivalent to the condition

| H^{'} | ⩽ \frac{1}{2}

in [42].

The risk measure

R_{H} (X)

was called mean-risk (M-R) in [41].

It follows from [42] or Proposition 5.2 above that the risk measure

R_{κ id} (X)

is coherent for any

κ \in [0, \frac{1}{2}]

. In fact, based on Proposition 5.2, one can rather easily show more:

Proposition 5.3. The risk measure

R_{H} (X)

is coherent if and only if

H = κ id

for some

κ \in [0, \frac{1}{2}]

.

It is possible to indicate a relation – albeit rather indirect – of the risk measure

R_{H} (X)

, defined in (5.2), with the quantile bounds

Q_{α} (X; p)

. Indeed, introduce

{\hat{Q}}_{α} (X; p) = E X + p^{- 1 / α} ∥ {(X - E X)}_{+} ∥_{α},

(5.3)

assuming

E X

exists in

R

. By (3.8)–(3.9),

{\hat{Q}}_{α} (X; p)

is a majorant of

Q_{α} (X; p)

, obtained by using

t = E X

in (3.8) as a surrogate of the minimizing value of t.

The term

p^{- 1 / α} ∥ {(X - E X)}_{+} ∥_{α}

in (5.3) is somewhat similar to the Gini mean-difference term

E H (| X - \tilde{X} |)

, at least when

α = 1

and (the distribution of) the r.v. X is symmetric about its mean.

Moreover, if the distribution of

X - E X

is symmetric and stable with index

γ \in (1, 2]

, then

{\hat{Q}}_{1} (X; p) = R_{κ id} (X)

with

κ = 2^{- 1 - 1 / γ} / p

.

One may want to compare the two considered kinds of coherent measures of risk/inequality,

R_{κ id} (X)

for

κ \in [0, \frac{1}{2}]

and

Q_{α} (X; p)

for

α \in [1, \infty]

and

p \in (0, 1)

. It appears that the latter measure is more flexible, as it depends on two parameters (α and p) rather than just one parameter (κ). Moreover, as previously mentioned Proposition 2.7 in [8] shows, rather generally

Q_{α} (X; p)

retains a more or less close relation with the quantile

Q_{0} (X; p)

– which, recall, is the widely used value at risk (VaR). On the other hand, recall here that, in contrast with the VaR,

Q_{α} (X; p)

is coherent for

α \in [1, \infty]

. However, both of these kinds of coherent measures appear useful, each in its own manner, representing two different ways to express risk/inequality.

Formulas (5.2) and (5.3) can be considered special instances of the general relation between risk measures and measures of inequality established in [44]. Let

X_{E}

be a convex cone of real-valued r.v.

X \in X

with a finite mean

E X

such that

X_{E}

contains all real constants.

Largely following [44] (see also the earlier study [45]), let us say a coherent risk measure

R : X_{E} \to (- \infty, \infty]

is strictly expectation-bounded if

R (X) > E X

for all

X \in X_{E}

. (Note that here the r.v. X represents the loss, whereas in [44] it represents the gain; accordingly, X in this paper corresponds to

- X

in [44]; also, in [44] the cone

X_{E}

was taken to be the space

L^{2}

.) In view of Theorem 3.4 and Part (vii) of Proposition 3.1, it follows that

Q_{α} (X; p)

is a coherent and strictly expectation-bounded risk measure if

α \in [1, \infty]

. Also (cf. Definition 1 and Proposition 1 in [44]), let us say that a mapping

D : X_{E} \to [0, \infty]

is a deviation measure if D is subadditive, positive-homogeneous, and nonnegative with

D (X) = 0

if and only if

E (X = c) = 1

for some real constant c; here X is any r.v. in

X_{E}

. Next (cf. Definition 2 in [44]), let us say that a deviation measure

D : X_{E} \to [0, \infty]

is upper-range dominated if

D (X) ⩽ sup supp X - E X

for all

X \in X_{E}

. Then (cf. Theorem 2 in [44]), the formulas

D (X) = R (X - E X) and R (X) = E X + D (X)

(5.4)

provide a one-to-one correspondence between all coherent strictly expectation-bounded risk measures

R : X_{E} \to (- \infty, \infty]

and all upper-range dominated deviation measures

D : X_{E} \to [0, \infty]

.

In particular, it follows that the risk measure

{\hat{Q}}_{α} (\cdot; p)

, defined by Formula (5.3), is coherent for all

α \in [1, \infty]

and all

p \in (0, \infty)

. It also follows that

X \mapsto Q_{α} (X - E X; p)

is a deviation measure. As was noted,

{\hat{Q}}_{α} (X; p)

is a majorant of

Q_{α} (X; p)

. In contrast with

Q_{α} (X; p)

, in general

{\hat{Q}}_{α} (X; p)

will not have such a close hereditary relation with the true quantile

Q_{0} (X; p)

as e.g. the ones given in the previously mentioned Proposition 2.7 in [8]. For instance, if

P (X ⩾ x)

is like

x^{- \infty}

then, by Formulas (2.13)-(2.14) in [8,

Q_{α} (X; p) \underset{p ↓ 0}{\sim} Q_{0} (X; p)

for each

α \in [0, \infty]

, whereas

{\hat{Q}}_{\infty} (X; p) = \infty

for all real

p > 0

. On the other hand, in distinction with the definition (5.3) of

{\hat{Q}}_{α} (X; p)

, the expression (3.8) for

Q_{α} (X; p)

requires minimization in t; however, that minimization will add comparatively little to the complexity of the problem of minimizing

Q_{α} (X; p)

subject to a usually large number of restrictions on X; cf. Theorem 4.7. Risk measures similar to (5.3) were considered in [46] in relation with the stochastic dominance of arbitrary orders.

5.6. A Lorentz-Type Parametric Family of Risk Measures

Recalling (2.29) and following [21,22,47], one may also consider

- F_{- X}^{(- α)} (p)

as a measure of risk. Here one will need the following semigroup identity, given by Formula (8a) in [21], (cf. e.g. Remark 3.7 in [5]):

F_{X}^{(- α)} (p) = \frac{1}{Γ (α - ν)} \int_{0}^{p} {(p - u)}^{α - ν - 1} F_{X}^{(- ν)} (u) d u

(5.5)

whenever

0 < ν < α < \infty

. The following proposition is well known.

Proposition 5.4. If the r.v. X is nonnegative, then

F_{X}^{(- 2)} (p) = L_{X} (p) = - p {CVaR}_{p} (- X),

(5.6)

where

L_{X}

is the Lorenz curve function, given by the formula

L_{X} (p) : = \int_{0}^{p} F_{X}^{- 1} (u) d u

(5.7)

.

Indeed, the first equality in (5.6) is the special case of the identity (5.5) with

α = 2

and

ν = 1

, and the second equality in (5.6) follows by Part (i) of Theorem 3.1 in [48], identity (3.8) for

α = 1

, and the second identity in (5.1). Cf. Theorem 2 in [49] and [20,50].

Using (5.5) with

ν = 2

,

α + 1

in place of α, and

- X

in place of X together with Proposition 5.4, one has

- F_{- X}^{(- α - 1)} (p) = \frac{1}{Γ (α - 1)} \int_{0}^{p} {(p - u)}^{α - 2} u {CVaR}_{u} (X) d u

(5.8)

for any

α \in (1, \infty)

. Since

{CVaR}_{u} (X)

is a coherent risk measure, it now follows that, as noted in [47],

- F_{- X}^{(- α - 1)} (p)

is a coherent risk measure as well, again for

α \in (1, \infty)

; by (5.6), this conclusion will hold for

α = 1

. However, one should remember that the expression

F_{X}^{(- α)} (p)

was defined only when the r.v. X is nonnegative (and otherwise some of the crucial considerations above will not hold). Thus,

- F_{- X}^{(- α - 1)} (p)

is defined only if

X ⩽ 0

almost surely.

5.7. Spectral Risk Measures

In view of (5.8), the risk measure

- F_{- X}^{(- α - 1)} (p)

is a mixture of the coherent risk measures

{CVaR}_{u} (X)

and thus a member of the general class of the so-called spectral risk measures [51], which are precisely the mixtures, over the values

u \in (0, 1)

, of the risk measures

{CVaR}_{u} (X)

; thus, all spectral risk measures are automatically coherent. However, in general such measures will lack such an important variational representation as the one given by Formula (3.8) for the risk measure

Q_{α} (X; p)

. Of course, for any “mixing” nonnegative Borel measure μ on the interval

(0, 1)

and the corresponding spectral risk measure

{CVaR}_{μ} (X) : = \int_{(0, 1)} {CVaR}_{u} (X) μ (d u)

, one can write

{CVaR}_{μ} (X) = \int_{(0, 1)} inf_{t \in R} (t + \frac{1}{u} {∥ {(X - t)}_{+} ∥}_{1}) μ (d u),

(5.9)

in view of (5.1) and (3.8)–(3.9). However, in contrast with (3.8), the minimization (in

t \in R

) in (5.9) needs in general to be done for each of the infinitely many values of

u \in (0, 1)

. If the r.v. X takes only finitely many values, then the expression of

{CVaR}_{μ} (X)

in (5.9) can be rewritten as a finite sum, so that the minimization in

t \in R

will be needed only for finitely many values of u; cf. e.g. the optimization problem on page 8 in [47].

On the other hand, one can of course consider arbitrary mixtures in

p \in (0, 1)

and/or

α \in [1, \infty)

of the risk measures

Q_{α} (X; p)

. Such mixtures will automatically be coherent. Also, all mixtures of the measures

Q_{α} (X; p)

in p will be nondecreasing in α, and all mixtures of

Q_{α} (X; p)

in α will be nonincreasing in p.

5.8. Risk Measures Reinterpreted as Measures of Economic Inequality

Deviation measures such as the ones studied in [44] and discussed in the paragraph containing (5.4) can be used as measures of economic inequality if the r.v. X models, say, the random income/wealth – defined as the income/wealth of an (economic) unit chosen at random from a population of such units. Then, according to the one-to-one correspondence given by (5.4), coherent risk measures R translate into deviation measures D, and vice versa.

However, the risk measures

Q_{α} (\cdot; p)

themselves can be used to express certain aspects of economic inequality directly, without translation into deviation measures. For instance, if X stands for the random wealth, then the statement

Q_{1} (X; 0.01) = 30 E X

formalizes the common kind of expression “the wealthiest 1% own 30% of all wealth”, provided that the wealthiest 1% can be adequately defined, say as follows: there is a threshold wealth value t such that the number of units with wealth greater than or equal to t is

0.01 N

, where N is the number of units in the entire population. Then (cf. (5.12)),

0.01 N Q_{1} (X; 0.01) = 0.01 N E (X | X ⩾ t) = N E X I {X ⩾ t} = 0.30 N E X

, whence indeed

Q_{1} (X; 0.01) = 30 E X

. Similar in spirit expressions of economic inequality in terms of

Q_{α} (X; p)

can be provided for all

α \in (0, \infty)

. For instance, suppose now that X stands for the annual income of a randomly selected household, whereas x is a particular annual household income level in question. Then, in view of (3.8)–(3.9), the inequality

Q_{α} (X; p) ⩾ x

means that for any (potential) annual household income level t less than the maximum annual household income level

x_{*, X}

in the population, the conditional α-mean

E {({(X - t)}^{α} | X > t)}^{1 / α}

of the excess

{(X - t)}_{+}

of the random income X over t is no less than

{(\frac{p}{E (X > t)})}^{1 / α}

times the excess

{(x - t)}_{+}

of the income level x over t. Of course, the conditional α-mean

E {({(X - t)}^{α} | X > t)}^{1 / α}

is increasing in α. Thus, using the measure

Q_{α} (X; p)

of economic inequality with a greater value of α means treating high values of the economic variable X in a more progressive/sensitive manner. One may also note here that the above interpretation of the inequality

Q_{α} (X; p) ⩾ x

is a “synthetic” statement in the sense that it provides information concerning all values of potential interest of the threshold annual household income level t.

Not only the upper bounds

Q_{α} (X; p)

on the quantile

Q (X; p)

, but also the upper bounds

P_{α} (X; x)

on the tail probability

P (X ⩾ x)

may be considered measures of risk/inequality. Indeed, if X is interpreted as the potential loss, then the tail probability

P (X ⩾ x)

corresponds to the classical safety-first (SF) risk measure; see e.g. [52,53].

5.9. “Explicit” Expressions of $Q_{α} (X; p)$

In the case

α = 1

, an expression of

Q_{α} (X; p)

can be given in terms of the true

(1 - p)

-quantile

Q (X; p)

:

Q_{1} (X; p) = Q (X; p) + \frac{1}{p} E {(X - Q (X; p))}_{+} .

(5.10)

That the expression for

Q_{1} (X; p)

in (3.8) coincides with the one in (5.10) was established in Theorem 1 in [3] for absolutely continuous r.v.’s X, and then on page 273 in [26] and in Theorem 10 in [2] in general. For the readers’ convenience, let us present here the following brief proof of (5.10). For all real

h > 0

and

t \in R

, one has

{(X - t)}_{+} - {(X - t - h)}_{+} = h I {X > t} - (t + h - X) I {t < X < t + h} .

It follows that the right derivative of the convex function

{t \mapsto t + ∥ (X - t)}_{+} ∥_{1} / p

at any point

t \in R

is

1 - P (X > t) / p

, which, by (3.3), is

⩽ 0

if

t < Q (X; p)

and

> 0

if

t > Q (X; p)

. Hence,

Q (X; p)

is a minimizer in

t \in R

of

{t + ∥ (X - t)}_{+} ∥_{1} / p

, and thus (5.10) follows by (3.8). It is also seen now that any

(1 - p)

-quantile of X is a minimizer in

t \in R

of

{t + ∥ (X - t)}_{+} ∥_{1} / p

as well, and

Q (X; p)

is the largest of these minimizers.

As was shown in [2], the expression for

Q_{1} (X; p)

in (5.10) can be rewritten as a conditional expectation:

\begin{matrix} Q_{1} (X; p) & = Q (X; p) + E (X - Q (X; p) | X ⩾ Q (X; p), U ⩾ δ) \\ = E (X | X ⩾ Q (X; p), U ⩾ δ), \end{matrix}

(5.11)

where U is any r.v. which is independent of X and uniformly distributed on the interval

[0, 1]

,

δ : = δ (X; p) : = d I {X = Q (X; p)}

, and d is any real number in the interval

[0, 1]

such that

P (X ⩾ Q (X; p)) - p = P (X = Q (X; p)) d;

such a number d always exists. Thus, the r.v. U is used to split the possible atom of the distribution of X at the quantile point

Q (X; p)

in order to make the randomized tail probability

P (X ⩾ Q (X; p), U ⩾ δ)

exactly equal to p. Of course, in the absence of such an atom, one can simply write

Q_{1} (X; p) = Q (X; p) + E (X - Q (X; p) | X ⩾ Q (X; p)) = E (X | X ⩾ Q (X; p)) .

(5.12)

As pointed out in [2,3] and discussed in Section 4.3, a variational formula such as (3.8) has a distinct advantage over such ostensibly explicit formulas as (5.10) and (5.11), since (3.8) allows of rather easy incorporation into specialized optimization problems. Nonetheless, one can obtain an extension of the representation (5.10), valid for all

α \in [1, \infty)

; see Formula (4.18) and also Proposition 4.7 in [8].

6. Conclusions

Let us summarize some of the advantages of the risk/inequality measures

P_{α} (X; x)

and

Q_{α} (X; p)

:

$P_{α} (X; x)$ and $Q_{α} (X; p)$ are three-way monotonic and three-way stable – in α, p, and X.
The monotonicity in X is graded continuously in α, resulting in varying, controllable degrees of sensitivity of $P_{α} (X; x)$ and $Q_{α} (X; p)$ to financial risk/economic inequality.
$x \mapsto P_{α} (X; x)$ is the tail-function of a certain probability distribution.
$Q_{α} (X; p)$ is a $(1 - p)$ -percentile of that probability distribution.
For small enough values of p, the quantile bounds $Q_{α} (X; p)$ are close enough to the corresponding true quantiles $Q (X; p) = {VaR}_{p} (X)$ , provided that the right tail of the distribution of X is light enough and regular enough, depending on α.
In the case when the loss X is modeled as a normal r.v., the use of the risk measures $Q_{α} (X; p)$ reduces, to an extent, to using the Markowitz mean-variance risk-assessment paradigm – but with a varying weight of the standard deviation, depending on the risk sensitivity parameter α.
$P_{α} (X; x)$ and $Q_{α} (X; p)$ are solutions to mutually dual optimizations problems, which can be comparatively easily incorporated into more specialized optimization problems, with additional restrictions on the r.v. X.
$P_{α} (X; x)$ and $Q_{α} (X; p)$ are effectively computable.
Even when the corresponding minimizer is not identified quite perfectly, one still obtains an upper bound on the risk/inequality measures $P_{α} (X; x)$ or $Q_{α} (X; p)$ .
Optimal upper bounds on $P_{α} (X; x)$ and, hence, on $Q_{α} (X; p)$ over important classes of r.v.’s X represented (say) as sums of independent r.v.’s $X_{i}$ with restrictions on moments of the $X_{i}$ ’s and/or sums of such moments can be given; see e.g. [7,54] and references therein.
The quantile bounds $Q_{α} (X; p)$ with $α \in [1, \infty]$ constitute a spectrum of coherent measures of financial risk and economic inequality.
The r.v.’s X of which the measures $P_{α} (X; x)$ and $Q_{α} (X; p)$ are taken are allowed to take values of both signs. In particular, if, in a context of economic inequality, X is interpreted as the net amount of assets belonging to a randomly chosen economic unit, then a negative value of X corresponds to a unit with more liabilities than paid-for assets. Similarly, if X denotes the loss on a financial investment, then a negative value of X will obtain when there actually is a net gain.

As seen from the discussion in Section 5, some of these advantages, and especially their totality, appear to be unique to the risk measures proposed here.

Further studies involving especially the use and computational implementation of the proposed risk measures would be welcome.

Acknowledgment

I am pleased to thank Emmanuel Rio for the mentioned communication [24], which also included a reference to [29] and in fact sparked the study presented here. I am also pleased to thank the referees for useful suggestions concerning the presentation.

Appendix

A. Proofs

Proof of Proposition 2.1. This proof is not hard but somewhat technical; it can be found in the more detailed version [8] of this paper; see the proof of Proposition 1.1 there. ☐

Proof of Proposition 2.2. This too can be found in [8]; see the proof of Proposition 1.2 there. ☐

Proof of Proposition 2.3. Let α and a sequence

(α_{n})

be indeed as in Proposition 2.3. If

x \in [x_{*}, - \infty)

, then the desired conclusion

P_{α_{n}} (X; x) \to P_{α} (X; x)

follows immediately from part (i) of Proposition 2.2. Therefore, assume in the rest of the proof of Proposition 2.3 that

x \in (- \infty, x_{*}) .

(A1)

Then (4.4) takes place and, by (4.3),

λ_{max, α}

is continuous in

α \in (0, \infty]

. Hence,

λ^{*} : = sup_{n} λ_{max, α_{n}} \in [0, \infty)

(A2)

and

P_{γ} (X; x) = inf_{λ \in [0, λ_{*}]} A_{γ} (X; x) (λ) for all γ \in {α} \cup {α_{n} : n \in N} .

(A3)

Also, by (2.3), (2.2), the inequality (4.1) for

α \in (0, \infty)

, the condition

X \in X_{β}

, and dominated convergence,

A_{α_{n}} (X; x) (λ) \to A_{α} (X; x) (λ) .

(A4)

Hence, by (2.5),

{lim sup}_{n} P_{α_{n}} (X; x) ⩽ {lim sup}_{n} A_{α_{n}} (X; x) (λ) = A_{α} (X; x) (λ)

for all

λ \in [0, \infty)

, whence, again by (2.5),

\underset{n}{lim sup} P_{α_{n}} (X; x) ⩽ P_{α} (X; x) .

(A5)

Thus, the case

α = 0

of Proposition 2.3 follows by (2.6).

If

α \in (0, 1]

, then for any κ and

λ

such that

0 ⩽ κ < λ < \infty

one has

| A_{α} (X; x) (λ) - A_{α} {(X; x) (κ) | ⩽ (λ - κ)}^{α} E {(X - x)}_{+}^{α} / α^{α} + {(λ - κ)}^{α / 2} / α^{α} + P ({(x - X)}_{+} > \frac{1}{\sqrt{λ - κ}});

(A6)

this follows because

\begin{matrix} 0 ⩽ {(1 + λ u / α)}_{+}^{α} - {(1 + κ u / α)}_{+}^{α} & ⩽ {(λ - κ)}^{α} u^{α} / α^{α} if u ⩾ 0, \\ 0 ⩽ {(1 + κ u / α)}_{+}^{α} - {(1 + λ u / α)}_{+}^{α} & ⩽ min (1, {(λ - κ)}^{α} {| u |}^{α} / α^{α}) \\ ⩽ {(λ - κ)}^{α / 2} / α^{α} + I {| u | > \frac{1}{\sqrt{λ - κ}}} if u < 0 . \end{matrix}

If now

α \in (0, 1)

, then (say, by cutting off an initial segment of the sequence

(α_{n})

) one may assume that

β \in (0, 1)

, and then, by (A6) with

α_{n}

in place of α, the sequence

(A_{α_{n}} (X; x) (λ))

is equicontinuous in

λ \in [0, \infty)

, uniformly in n. Therefore, by (A2) and the Arzelà–Ascoli theorem, the convergence in (A4) is uniform in

λ \in [0, λ^{*}]

and, hence, the conclusion

P_{α_{n}} (X; x) \to P_{α} (X; x)

follows by (A3) – in the case when

α \in (0, 1)

.

Quite similarly, the same conclusion holds if

α = 1 = β

; that is,

P_{α} (X; x)

is left-continuous in α at the point

α = 1

provided that

E X_{+} < \infty

.

It remains to consider the case when

α \in [1, \infty]

and

α_{n} ⩾ 1

for all n. Then, by the definition in (2.1), the functions

h_{α}

and

h_{α_{n}}

are convex and hence, by (2.3),

A_{α} (X; x) (λ)

and

A_{α_{n}} (X; x) (λ)

are convex in

λ \in [0, \infty)

. Then the conclusion

P_{α_{n}} (X; x) \to P_{α} (X; x)

follows by Corollary 3 in [55], the condition

X \in X_{β}

, (A2), and (A3). ☐

Proof of Proposition 2.4. This is somewhat similar to the proof of Proposition 2.3. One difference here is the use of the uniform integrability condition, which, in view of (2.3), (4.1), and the condition

X \in X_{α}

, implies (see e.g. Theorem 5.4 in [15]) that for all

λ \in [0, \infty)

lim_{n \to \infty} A_{α} (X_{n}; x) (λ) = A_{α} (X; x) (λ);

(A7)

here, in the case when

α = \infty

and

λ \notin Λ_{X}

, one should also use the Fatou lemma for the convergence in distribution (see e.g. Theorem 5.3 in [15]), according to which one always has

{lim inf}_{n \to \infty} A_{α} (X_{n}; x) (λ) ⩾ A_{α} (X; x) (λ)

, even without the uniform integrability condition. In this entire proof, it is indeed assumed that

α \in (0, \infty]

.

It follows from (A7) and the nonnegativity of

P_{α} (\cdot; \cdot)

that

0 ⩽ \underset{n \to \infty}{lim inf} P_{α} (X_{n}; x) ⩽ \underset{n \to \infty}{lim sup} P_{α} (X_{n}; x) ⩽ P_{α} (X; x)

(A8)

for all real x; cf. (A4) and (A5).

The convergence (2.22) for

x \in (x_{*}, \infty)

follows immediately from (A8) and part (i) of Proposition 2.2.

Using the same ingredients, it is easy to check Part (ii) of Proposition 2.4 as well. Indeed, assuming that

P (X_{n} = x_{*}) \underset{n \to \infty}{⟶} P (X = x_{*})

and using also (2.6), one has

\begin{matrix} P (X = x_{*}) = \underset{n \to \infty}{lim inf} P (X_{n} = x_{*}) ⩽ \underset{n \to \infty}{lim inf} P (X_{n} ⩾ x_{*}) ⩽ \underset{n \to \infty}{lim inf} P_{α} (X_{n}; x_{*}) \\ ⩽ \underset{n \to \infty}{lim sup} P_{α} (X_{n}; x_{*}) ⩽ P_{α} (X; x_{*}) = P (X = x_{*}), \end{matrix}

which yields (2.22) for

x = x_{*}

. Also,

X_{n} \underset{n \to \infty}{\overset{D}{⟶}} X

implies

{lim sup}_{n \to \infty} P (X_{n} = x_{*}) ⩽ P (X = x_{*})

; see e.g. Theorem 2.1 in [15]. So, if

P (X = x_{*}) = 0

, then

P (X_{n} = x_{*}) \to P (X = x_{*})

and hence (2.22) holds for

x = x_{*}

, by the first sentence of Part (ii) of Proposition 2.4.

It remains to prove Part (i) of Proposition 2.4 assuming (A1). The reasoning here is quite similar to the corresponding reasoning in the proof of Proposition 2.3, starting with (A1). Here, instead of the continuity of

λ_{max, α} = λ_{max, α, X}

in α, one should use the convergence

λ_{max, α, X_{n}} \to λ_{max, α, X}

, which holds provided that

y \in (x, x_{*})

is chosen to be such that

P (X = y) = 0

. Concerning the use of inequality (A6), note that (i) the uniform integrability condition implies that

E {(X_{n} - x)}_{+}^{α}

is bounded in n and (ii) the convergence in distribution

X_{n} \underset{n \to \infty}{\overset{D}{⟶}} X

implies that

{sup}_{n} P ({(x - X_{n})}_{+} > \frac{1}{\sqrt{λ - κ}}) ⟶ 0

as

0 < λ - κ \to 0

. Proposition 2.4 is now completely proved. ☐

Proof of Theorem 2.5. The model-independence is obvious from the definition (2.5). The monotonicity in X follows immediately from (2.23), (2.10), and (2.7)–(2.9). The monotonicity in α was already given in (2.13). The monotonicity in x is Part (i) of Proposition 2.1. That

P_{α} (X; x)

takes on only values in the interval

[0, 1]

follows immediately from (2.16). The α-concavity in x and stability in x follow immediately from parts (iii) and (i) of Proposition 2.2. The stability in α and the stability in X are Propositions 2.3 and 2.4, respectively. The translation invariance, consistency, and positive homogeneity follow immediately from the definition (2.5). ☐

Proof of Proposition 3.1.

(i) Part (i) of this proposition follows immediately from (3.2) and (2.16).

(ii) Suppose here indeed that

p \in (0, p_{*}] \cap (0, 1)

. Then for any

x \in (x_{*}, \infty)

one has

P_{α} (X; x) = 0 < p

, by Part (i) of Proposition 2.2, whence, by (2.19),

x \in E_{α} (p)

. On the other hand, for any

x \in (- \infty, x_{*}]

one has

P_{α} (X; x) ⩾ P_{α} (X; x_{*}) = p_{*} ⩾ p

, by Part (i) of Proposition 2.1 and Part (i) of Proposition 2.2, whence

x \notin E_{α} (p)

. Therefore,

E_{α} (p) = (x_{*}, \infty)

, and the conclusion

Q_{α} (X; p) = x_{*}

now follows by the definition of

Q_{α} (X; p)

in (3.2).

(iii) If

x_{*} = \infty

, then the inequality

Q_{α} (X; p) ⩽ x_{*}

in Part (iii) of Proposition 3.1 is trivial. If

x_{*} < \infty

and

p \in (p_{*}, 1)

, then

x_{*} \in E_{α} (p)

and hence

Q_{α} (X; p) ⩽ x_{*}

by (3.2). Now Part (iii) of Proposition 3.1 follows from its Part (ii).

(iv) Take any

x \in (- \infty, x_{*})

. Then

P_{0} (X; x) = P (X ⩾ x) > 0

. Moreover, for all

p \in (0, P_{0} (X; x))

one has

x \notin E_{0, X} (p)

. Therefore and because the set

E_{0, X} (p)

is an interval with endpoints

Q_{0} (X; p)

and ∞, it follows that

x ⩽ Q_{0} (X; p)

. Thus, for any given

x \in (- \infty, x_{*})

and for all small enough

p > 0

one has

Q_{0} (X; p) ⩾ x

and hence, by the already established Part (iii) of Proposition 3.1,

Q_{0} (X; p) \in [x, x_{*}]

. This means that Part (iv) of Proposition 3.1 is proved for

α = 0

. To complete the proof of this part, it remains to refer to the monotonicity of

Q_{α} (X; p)

in α stated in (3.4) and, again, to Part (iii) of Proposition 3.1.

(v) Assume indeed that

α \in (0, \infty]

. By Part (viii) of Proposition 2.2, the case

p_{*} = 1

is equivalent to

x_{α} = x_{*}

, and in that case both mappings (3.6) and (3.7) are empty, so that Part (v) of Proposition 3.1 is trivial. So, assume that

p_{*} < 1

and, equivalently,

x_{α} < x_{*}

. The function

(x_{α}, x_{*}) ∋ x \mapsto P_{α} (X; x)

is continuous and strictly decreasing, by Parts (iv) and (xi) of Proposition 2.2. At that,

P_{α} (X; x_{*} -) = P_{α} (X; x_{*}) = p_{*}

by Parts (iv) and (i) of Proposition 2.2 if

x_{*} < \infty

, and

P_{α} (X; x_{*} -) = 0 = p_{*}

by (2.16) and (2.17) if

x_{*} = \infty

. Also,

P_{α} (X; x_{α} +) = P_{α} (X; x_{α}) = 1

by the condition

x_{α} < x_{*}

and Parts (iv) and (x) of Proposition 2.2 if

x_{α} > - \infty

, and

P_{α} (X; x_{α} +) = 1

by (2.16) if

x_{α} = - \infty

. Therefore, the continuous and strictly decreasing function

(x_{α}, x_{*}) ∋ x \mapsto P_{α} (X; x)

maps

(x_{α}, x_{*})

onto

(p_{*}, 1)

, and so, Formula (3.7) is correct, and there is a unique inverse function, say

(p_{*}, 1) ∋ p \mapsto x_{α, p} \in (x_{α}, x_{*})

, to the function (3.7); moreover, this inverse function is continuous and strictly decreasing. It remains to show that

Q_{α} (X; p) = x_{α, p}

for all

p \in (p_{*}, 1)

. Take indeed any

p \in (p_{*}, 1)

. Since the function

(p_{*}, 1) ∋ p \mapsto x_{α, p} \in (x_{α}, x_{*})

is inverse to (3.7) and strictly decreasing,

P_{α} (X; x_{α, p}) = p

,

P_{α} (X; x) > p

for

x \in (x_{α}, x_{α, p})

, and

P_{α} (X; x) < p

for

x \in (x_{α, p}, x_{*})

. So, by Part (i) of Proposition 2.1,

P_{α} (X; x) > p

for

x \in (- \infty, x_{α, p})

and

P_{α} (X; x) < p

for

x \in (x_{α, p}, \infty)

. Now the conclusion that

Q_{α} (X; p) = x_{α, p}

for all

p \in (p_{*}, 1)

follows by (3.2).

(vi) Assume indeed that

α \in (0, \infty]

and take indeed any

y \in (- \infty, Q_{α} (X; p))

. If

P_{α} (X; y) = 1

, then the conclusion

P_{α} (X; y) > p

in Part (vi) of Proposition 3.1 is trivial, in view of (3.1). Therefore, w.l.o.g.

P_{α} (X; y) < 1

and hence

y \in E_{α} (1) = (x_{α}, \infty)

, by (2.19) and Part (ix) of Proposition 2.2. Let now

y_{p} : = Q_{α} (X; p)

for brevity, so that

y \in (- \infty, y_{p})

and, by the already verified part (iii) of Proposition 3.1,

y_{p} ⩽ x_{*}

. Hence,

x_{α} < y < y_{p} ⩽ x_{*}

. So, by Part (v) of Proposition 3.1 and Parts (iv) and (i) of Proposition 2.2,

P_{α} (X; y) > lim_{x ↑ y_{p}} P_{α} (X; x) = P_{α} (X; y_{p}) ⩾ P_{α} (X; x_{*}) = p_{*},

(A9)

which yields the conclusion

P_{α} (X; y) > p

in the case when

p ⩽ p_{*}

. If now

p > p_{*}

, then

p \in (p_{*}, 1)

and, by Part (v) of Proposition 3.1,

y_{p} = Q_{α} (X; p) \in (x_{α}, x_{*})

and

P_{α} (X; y_{p}) = p

, so that the conclusion

P_{α} (X; y) > p

follows by (A9) in this case as well.

(vii) Part (vii) of Proposition 3.1 follows immediately from (3.6), (3.5), and Part (vii) of Proposition 2.2. ☐

Proof of Theorem 3.4. The model-independence, monotonicity in X, monotonicity in α, translation invariance, consistency, and positive homogeneity properties of

Q_{α} (X; p)

follow immediately from (3.2) and the corresponding properties of

P_{α} (X; x)

stated in Theorem 2.5.

Concerning the monotonicity of $Q_{α} (X; p)$ in p: that

Q_{α} (X; p)

is nondecreasing in

p \in (0, 1)

follows immediately from (3.3) for

α = 0

and from (3.8) and (3.9) for

α \in (0, \infty]

. That

Q_{α} (X; p)

is strictly decreasing in

p \in [p_{*}, 1) \cap (0, 1)

if

α \in (0, \infty]

follows immediately from Part (v) of Proposition 3.1, and the verified below statement on the stability in p:

Q_{α} (X; p)

is continuous in

p \in (0, 1)

if

α \in (0, \infty]

.

The monotonicity of $Q_{α} (X; p)$ in α follows immediately from (2.13) and (3.2).

The finiteness of $Q_{α} (X; p)$ was already stated in Part (i) of Proposition 3.1.

The concavity of $Q_{α} (X; p)$ in $p^{- 1 / α}$ in the case when

α \in (0, \infty)

follows by (3.8), since

B_{α} (X; p) (t)

is affine (and hence concave) in

p^{- 1 / α}

. Similarly, the concavity of $Q_{\infty} (X; p)$ in $ln \frac{1}{p}$ follows by (3.8), since

B_{\infty} (X; p) (t)

is affine in

ln \frac{1}{p}

.

The stability of $Q_{α} (X; p)$ in p can be deduced from Proposition 3.1. Alternatively, the same follows from the already established finiteness and concavity of

Q_{α} (X; p)

in

p^{- 1 / α}

or

ln \frac{1}{p}

(cf. the proof of [2, Proposition 13]), because any finite concave function on an open interval of the real line is continuous, whereas the mappings

(0, 1) ∋ p \mapsto p^{- 1 / α} \in (0, \infty)

and

(0, 1) ∋ p \mapsto ln \frac{1}{p} \in (0, \infty)

are homeomorphisms.

Concerning the stability of $Q_{α} (X; p)$ in X, take any real

x \neq x_{*}

. Then the convergence

P_{α} (X_{n}; x) \to P_{α} (X; x)

holds, by Proposition 2.4. Therefore, in view of (2.19), if

x \in E_{α, X} (p)

then eventually (that is, for all large enough n)

x \in E_{α, X_{n}} (p)

. Hence, by (3.2), for each real

x \neq x_{*}

such that

x > Q_{α} (X; p)

eventually one has

x ⩾ Q_{α} (X_{n}; p)

. It follows that

{lim sup}_{n} Q_{α} (X_{n}; p) ⩽ Q_{α} (X; p) .

On the other hand, by Part (vi) of Proposition 3.1, for any

y \in (- \infty, Q_{α} (X; p))

, one has

P_{α} (X; y) > p

and, hence, eventually

P_{α} (X_{n}; y) > p

, which yields

y \notin E_{α, X_{n}} (p)

and, hence,

y ⩽ Q_{α} (X_{n}; p)

. It follows that

{lim inf}_{n} Q_{α} (X_{n}; p) ⩾ Q_{α} (X; p)

. Recalling now the established inequality

{lim sup}_{n} Q_{α} (X_{n}; p) ⩽ Q_{α} (X; p)

, one completes the verification of the stability of

Q_{α} (X; p)

in X.

The stability of $Q_{α} (X; p)$ in α is proved quite similarly, only using Proposition 2.3 in place of Proposition 2.4. Here the stipulation

x \neq x_{*}

is not needed.

Consider now the positive sensitivity property. First, suppose that

α \in (0, 1)

. Then, for all real

t < 0

, the derivative of

B_{α} (X; p) (t)

in t is less than

D : = 1 - {(E Y^{α})}^{- 1 + 1 / α} E Y^{α - 1}

, where

Y : = {(X - t)}_{+} = X - t > 0

. The inequality

D ⩽ 0

can be rewritten as the true inequality

\frac{τ}{τ + 1} L (- 1) + \frac{1}{τ + 1} L (τ) ⩾ L (0)

for the convex function

s \mapsto L (s) : = ln E exp {(1 - α) s ln Y}

, where

τ : = \frac{α}{1 - α}

. Therefore, the derivative is negative and hence

B_{α} (X; p) (t)

decreases in

t ⩽ 0

(here, to include

t = 0

, we also used the continuity of

B_{α} (X; p) (t)

in t, which follows by the condition

X \in X_{α}

and dominated convergence). On the other hand, if

t > 0

, then

B_{α} (X; p) (t) ⩾ t > 0

. Also,

B_{α} (X; p) (0) > 0

by (3.9) if the condition

P (X > 0) > 0

holds. Recalling again the continuity of

B_{α} (X; p) (t)

in t, one completes the verification of the positive sensitivity property – in the case

α \in (0, 1)

.

The positive sensitivity property in the case

α = 1

follows by (5.10). Indeed, (5.10) yields

Q_{1} (X; p) ⩾ Q (X; p) > 0

if

Q (X; p) > 0

, and

Q_{1} (X; p) = \frac{1}{p} E X ⩾ 0

by the condition

X ⩾ 0

if

Q (X; p) = 0

; moreover, one has

E X > 0

and hence

Q_{1} (X; p) = \frac{1}{p} E X > 0

if

Q (X; p) = 0

and

P (X > 0) > 0

. On the other hand, by (3.3),

X ⩾ 0

implies

Q (X; p) ⩾ 0

. Thus, the positive sensitivity property in the case

α = 1

is verified as well. This and the already established monotonicity of

Q_{α} (X; p)

in α implies the positive sensitivity property whenever

α \in [1, \infty]

.

As far as this property is concerned, it remains to verify it when

α = 0

– assuming that

P (X > 0) > p

. The sets

E : = \{x \in R : P (X > x) ⩽ p\}

and

E^{\circ} : = \{x \in R : P (X > x) < p\}

are intervals with the right endpoint ∞. The condition

P (X > 0) > p

means that

0 \notin E

. By the right continuity of

P (X > x)

in x, the set E contains the closure

\bar{E^{\circ}}

of the set

E^{\circ}

. Therefore,

0 \notin \bar{E^{\circ}}

and hence

0 < inf E^{\circ} = Q_{0} (X; p)

, by (3.3). Thus, the positive sensitivity property is fully verified.

In the presence of the positive homogeneity, the subadditivity property is easy to see to be equivalent to the convexity; cf. e.g. Theorem 4.7 in [56].

Therefore, it remains to verify the convexity property. Assume indeed that

α \in [1, \infty]

. If at that

α < \infty

, then the function

{∥ \cdot ∥}_{α}

is a norm and hence convex; moreover, this function is nondecreasing on the set of all nonnegative r.v.’s. On the other hand, the function

R ∋ x \mapsto x_{+}

is nonnegative and convex. It follows by (3.9) that

B_{α} (X; p) (t)

is convex in the pair

(X, t)

. So, to complete the verification of the convexity property of

Q_{α} (X; p)

in the case

α \in [1, \infty)

, it remains to refer to the well-known and easily established fact that, if

f (x, y)

is convex in

(x, y)

, then

{inf}_{y} f (x, y)

is convex in x; cf. e.g. Theorem 5.7 in [56].

The subadditivity and hence convexity of

Q_{α} (X; p)

in X in the remaining case

α = \infty

can now be obtained by the already established stability in α. It can also be deduced from Lemma B.2 in [57] (cf. Lemma 2.1 in [58]) or from the main result in [23], in view of the inequality (L_{X₁+⋯+X_n})^*⁻¹ ≤ Risks 02 00349 i001

given in the course of the discussion in [23] following Corollary 2.2 therein. However, a direct proof, similar to the one above for

α \in [1, \infty)

, can be based on the observation that

B_{\infty} (X; p) (t)

is convex in the pair

(X, t)

. Since

t ln \frac{1}{p}

is obviously linear in

(X, t)

, the convexity of

B_{\infty} (X; p) (t)

in

(X, t)

means precisely that for any natural number n, any r.v.’s

X_{1}, \dots, X_{n}

, any positive real numbers

t_{1}, \dots, t_{n}

, and any positive real numbers

α_{1}, \dots, α_{n}

with

\sum_{i} α_{i} = 1

, one has the inequality

t ln E e^{X / t} ⩽ \sum_{i} α_{i} t_{i} ln E e^{X_{i} / t_{i}}

, where

X : = \sum_{i} α_{i} X_{i}

and

t : = \sum_{i} α_{i} t_{i}

; but the latter inequality can be rewritten as an instance of Hölder’s inequality:

E \prod_{i} Z_{i} ⩽ \prod_{i} {∥ Z_{i} ∥}_{p_{i}}

, where

Z_{i} : = e^{α_{i} X_{i} / t}

and

p_{i} : = t / (α_{i} t_{i})

(so that

\sum_{i} \frac{1}{p_{i}} = 1

). (In particular, it follows that

B_{\infty} (X; p) (t)

is convex in t, which is useful when

Q_{\infty} (X; p)

is computed by Formula (3.8).)

The proof of Theorem 3.4 is now complete. ☐

Proof of Proposition 3.5. Take indeed any

α \in [0, \infty)

. Let then Y be a r.v. with the density function f given by the formula

f (y) = c_{α} y^{- α - 1} {(ln y)}^{- 2} I {y > 2}

for all

y \in R

, where

c_{α} : = 1 / \int_{2}^{\infty} y^{- α - 1} {ln}^{- 2} y d y

. Then

Y \in X_{α}

and, by the finiteness property stated in Theorem 3.4,

Q_{α} (Y; p) \in R

. Thus, one can find some real constant

c > Q_{α} (Y; p)

. Let now

X = c

, for any such constant c. Then, by the consistency property stated in Theorem 3.4,

Q_{α} (X; p) = c > Q_{α} (Y; p)

. On the other hand, for any

γ \in (α + 1, \infty]

one has

E g_{γ - 1; t} (X) = g_{γ - 1; t} (c) < \infty = E g_{γ - 1; t} (Y)

for all

t \in T_{γ - 1}

(letting here

γ - 1 : = \infty

when

γ = \infty

), so that, by (2.23),

X \overset{γ}{⩽} Y

. ☐

Proof of Proposition 3.6. Consider first the case

α \in (0, \infty)

. Let r.v.’s X and Y be in the default domain of definition,

X_{α}

, of the functional

Q_{α} (\cdot; p)

. The condition

X \overset{st}{<} Y

and the left continuity of the function

P (X ⩾ \cdot)

imply that for any

v \in R

, there are some

u \in (v, \infty)

and

w \in (v, u)

such that

P (X ⩾ z) < P (Y ⩾ z)

for all

z \in [w, u]

. On the other hand, by the Fubini theorem,

E {(X - t)}_{+}^{α} = \int_{R} α {(z - t)}_{+}^{α - 1} P (X ⩾ z) d z

for all

t \in R

. Recalling also that X and Y are in

X_{α}

, one has

B_{α} (X; p) (t) < B_{α} (Y; p) (t)

for all

t \in R

. By Proposition 4.3,

Q_{α} (Y; p) = B_{α} (Y; p) (t_{opt})

for some

t_{opt} \in R

. Therefore,

Q_{α} (X; p) ⩽ B_{α} (X; p) (t_{opt}) < B_{α} (Y; p) (t_{opt}) = Q_{α} (Y; p)

. (Note that the proof of Proposition 4.3, given later in this appendix, does not use Proposition 3.6 – so that there is no vicious circle here.)

Concerning the case

α = \infty

, recall (2.17) and (2.15), and then note that the condition

X \overset{st}{<} Y

implies that

x_{*, Y} = \infty

,

Λ_{X} \supseteq Λ_{Y}

, and

B_{\infty} (X; p) (t) < B_{\infty} (Y; p) (t)

for all

t \in (0, \infty)

such that

\frac{1}{t} \in Λ_{X}

and hence for all

t \in (0, \infty)

such that

\frac{1}{t} \in Λ_{Y}

. Here, instead of the formula

E {(X - t)}_{+}^{α} = \int_{R} α {(z - t)}_{+}^{α - 1} P (X ⩾ z) d z

for all

t \in R

, one uses the formula

E e^{(X - x) / t} = \int_{R} \frac{1}{t} e^{(z - x) / t} P (X ⩾ z) d z

for all

t \in (0, \infty)

. Using now Proposition 4.3, one sees that

Q_{\infty} (Y; p) = B_{\infty} (Y; p) (t_{opt})

for some

t_{opt} \in (0, \infty)

such that

\frac{1}{t} \in Λ_{Y}

. Therefore,

Q_{\infty} (X; p) ⩽ B_{\infty} (X; p) (t_{opt}) < B_{\infty} (Y; p) (t_{opt}) = Q_{\infty} (Y; p)

. ☐

Proof of Proposition 3.7. Suppose that indeed

α \in [0, 1)

. Let X and Y be independent r.v.’s, each with the Pareto density function given by the formula

f (u) = {(1 + u)}^{- 2} I {u > 0}

, so that

P (X ⩾ x) = P (Y ⩾ x) = {(1 + x_{+})}^{- 1}

for all

x \in R

. Then, by the condition

α \in [0, 1)

, the condition

X \in X_{α}

(assumed by default in this paper and, in particular, in Proposition 3.6) holds; this is the only place in the proof of Proposition 3.7 where the condition

α < 1

is used. Moreover, then it is not hard to see that for all

x \in (0, \infty)

one has

P (X + Y ⩾ x) - P (2 X ⩾ x) = 2 {(2 + x)}^{- 2} ln (1 + x) > 0

and hence, by the definition of the relation

\overset{st}{<}

given in Proposition 3.6,

2 X \overset{st}{<} X + Y .

Using now Proposition 3.6 together with the positive homogeneity property stated in Theorem 3.4, one concludes that

Q_{α} (X + Y; p) > Q_{α} (2 X; p) = 2 Q_{α} (X; p) = Q_{α} (X; p) + Q_{α} (Y; p)

if

α \in (0, 1)

.

It remains to consider the case

α = 0

. Note that the function

(0, \infty) ∋ x \mapsto P (X + Y ⩾ x) \in (0, 1)

is decreasing strictly and continuously from 1 to 0. Hence, in view of (3.3), the function

(0, 1) ∋ p \mapsto Q (X + Y; p) \in (0, \infty)

is the inverse to the function

(0, \infty) ∋ x \mapsto P (X + Y ⩾ x) \in (0, 1)

. Similarly, the function

(0, 1) ∋ p \mapsto Q (2 X; p) \in (0, \infty)

is the inverse to the strictly decreasing continuous function

(0, \infty) ∋ x \mapsto P (2 X ⩾ x) \in (0, 1)

. Since

P (X + Y ⩾ x) > P (2 X ⩾ x)

for all

x \in (0, \infty)

, it follows that

Q (X + Y; p) > Q (2 X; p)

and thus the inequality

Q_{α} (X + Y; p) > Q_{α} (X; p) + Q_{α} (Y; p)

holds for

α = 0

as well. ☐

Proof of Proposition 4.1. Take indeed any

α \in (0, 1)

and

p \in (0, 1)

. Note that there are real numbers q, r, and b such that

\begin{matrix} q > 0, r > 0, q + r < 1, \\ 0 < b < 1, \\ q {(1 - b)}^{α} + r {(1 + b)}^{α} = 2^{α} r = p . \end{matrix}

(A10)

Indeed, if

0 < b < 1

,

r = \frac{p}{2^{α}}

, and

q = k (b) r

, where

k (b) : = \frac{2^{α} - {(1 + b)}^{α}}{{(1 - b)}^{α}}

, then all of the conditions in (A10) will be satisfied, possibly except the condition

q + r < 1

, which latter will be then equivalent to the condition

h (b) : = \frac{p}{2^{α}} (1 + k (b)) < 1

. However, this condition can be satisfied by letting

b \in (0, 1)

be small enough, because

h (0 +) = p \in (0, 1)

.

If now q, r, and b satisfy (A10), then there is a r.v. X taking values

- 1

,

- b

, and b with probabilities

1 - q - r

, q, and r, respectively. Let indeed X be such a r.v. Then for all

s \in (0, \infty)

A_{α} (X; 0) (α s) = g (s) : = (1 - q - r) {(1 - s)}_{+}^{α} + q {(1 - b s)}_{+}^{α} + r {(1 + b s)}^{α} .

(A11)

In view of (A11) and (A10),

g (0 +) = 1 > p = g (\frac{1}{b}) = g (1) < \infty = g (\infty -) .

Moreover, by the condition

α \in (0, 1)

, the function g is strictly concave on each of the intervals

(0, 1]

,

[1, \frac{1}{b}]

, and

[\frac{1}{b}, \infty)

. Therefore, the minimum of

g (s)

in

s \in (0, \infty)

equals p and is attained precisely at two distinct positive values of s. Thus, in the case

x = 0

, Proposition 4.1 follows by (A11). The case of a general

x \in R

immediately reduces to that of

x = 0

by using the shifted r.v.

X + x

in place of X. ☐

Proof of Proposition 4.3. Consider first Part (i) of the proposition. For any real

t > t_{max}

, one has

B_{α} (X; p) (t) ⩾ t > B_{α} (X; p) (s) ⩾ {inf}_{t \in R} B_{α} (X; p) (t)

. On the other hand, by (4.8), for all real

t ⩽ t_{0} : = t_{0, min}

one has

{∥ (X - t)}_{+} ∥_{α}^{α} ⩾ E {(X - t)}^{α} I {X ⩾ t_{0}} ⩾ {(t_{0} - t)}^{α} P (X ⩾ t_{0}) ⩾ {(t_{0} - t)}^{α} \tilde{p}

, whence

B_{α} (X; p) (t) ⩾ t + (t_{0} - t) {(\tilde{p} / p)}^{1 / α} > t_{max} = B_{α} (X; p) (s) ⩾ {inf}_{t \in R} B_{α} (X; p) (t)

provided that also

t < t_{1, min}

. Thus,

B_{α} (X; p) (t) > {inf}_{t \in R} B_{α} (X; p) (t)

if either

t > t_{max}

or

t < t_{0, min} \land t_{1, min} = t_{min}

. This, together with the continuity of

B_{α} (X; p) (t)

in t, completes the proof of Part (i) of Proposition 4.3.

Concerning Part (ii) of the proposition, consider first

Case 1:

x_{*} = \infty

. Take then any real

t_{1} > 0

such that

E e^{X / t_{1}} < \infty

and then any real

x > x_{1} : = B_{\infty} (X; p) (t_{1})

such that

q : = P (X ⩾ x) < p

; note that

q > 0

, since

x_{*} = \infty

. Then for any real

t > 0

one has

E e^{X / t} ⩾ q e^{x / t}

and hence

B_{\infty} (X; p) (t) = t ln \frac{E e^{X / t}}{p} ⩾ t ln \frac{q e^{x / t}}{p} = x - t ln \frac{p}{q} > x_{1} = B_{\infty} (X; p) (t_{1}) ⩾ inf_{t > 0} B_{α} (X; p) (t)

(A12)

provided that

t < t_{min} : = \frac{x - x_{1}}{ln (p / q)};

the latter inequality is in fact equivalent to the strict inequality in (A12); recall here also that

x > x_{1}

and

0 < q < p

, whence

t_{min} \in (0, \infty)

. Taking now into account that

B_{\infty} (X; p) (t)

is lower semi-continuous in t (by Fatou’s lemma) and

B_{\infty} (X; p) (t) = t ln \frac{E e^{X / t}}{p} \sim t ln \frac{1}{p} \to \infty

as

t \to \infty

, one concludes that

inf_{t > 0} B_{\infty} (X; p) (t) = inf_{t ⩾ t_{min}} B_{\infty} (X; p) (t) = min_{t ⩾ t_{min}} B_{\infty} (X; p) (t),

which completes the consideration of Case 1 for Part (ii) of the proposition. It remains to consider

Case 2:

x_{*} < \infty

. Note that

B_{\infty} (\cdot; p) (t)

is translation invariant in the sense that

B_{\infty} (X + c; p) (t) = B_{\infty} (X; p) (t) + c

for all

c \in R

and

t \in (0, \infty)

. Therefore, without loss of generality,

x_{*} = 0

, so that

X ⩽ 0

almost surely (a.s.) and

P (X ⩾ - ε) > 0

for all real

ε > 0

. Now, by dominated convergence,

E e^{X / t} \underset{t ↓ 0}{⟶} P (X = 0) = p_{*}

and

E e^{X / t} \underset{t \to \infty}{⟶} 1

, whence

ln \frac{E e^{X / t}}{p} ⟶ \{\begin{matrix} ln \frac{p_{*}}{p} & as t ↓ 0, \\ ln \frac{1}{p} & as t \to \infty . \end{matrix}

(A13)

Moreover,

B_{\infty} (X; p) (t) = t ln \frac{E e^{X / t}}{p} ⟶ \{\begin{matrix} 0 & as t ↓ 0, \\ \infty & as t \to \infty . \end{matrix}

(A14)

Indeed, if

p_{*} = 0

, then for each real

ε > 0

and all small enough real

t > 0

, one has

E e^{X / t} < p

and hence

0 > t ln \frac{E e^{X / t}}{p} ⩾ t ln (\frac{1}{p} E e^{X / t} I {X ⩾ - ε}) ⩾ - ε + t ln P (X ⩾ - ε) \underset{t ↓ 0}{⟶} - ε

, which yields (A14) for

t ↓ 0

, in the case when

p_{*} = 0

. As for the cases when

t \to \infty

, or

t ↓ 0

and

p_{*} > 0

, then (A14) follows from (A13) because

0 < p < 1

.

To proceed further with the consideration of Case 2, one needs to distinguish the following three subcases.

Subcase 2.1:

p_{*} \in [0, p)

. Then, by (A14), for all large enough real

t > 0

B_{\infty} (X; p) (t) > 0 = lim_{t ↓ 0} B_{\infty} (X; p) (t) ⩾ inf_{t > 0} B_{\infty} (X; p) (t)

and, by (A14) and (A13), for all small enough real

s > 0

lim_{t ↓ 0} B_{\infty} (X; p) (t) = 0 > s ln \frac{E e^{X / s}}{p} = B_{\infty} (X; p) (s) ⩾ inf_{t > 0} B_{\infty} (X; p) (t) .

It follows that for some positive real

t_{min}

and

t_{max}

inf_{t > 0} B_{\infty} (X; p) (t) = inf_{t_{min} ⩽ t ⩽ t_{max}} B_{\infty} (X; p) (t) = min_{t_{min} ⩽ t ⩽ t_{max}} B_{\infty} (X; p) (t);

the latter equality here follows by the continuity of

B_{\infty} (X; p) (t)

in

t \in (0, \infty)

, which in turn takes place by the Case 2 condition

x_{*} < \infty

. This completes the consideration of Subcase 2.1 for Part (ii) of the proposition.

Subcase 2.2:

p_{*} \in [p, 1)

. Here, note that

P (X < 0) > 0

(since

p_{*} < 1

) and

E e^{X / t} = p_{*} + E e^{X / t} I {X < 0}

. Therefore, if t is decreasing from ∞ to zero, then

E e^{X / t}

is strictly decreasing and hence

ln \frac{E e^{X / t}}{p}

is strictly decreasing – to

ln \frac{p_{*}}{p} ⩾ 0

, by (A13) and the case condition

p_{*} \in [p, 1)

. So,

ln \frac{E e^{X / t}}{p} > 0

for all

t > 0

and hence

B_{\infty} (X; p) (t) = t ln \frac{E e^{X / t}}{p}

is strictly decreasing if t is decreasing from ∞ to 0. It follows that, in Subcase 2.2,

{inf}_{t \in T_{α}} = {inf}_{t \in (0, \infty)}

in (3.8) is not attained; rather,

{inf}_{t > 0} B_{\infty} (X; p) (t) = {lim}_{t ↓ 0} B_{\infty} (X; p) (t) = 0 = x_{*}

, in view of (A14) and the assumption

x_{*} = 0

. It remains to consider

Subcase 2.3:

p_{*} = 1

. Then

P (X = 0) = 1

and hence

B_{\infty} (X; p) (t) = t ln \frac{1}{p}

, so that, as in Subcase 2.2,

{inf}_{t \in T_{α}} = {inf}_{t \in (0, \infty)}

in (3.8) is not attained, and

{inf}_{t > 0} B_{\infty} (X; p) (t) = {lim}_{t ↓ 0} B_{\infty} (X; p) (t) = 0 = x_{*}

.

Now Proposition 4.3 is completely proved. ☐

Proof of Proposition 4.4. See the proof of Proposition 3.6 in [8]. ☐

Proof of Proposition 5.2. To prove the “if” part of the proposition, suppose that H is

\frac{1}{2}

-Lipschitz and take any r.v.’s X and Y such that

X \overset{st}{⩽} Y

. We have to show that then

R_{H} (X) ⩽ R_{H} (Y)

. By (2.26) and because

R_{H} (X)

depends only on the distribution of X, w.l.o.g.

X ⩽ Y

. Let

(\tilde{X}, \tilde{Y})

be an independent copy of the pair

(X, Y)

. Then, by (5.2), the

\frac{1}{2}

-Lipschitz condition, the triangle inequality, and the condition

X ⩽ Y

,

\begin{matrix} R_{H} (X) - R_{H} (Y) & = E (X - Y) + E H (| X - \tilde{X} |) - E H (| Y - \tilde{Y} |) \\ ⩽ E (X - Y) + \frac{1}{2} E (| X - \tilde{X} | - | Y - \tilde{Y} |) \\ ⩽ E (X - Y) + \frac{1}{2} E | X - \tilde{X} - Y + \tilde{Y} | \\ ⩽ E (X - Y) + \frac{1}{2} E (| X - Y | + | \tilde{X} - \tilde{Y} |) \\ = E (X - Y) + E | X - Y | = E (X - Y) + E (Y - X) = 0, \end{matrix}

so that the “if” part of Proposition 5.2 is verified.

To prove the “only if” part of the proposition, suppose that

R_{H} (X)

is nondecreasing in X with respect to the stochastic dominance of order 1 and take any x and y in

[0, \infty)

such that

x < y

. It is enough to show that then

| H (x) - H (y) | ⩽ \frac{1}{2} (y - x)

. Take also an arbitrary

p \in (0, 1)

. Let X and Y be such r.v.’s that

P (X = 0) = 1

if

x = 0

,

P (X = x) = p = 1 - P (X = 0)

if

x \in (0, \infty)

, and

P (Y = y) = p = 1 - P (Y = 0)

. Then

X \overset{st}{⩽} Y

, whence, by (5.2),

0 ⩾ \frac{1}{p} [R_{H} (X) - R_{H} (Y)] = x - y + 2 (1 - p) [H (x) - H (y)],

which yields

H (x) - H (y) ⩽ \frac{1}{2 (1 - p)} (y - x)

for an arbitrary

p \in (0, 1)

and hence

H (x) - H (y) ⩽ \frac{1}{2} (y - x) .

(A15)

Similarly, letting now X and Y be such r.v.’s that

P (X = - y) = p = 1 - P (X = 0)

,

P (Y = 0) = 1

if

x = 0

, and

P (Y = - x) = p = 1 - P (Y = 0)

if

x \in (0, \infty)

, one has

X \overset{st}{⩽} Y

and hence

0 ⩾ \frac{1}{p} [R_{H} (X) - R_{H} (Y)] = - y + x + 2 (1 - p) [H (y) - H (x)],

which yields

H (y) - H (x) ⩽ \frac{1}{2} (y - x)

. Thus, by (A15),

| H (x) - H (y) | ⩽ \frac{1}{2} (y - x)

. ☐

Proof of Proposition 5.3. To prove the “if” part of the proposition, suppose that

H = κ id

for some

κ \in [0, \frac{1}{2}]

. We have to check that then

R_{H} (X)

has the translation invariance, subadditivity, positive homogeneity, and monotonicity properties and thus is coherent. As noted in the discussion in Section 5,

R_{H} (X)

is translation invariant for any function H. It is also obvious that

R_{κ id} (X)

is positive homogeneous for any

κ \in [0, \infty)

. Next, as also noted in the discussion in Section 5,

R_{H} (X)

is convex in X whenever the function H is convex and nondecreasing. Indeed, let then

({\tilde{X}}_{0}, {\tilde{X}}_{1})

be an independent copy in distribution of a pair

(X_{0}, X_{1})

of r.v.’s, and introduce

X_{λ} : = (1 - λ) X_{0} + λ X_{1}

and

{\tilde{X}}_{λ} : = (1 - λ) {\tilde{X}}_{0} + λ {\tilde{X}}_{1}

, for an arbitrary

λ \in (0, 1)

. Then

\begin{matrix} R_{H} (X_{λ}) & = E X_{λ} + E H (| X_{λ} - {\tilde{X}}_{λ} |) \\ = (1 - λ) E X_{0} + λ E X_{1} + E H (| (1 - λ) (X_{0} - {\tilde{X}}_{0}) + λ (X_{1} - {\tilde{X}}_{1}) |) \\ ⩽ (1 - λ) E X_{0} + λ E X_{1} + E H ((1 - λ) | X_{0} - {\tilde{X}}_{0} | + λ | X_{1} - {\tilde{X}}_{1} |) \\ ⩽ (1 - λ) E X_{0} + λ E X_{1} + (1 - λ) E H (| X_{0} - {\tilde{X}}_{0} |) + λ E H (| X_{1} - {\tilde{X}}_{1} |) \\ = (1 - λ) R_{H} (X_{0}) + λ R_{H} (X_{1}) . \end{matrix}

Thus, the convexity property of

R_{H} (X)

is verified, which, as noted earlier, is equivalent to the subadditivity given the positive homogeneity. Now, to finish the proof of the “if” part of Proposition 5.3, it remains to notice that the monotonicity property of

R_{κ id} (X)

for

κ \in [0, \frac{1}{2}]

follows immediately from Proposition 5.2.

To prove the “only if” part of the proposition, suppose that the function H is such that

R_{H} (X)

is coherent and thus positive homogeneous, monotonic, and subadditive (as noted before,

R_{H} (X)

is translation invariant for any H). Take any

p \in (0, 1)

and let X here be a r.v. such that

P (X = 1) = p = 1 - P (X = 0)

. Then, by the positive homogeneity, for any real

u > 0

one has

0 = R_{H} (u X) - u R_{H} (X) = a A + B,

where

B : = (1 - u) H (0)

,

A : = H (u) - u H (1) - B

, and

a : = 2 p (1 - p)

, so that the range of values of a is the entire interval

(0, \frac{1}{2})

as p varies in the interval

(0, 1)

. Thus,

a A + B = 0

for all

a \in (0, \frac{1}{2})

. On the other hand,

a A + B

is a polynomial in a, with coefficients A and B not depending on a. It follows that

A = B = 0

, which yields

H (u) = u H (1)

for all

u \in (0, \infty)

and

H (0) = 0

. Hence,

H (u) = u H (1)

for all real

u ⩾ 0

. In other words,

H = κ id

, with

κ : = H (1)

. Then the monotonicity property and Proposition 5.2 imply that

| κ | ⩽ \frac{1}{2}

. It remains to show that, necessarily,

κ ⩾ 0

. Take here X and Y to be independent standard normal r.v.’s. Then, by the subadditivity,

2 κ E | X | = R_{κ id} (X + Y) ⩽ R_{κ id} (X) + R_{κ id} (Y) = 2 \sqrt{2} κ E | X |,

whence indeed

κ ⩾ 0

. ☐

Conflicts of Interest

The author declares no conflict of interest.

References

P. Artzner, F. Delbaen, J.-M. Eber, and D. Heath. “Coherent measures of risk.” Math. Financ. 9 (1999): 203–228. [Google Scholar] [CrossRef]
R.T. Rockafellar, and S. Uryasev. “Conditional value at risk for general loss distributions.” J. Bank. Financ. 26 (2002): 1443–1471. [Google Scholar] [CrossRef]
R.T. Rockafellar, and S. Uryasev. “Optimization of conditional value at risk.” J. Risk 2 (2000): 21–41. [Google Scholar]
H. Grootveld, and W.G. Hallerbach. Upgrading value at risk from diagnostic metric to decision variable: A wise thing to do? In Risk Measures in the 21st Century. Hoboken, NJ, USA: Wiley, 2004, pp. 33–50. [Google Scholar]
I. Pinelis. “Optimal tail comparison based on comparison of moments.” In High Dimensional Probability (Oberwolfach, 1996). Basel, Switzerland: Birkhäuser, 1998, Volume 43, pp. 297–314. [Google Scholar]
I. Pinelis. “On the Bennett-Hoeffding Inequality.” 2009. Available online: http://arxiv.org/abs/0902.4058 (accessed on 24 February 2009).
I. Pinelis. “On the Bennett–Hoeffding inequality.” Annales de l’Institut Henri Poincaré, Probabilités et Statistiques 50 (2014): 15–27. [Google Scholar] [CrossRef]
I. Pinelis. “An Optimal Three-Way Stable and Monotonic Spectrum of Bounds On Quantiles: A Spectrum of Coherent Measures of Financial Risk and Economic Inequality, Version 1.” 2013. Available online: http://arxiv.org/abs/1310.6025 (accessed on 24 October 2013).
I. Pinelis. “Fractional sums and integrals of r-concave tails and applications to comparison probability inequalities.” In Advances in Stochastic Inequalities (Atlanta, GA, 1997). Providence, RI, USA: American Mathematical Society, 1999, Volume 234, pp. 149–168. [Google Scholar]
I. Pinelis. “Binomial upper bounds on generalized moments and tail probabilities of (super)martingales with differences bounded from above.” In High Dimensional Probability. Beachwood, OH, USA: Institute of Mathematical Statistics, 2006, Volume 51, pp. 33–52. [Google Scholar]
M.L. Eaton. “A probability inequality for linear combinations of bounded random variables.” Ann. Statist. 2 (1974): 609–613. [Google Scholar] [CrossRef]
J.-M. Dufour, and M. Hallin. “Improved Eaton bounds for linear combinations of bounded random variables, with statistical applications.” J. Am. Stat. Assoc. 88 (1993): 1026–1033. [Google Scholar] [CrossRef]
I. Pinelis. “Extremal probabilistic problems and Hotelling’s T2 test under symmetry condition.” 1991. Available online: http://arxiv.org/abs/math/0701806 (accessed on 31 January 2007).
I. Pinelis. “Extremal probabilistic problems and Hotelling’s ^T2 test under a symmetry condition.” Ann. Stat. 22 (1994): 357–368. [Google Scholar] [CrossRef]
P. Billingsley. Convergence of Probability Measures. New York, NY, USA: John Wiley & Sons Inc., 1968. [Google Scholar]
M. Shaked, and J.G. Shanthikumar. Stochastic Orders. New York, NY, USA: Springer Series in Statistics; Springer, 2007. [Google Scholar]
P.C. Fishburn. “Continua of stochastic dominance relations for unbounded probability distributions.” J. Math. Econ. 7 (1980): 271–285. [Google Scholar] [CrossRef]
P.C. Fishburn. “Continua of stochastic dominance relations for bounded probability distributions.” J. Math. Econ. 3 (1976): 295–311. [Google Scholar] [CrossRef]
A.B. Atkinson. “More on the measurement of inequality.” J. Econ. Inequal. 6 (2008): 277–283. [Google Scholar] [CrossRef]
P. Muliere, and M. Scarsini. “A note on stochastic dominance and inequality measures.” J. Econ. Theory 49 (1989): 314–323. [Google Scholar] [CrossRef]
S. Ortobelli, S.T. Rachev, H. Shalit, and F.J. Fabozzi. “The Theory of Orderings and Risk Probability Functionals.” 2006. Available online: http://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CDIQFjAA&url=http (accessed on 15 July 2013).
S. Ortobelli, S.T. Rachev, H. Shalit, and F.J. Fabozzi. “Orderings and probability functionals consistent with preferences.” Appl. Math. Financ. 16 (2009): 81–102. [Google Scholar] [CrossRef]
I. Pinelis. “(Quasi)additivity Properties of the Legendre–Fenchel Transform and Its Inverse, with Applications in Probability.” 2013. Available online: http://arxiv.org/abs/1305.1860 (accessed on 5 June 2013).
E. Rio. Personal Communication, 2013.
E. De Giorgi. “Reward-risk portfolio selection and stochastic dominance.” J. Bank. Financ. 29 (2005): 895–926. [Google Scholar] [CrossRef]
G.C. Pflug. “Some remarks on the value at risk and the conditional value at risk.” In Probabilistic Constrained Optimization. Dordrecht, The Netherlands: Kluwer Academic Publishers, 2000, Volume 49, pp. 272–281. [Google Scholar]
R.M. Corless, G.H. Gonnet, D.E.G. Hare, D.J. Jeffrey, and D.E. Knuth. “On the Lambert W function.” Adv. Comput. Math. 5 (1996): 329–359. [Google Scholar] [CrossRef]
I. Pinelis. “Positive-part moments via the Fourier–Laplace transform.” J. Theor. Probab. 24 (2011): 409–421. [Google Scholar] [CrossRef]
A. Kibzun, and A. Chernobrovov. “Equivalence of the problems with quantile and integral quantile criteria.” Autom. Remote Control 74 (2013): 225–239. [Google Scholar] [CrossRef]
C. Acerbi, and D. Tasche. “Expected shortfall: A natural coherent alternative to value at risk.” Econ. Notes 31 (2002): 379–388. [Google Scholar] [CrossRef]
S.T. Rachev, S. Stoyanov, and F.J. Fabozzi. Advanced Stochastic Models, Risk Assessment, and Portfolio Optimization: The Ideal Risk, Uncertainty, and Performance Measures. Hoboken, NJ, USA: John Wiley, 2007. [Google Scholar]
R.H. Litzenberger, and D.M. Modest. “Crisis and non-crisis risk in financial markets: A unified approach to risk management.” In The Known, the Unknown, and Unknowable in Financial Risk Management. Princeton, NJ, USA: Princeton University Press, 2008, pp. 74–102. [Google Scholar]
H. Mausser, and D. Rosen. “Efficient risk/return frontiers for credit risk.” Algo Res. Q. 2 (1999): 35–47. [Google Scholar]
F. Bassi, P. Embrechts, and M. Kafetzaki. “Risk management and quantile estimation.” In A Practical Guide to Heavy Tails (Santa Barbara, CA, 1995). Boston, MA, USA: Birkhäuser Boston, 1998, pp. 111–130. [Google Scholar]
P. Embrechts, C. Klüppelberg, and T. Mikosch. Modelling Extremal Events for Insurance and Finance. New York, NY, USA: Springer, 1997. [Google Scholar]
P.C. Fishburn. “Mean-risk analysis with risk associated with below-target returns.” Am. Econ. Rev. 67 (1977): 116–126. [Google Scholar]
J.A. Clarkson. “Uniformly convex spaces.” Trans. Amer. Math. Soc. 40 (1936): 396–414. [Google Scholar] [CrossRef]
M. Frittelli, and E.R. Gianin. “Dynamic convex risk measures.” In Risk Measures in the 21st Century. Hoboken, NJ, USA: Wiley, 2004, pp. 227–248. [Google Scholar]
M.E. Yaari. “The dual theory of choice under risk.” Econometrica 55 (1987): 95–115. [Google Scholar] [CrossRef]
S. Yitzhaki. “Stochastic dominance, mean variance, and Gini’s mean difference.” Am. Econ. Rev. 72 (1982): 178–185. [Google Scholar]
A. Cillo, and P. Delquie. “Mean-risk analysis with enhanced behavioral content.” Eur. J. Oper. Res. 239 (2014): 764–775. [Google Scholar] [CrossRef]
P. Delquie, and A. Cillo. “Disappointment without prior expectation: A unifying perspective on decision under risk.” J. Risk Uncertain. 33 (2006): 197–215. [Google Scholar] [CrossRef]
M.J. Machina. “Expected utility analysis without the independence axiom.” Econometrica 50 (1982): 277–323. [Google Scholar] [CrossRef]
R.T. Rockafellar, S. Uryasev, and M. Zabarankin. “Generalized deviations in risk analysis.” Financ. Stoch. 10 (2006): 51–74. [Google Scholar] [CrossRef]
R. Mansini, W. Ogryczak, and M.G. Speranza. “LP solvable models for portfolio optimization: A classification and computational comparison.” IMA J. Manag. Math. 14 (2003): 187–220. [Google Scholar] [CrossRef]
W. Ogryczak, and A. Ruszczyński. “On consistency of stochastic dominance and mean-semideviation models.” Math. Program. 89 (2001): 217–232. [Google Scholar] [CrossRef]
S. Ortobelli, S.T. Rachev, H. Shalit, and F.J. Fabozzi. “Risk Probability Functionals and Probability Metrics Applied To Portfolio Theory.” 2007. Available online: http://www.pstat.ucsb.edu/research/papers/2006mid/view.pdf (accessed on 15 July 2013).
W. Ogryczak, and A. Ruszczyński. “Dual stochastic dominance and related mean-risk models.” SIAM J. Optim. 13 (2002): 60–78. [Google Scholar] [CrossRef]
A. Kibzun, and E.A. Kuznetsov. “Comparison of VaR and CVaR criteria.” Autom.Remote Control 64 (2003): 1154–1164. [Google Scholar] [CrossRef]
A.B. Atkinson. “On the measurement of inequality.” J. Econom. Theory 2 (1970): 244–263. [Google Scholar] [CrossRef]
C. Acerbi. “Spectral measures of risk: A coherent representation of subjective risk aversion.” J. Bank. Financ. 26 (2002): 1505–1518. [Google Scholar] [CrossRef]
R. Giacometti, and S. Ortobelli. “Risk measures for asset allocation models.” In Risk Measures in the 21st Century. Hoboken, NJ, USA: Wiley, 2004, pp. 69–86. [Google Scholar]
A.D. Roy. “Safety first and the holding of assets.” Econometrica 20 (1952): 431–449. [Google Scholar] [CrossRef]
I. Pinelis. “Exact inequalities for sums of asymmetric random variables, with applications.” Probab. Theory Relat. Fields 139 (2007): 605–635. [Google Scholar] [CrossRef]
I. Pinelis. “A Necessary and Sufficient Condition on the Stability of the Infimum of Convex Functions.” 2013. Available online: http://arxiv.org/abs/1307.3806 (accessed on 7 August 2013).
R.T. Rockafellar. Convex Analysis. Princeton Landmarks in Mathematics; Princeton, NJ, USA: Princeton University Press, 1997, Reprint of the 1970 original, Princeton Paperbacks. [Google Scholar]
E. Rio. “English translation of the monograph Théorie asymptotique des processus aléatoires faiblement dépendants (2000) by E. Rio.” 2012, Work in Progress. [Google Scholar]
E. Rio. “Local invariance principles and their application to density estimation.” Probab. Theory Relat. Fields 98 (1994): 21–45. [Google Scholar] [CrossRef]

© 2014 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pinelis, I. An Optimal Three-Way Stable and Monotonic Spectrum of Bounds on Quantiles: A Spectrum of Coherent Measures of Financial Risk and Economic Inequality. Risks 2014, 2, 349-392. https://doi.org/10.3390/risks2030349

AMA Style

Pinelis I. An Optimal Three-Way Stable and Monotonic Spectrum of Bounds on Quantiles: A Spectrum of Coherent Measures of Financial Risk and Economic Inequality. Risks. 2014; 2(3):349-392. https://doi.org/10.3390/risks2030349

Chicago/Turabian Style

Pinelis, Iosif. 2014. "An Optimal Three-Way Stable and Monotonic Spectrum of Bounds on Quantiles: A Spectrum of Coherent Measures of Financial Risk and Economic Inequality" Risks 2, no. 3: 349-392. https://doi.org/10.3390/risks2030349

Article Menu

An Optimal Three-Way Stable and Monotonic Spectrum of Bounds on Quantiles: A Spectrum of Coherent Measures of Financial Risk and Economic Inequality

Abstract

1. Introduction

2. An Optimal Three-Way Stable and Three-Way Monotonic Spectrum of Upper Bounds on Tail Probabilities

3. An Optimal Three-Way Stable and Three-Way Monotonic Spectrum of Upper Bounds on Quantiles

4. Computation of the Tail Probability and Quantile Bounds

4.1. Computation of $P_{α} (X; x)$

4.2. Computation of $Q_{α} (X; p)$

4.3. Optimization of the Risk Measures $Q_{α} (X; p)$ with Respect to X

4.4. Additional Remarks on the Computation and Optimization

5. Implications for Risk Assessment in Finance and Inequality Modeling in Economics

5.1. The Spectrum ${(Q_{α} (X; p))}_{α \in [0, \infty]}$ Contains $VaR$ and $CVaR$ .

5.2. The Spectrum Parameter α as a Risk Sensitivity Index

5.3. Coherent and Non-Coherent Measures of Risk

5.4. Other Terminology Used in the Literature for Some of the Listed Properties of $Q_{α} (\cdot; p)$

5.5. Gini-Type Mean Differences and Related Risk Measures

5.6. A Lorentz-Type Parametric Family of Risk Measures

5.7. Spectral Risk Measures

5.8. Risk Measures Reinterpreted as Measures of Economic Inequality

5.9. “Explicit” Expressions of $Q_{α} (X; p)$

6. Conclusions

Acknowledgment

Appendix

A. Proofs

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

An Optimal Three-Way Stable and Monotonic Spectrum of Bounds on Quantiles: A Spectrum of Coherent Measures of Financial Risk and Economic Inequality

Abstract

1. Introduction

2. An Optimal Three-Way Stable and Three-Way Monotonic Spectrum of Upper Bounds on Tail Probabilities

3. An Optimal Three-Way Stable and Three-Way Monotonic Spectrum of Upper Bounds on Quantiles

4. Computation of the Tail Probability and Quantile Bounds

4.1. Computation of P α ( X ; x )

4.2. Computation of Q α ( X ; p )

4.3. Optimization of the Risk Measures Q α ( X ; p ) with Respect to X

4.4. Additional Remarks on the Computation and Optimization

5. Implications for Risk Assessment in Finance and Inequality Modeling in Economics

5.1. The Spectrum Q α ( X ; p ) α ∈ [ 0 , ∞ ] Contains VaR and CVaR .

5.2. The Spectrum Parameter α as a Risk Sensitivity Index

5.3. Coherent and Non-Coherent Measures of Risk

5.4. Other Terminology Used in the Literature for Some of the Listed Properties of Q α ( · ; p )

5.5. Gini-Type Mean Differences and Related Risk Measures

5.6. A Lorentz-Type Parametric Family of Risk Measures

5.7. Spectral Risk Measures

5.8. Risk Measures Reinterpreted as Measures of Economic Inequality

5.9. “Explicit” Expressions of Q α ( X ; p )

6. Conclusions

Acknowledgment

Appendix

A. Proofs

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.1. Computation of $P_{α} (X; x)$

4.2. Computation of $Q_{α} (X; p)$

4.3. Optimization of the Risk Measures $Q_{α} (X; p)$ with Respect to X

5.1. The Spectrum ${(Q_{α} (X; p))}_{α \in [0, \infty]}$ Contains $VaR$ and $CVaR$ .

5.4. Other Terminology Used in the Literature for Some of the Listed Properties of $Q_{α} (\cdot; p)$

5.9. “Explicit” Expressions of $Q_{α} (X; p)$