Next Article in Journal
Diffusion Limit for Single-Server Retrial Queues with Renewal Input and Outgoing Calls
Next Article in Special Issue
On Ostrowski Type Inequalities via the Extended Version of Montgomery’s Identity
Previous Article in Journal
Evaluating Video Conferencing Software for Remote Working Using Two-Stage Grey MCDM: A Case Study from Vietnam
Previous Article in Special Issue
An Application of Hayashi’s Inequality for Differentiable Functions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Converse Jensen-Type Inequality for Generalized f-Divergences and Zipf–Mandelbrot Law

Faculty of Textile Technology, University of Zagreb, 10000 Zagreb, Croatia
Mathematics 2022, 10(6), 947; https://doi.org/10.3390/math10060947
Submission received: 24 January 2022 / Revised: 8 March 2022 / Accepted: 14 March 2022 / Published: 16 March 2022
(This article belongs to the Special Issue Advances in Mathematical Inequalities and Applications)

Abstract

:
Motivated by some recent investigations about the sharpness of the Jensen inequality, this paper deals with the sharpness of the converse of the Jensen inequality. These results are then used for deriving new inequalities for different types of generalized f-divergences. As divergences measure the differences between probability distributions, these new inequalities are then applied on the Zipf–Mandelbrot law as a special kind of a probability distribution.

1. Introduction

Some of the most famous inequalities in mathematics are surely the Jensen inequality and its converse. The converse Jensen inequality is given by Lah and Ribarič in [1] and separately by Edmundson in [2], so it is sometimes referred by Edmundson–Lah–Ribarič inequality. The Jensen inequality and its converse are closely connected with the Hermite-Hadamard inequality, and these three inequalities have always been the great inspiration for further investigations, generalizations, refinements, improvements and extensions. An interested reader can consult several very new papers, published just in the last few months, in order to obtain a more comprehensive understanding of the recent research progress in this field (see for example [3,4,5,6,7,8,9,10]).
In the recent papers [11,12] the authors investigated the sharpness of the Jensen inequality. However, how sharp is the converse of the Jensen inequality?
Let [ α , β ] be an interval in R . Consider the Green functions G k : [ α , β ] × [ α , β ] R , ( k = 0 , 1 , 2 , 3 , 4 ) defined by
G 0 ( t , s ) = ( t β ) ( s α ) β α for α s t , ( s β ) ( t α ) β α for t s β ,
G 1 ( t , s ) = α s , for α s t , α t , for t s β ,
G 2 ( t , s ) = t β , for α s t , s β , for t s β ,
G 3 ( t , s ) = t α , for α s t , s α , for t s β ,
G 4 ( t , s ) = β s , for α s t , β t , for t s β .
By means of these functions, the authors in [13,14] gave the uniform treatment of the Jensen type inequalities, allowing the measure also to be negative. In this paper, we continue this investigation, and we concentrate on the converse Jensen inequality.
The paper is organized as follows: after this introduction, in the Section 2 we give our main results. We analyse the sharpness of the converse of the Jensen inequality. Here, instead of the convexity of the function, we use previously mentioned Green functions. After the first theorem, the following corollaries give us some further results and an example with the condition which is easier to verify. In the Section 3, the analogous results in discrete case are presented. As we know, the Jensen inequality is important when obtaining inequalities for divergences. Therefore, in our Section 4 we use our results with the converse Jensen inequality in order to derive new inequalities for different types of generalized f-divergences. According to their definition, divergences measure the differences between probability distributions. So, to conclude the paper, in the Section 5 we apply our results with f-divergences on the special kind of a probability distribution defined as Zipf–Mandelbrot law.

2. Main Results

To simplify the notation, we denote
g ¯ = a b g ( x ) d λ ( x ) a b d λ ( x ) .
We give our first result.
Theorem 1.
Let g : [ a , b ] R be continuous function and φ : [ α , β ] R , φ C 2 ( [ α , β ] ) , where I m ( g ) [ α , β ] . Let m , M [ α , β ] ( m M ) be such that m g ( t ) M for all t [ a , b ] . Let λ : [ a , b ] R be a function of bounded variation, such that λ ( a ) λ ( b ) . Let p , q R , 1 p , q , be such that 1 p + 1 q = 1 .
Then
M g ¯ M m φ ( m ) + g ¯ m M m φ ( M ) a b φ g ( x ) d λ ( x ) a b d λ ( x ) Q · φ p
holds, where
Q = α β M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x ) q d s 1 q , f o r q ; sup s [ α , β ] M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x ) , f o r q = .
Proof. 
Using the functions G k ( k = 0 , 1 , 2 , 3 , 4 ) we can represent every function φ : [ α , β ] R , φ C 2 ( [ α , β ] ) , as:
φ ( x ) = β x β α φ ( α ) + x α β α φ ( β ) + α β G 0 ( x , s ) φ ( s ) d s ,
φ ( x ) = φ ( α ) + ( x α ) φ ( β ) + α β G 1 ( x , s ) φ ( s ) d s ,
φ ( x ) = φ ( β ) + ( x β ) φ ( α ) + α β G 2 ( x , s ) φ ( s ) d s ,
φ ( x ) = φ ( β ) ( β α ) φ ( β ) + ( x α ) φ ( α ) + α β G 3 ( x , s ) φ ( s ) d s ,
φ ( x ) = φ ( α ) + ( β α ) φ ( α ) ( β x ) φ ( β ) + α β G 4 ( x , s ) φ ( s ) d s ,
which can be easily shown by integrating by parts. For instance, for k = 0 we have
α β G 0 ( x , s ) φ ( s ) d s = α x G 0 ( x , s ) φ ( s ) d s + x β G 0 ( x , s ) φ ( s ) d s = α x ( x β ) ( s α ) β α φ ( s ) d s + x β ( s β ) ( x α ) β α φ ( s ) d s = x β β α α x ( s α ) φ ( s ) d s + x α β α x β ( s β ) φ ( s ) d s = x β β α ( s α ) φ ( s ) | α x φ ( s ) | α x + x α β α ( s β ) φ ( s ) | x β φ ( s ) | x β = φ ( x ) β x β α φ ( α ) x α β α φ ( β ) ,
which proves the first identity. For k = 1 we have
α β G 1 ( x , s ) φ ( s ) d s = α x G 1 ( x , s ) φ ( s ) d s + x β G 1 ( x , s ) φ ( s ) d s = α x ( α s ) φ ( s ) d s + x β ( α x ) φ ( s ) d s = ( α s ) φ ( s ) | α x + φ ( s ) | α x + ( α x ) φ ( s ) | x β = φ ( x ) φ ( α ) + ( α x ) φ ( β ) ,
and this gives us the second identity. The other identities can be proved analogously.
Furthermore, by simple calculation using these identities it can be shown that for every function φ : [ α , β ] R , φ C 2 ( [ α , β ] ) , and for any k { 0 , 1 , 2 , 3 , 4 } holds
M g ¯ M m φ ( m ) + g ¯ m M m φ ( M ) a b φ g ( x ) d λ ( x ) a b d λ ( x ) = α β M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x ) φ ( s ) d s .
Now, using the triangle inequality for integrals we get
M g ¯ M m φ ( m ) + g ¯ m M m φ ( M ) a b φ g ( x ) d λ ( x ) a b d λ ( x ) = α β M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x ) φ ( s ) d s α β M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x ) φ ( s ) d s ,
and then applying the Hölder inequality we get the statement of our theorem. □
Let us see what happens for q = 1 , p = . If the term
M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x )
has the same positivity for all s [ α , β ] , then we can calculate Q. The following result holds.
Corollary 1.
Let g : [ a , b ] R be continuous function and φ : [ α , β ] R , φ C 2 ( [ α , β ] ) , where I m ( g ) [ α , β ] . Let m , M [ α , β ] ( m M ) be such that m g ( t ) M for all t [ a , b ] . Let λ : [ a , b ] R be a function of bounded variation, such that λ ( a ) λ ( b ) . If for any k { 0 , 1 , 2 , 3 , 4 } for all s [ α , β ] the inequality
M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x ) 0
holds, or if for any k { 0 , 1 , 2 , 3 , 4 } for all s [ α , β ] the reverse inequality in (1) holds, then
M g ¯ M m φ ( m ) + g ¯ m M m φ ( M ) a b φ g ( x ) d λ ( x ) a b d λ ( x ) 1 2 · φ · M g ¯ M m · m 2 + g ¯ m M m · M 2 a b g ( x ) 2 d λ ( x ) a b d λ ( x ) .
Proof. 
Applying Theorem 1 for q = 1 , p = we get
M g ¯ M m φ ( m ) + g ¯ m M m φ ( M ) a b φ g ( x ) d λ ( x ) a b d λ ( x ) φ · α β M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x ) d s .
If the term M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x ) doesn’t change it’s positivity for all s [ α , β ] , we can calculate the integral on the right side of (3).
Let us start with the case when k = 0 . We have
α β G 0 ( t , s ) d s = α t ( t β ) ( s α ) β α d s + t β ( s β ) ( t α ) β α d s = t β β α α t ( s α ) d s + t α β α t β ( s β ) d s = t β β α 1 2 ( t 2 α 2 ) α ( t α ) + t α β α 1 2 ( β 2 t 2 ) β ( β t ) = 1 2 ( t α ) ( t β ) ,
and therefore it is
α β M g ¯ M m G 0 m , s + g ¯ m M m G 0 M , s a b G 0 g ( x ) , s d λ ( x ) a b d λ ( x ) d s = M g ¯ M m α β G 0 m , s d s + g ¯ m M m α β G 0 M , s d s 1 a b d λ ( x ) a b α β G 0 g ( x ) , s d s d λ ( x ) = 1 2 M g ¯ M m ( m α ) ( m β ) + 1 2 g ¯ m M m ( M α ) ( M β ) 1 a b d λ ( x ) a b 1 2 g ( x ) α ) ( g ( x ) β d λ ( x ) = 1 2 · M g ¯ M m · m 2 + g ¯ m M m · M 2 a b g ( x ) 2 d λ ( x ) a b d λ ( x ) .
Similarly, when we consider the case when k = 1 , we have
α β G 1 ( t , s ) d s = α t ( α s ) d s + t β ( α t ) d s = α α t d s α t s d s + ( α t ) t β d s = α ( t α ) 1 2 ( t 2 α 2 ) + ( α t ) ( β t ) = 1 2 ( t α ) ( t + α 2 β ) ,
and we obtain
α β M g ¯ M m G 1 m , s + g ¯ m M m G 1 M , s a b G 1 g ( x ) , s d λ ( x ) a b d λ ( x ) d s = M g ¯ M m α β G 1 m , s d s + g ¯ m M m α β G 1 M , s d s 1 a b d λ ( x ) a b α β G 1 g ( x ) , s d s d λ ( x ) = 1 2 M g ¯ M m ( m α ) ( m + α 2 β ) + 1 2 g ¯ m M m ( M α ) ( M + α 2 β ) 1 a b d λ ( x ) a b 1 2 g ( x ) α ) ( g ( x ) + α 2 β d λ ( x ) = 1 2 · M g ¯ M m · m 2 + g ¯ m M m · M 2 a b g ( x ) 2 d λ ( x ) a b d λ ( x ) .
For k = 2 we have
α β G 2 ( t , s ) d s = α t ( t β ) d s + t β ( s β ) d s = 1 2 ( t β ) ( t + β 2 α ) ,
for k = 3
α β G 3 ( t , s ) d s = α t ( t α ) d s + t β ( s α ) d s = 1 2 ( t α ) 2 + ( β α ) 2 ,
and for k = 4
α β G 4 ( t , s ) d s = α t ( β s ) d s + t β ( β t ) d s = 1 2 ( β t ) 2 + ( β α ) 2 ,
and using the same procedure, we get the same result. Thus, for k = 0 , 1 , 2 , 3 , 4 we have that
α β M g ¯ M m G k m , s + g ¯ m M m G k M , s a b G k g ( x ) , s d λ ( x ) a b d λ ( x ) d s = 1 2 · M g ¯ M m · m 2 + g ¯ m M m · M 2 a b g ( x ) 2 d λ ( x ) a b d λ ( x ) .
If for all s [ α , β ] the inequality (1) holds, then (3) becomes
M g ¯ M m φ ( m ) + g ¯ m M m φ ( M ) a b φ g ( x ) d λ ( x ) a b d λ ( x ) 1 2 · φ · M g ¯ M m · m 2 + g ¯ m M m · M 2 a b g ( x ) 2 d λ ( x ) a b d λ ( x ) ,
and if for all s [ α , β ] the reverse inequality in (1) holds, then (3) becomes
M g ¯ M m φ ( m ) + g ¯ m M m φ ( M ) a b φ g ( x ) d λ ( x ) a b d λ ( x ) 1 2 · φ · a b g ( x ) 2 d λ ( x ) a b d λ ( x ) M g ¯ M m · m 2 g ¯ m M m · M 2 .
So, if for all s [ α , β ] the inequality (1) holds or if for all s [ α , β ] the reverse inequality in (1) holds, in both cases (2) is valid. □
Remark 1.
Note that (2) can also be expressed as
M g ¯ M m φ ( m ) + g ¯ m M m φ ( M ) a b φ g ( x ) d λ ( x ) a b d λ ( x ) 1 2 · φ · g ¯ ( m + M ) m · M a b g ( x ) 2 d λ ( x ) a b d λ ( x ) .
Let us consider the case when k = 0 . If we set that m = α and M = β , we have that G 0 ( α , s ) = G 0 ( β , s ) = 0 , and the inequality (1) transforms into
a b G 0 g ( x ) , s d λ ( x ) a b d λ ( x ) 0 .
Therefore, we have the following result.
Corollary 2.
Let g : [ a , b ] R be continuous function and φ : [ α , β ] R , φ C 2 ( [ α , β ] ) , where I m ( g ) [ α , β ] . Let λ : [ a , b ] R be a function of bounded variation, such that λ ( a ) λ ( b ) . If for all s [ α , β ] the inequality (4) holds, or if for all s [ α , β ] the reverse inequality in (4) holds, then
β g ¯ β α φ ( α ) + g ¯ α β α φ ( β ) a b φ g ( x ) d λ ( x ) a b d λ ( x ) 1 2 · φ · β g ¯ β α · α 2 + g ¯ α β α · β 2 a b g ( x ) 2 d λ ( x ) a b d λ ( x ) = 1 2 · φ · g ¯ ( α + β ) α · β a b g ( x ) 2 d λ ( x ) a b d λ ( x ) .
As we can see, this case looks much simpler, while the condition (4) is easier to verify than the condition (1). Similar results could also be given in cases when k = 1 , 2 , 3 , 4 , but we are not mentioning them here, because their conditions are not so simple.

3. On the Converse Jensen Type Inequality in Discrete Case

In this section, we give our results in the discrete case. We omit the proofs, as they are similar to those in the integral case from the previous section. We introduce the notation: U n = i = 1 n u i , x ¯ = 1 U n i = 1 n u i x i .
For x i [ a , b ] [ α , β ] , a b , u i R ( i = 1 , , n ) such that U n 0 , we have that for every function φ : [ α , β ] R , φ C 2 [ α , β ] , holds:
b x ¯ b a φ ( a ) + x ¯ a b a φ ( b ) 1 U n i = 1 n u i φ ( x i ) = α β b x ¯ b a G k ( a , s ) + x ¯ a b a G k ( b , s ) 1 U n i = 1 n u i G k ( x i , s ) φ ( s ) d s .
Using that fact, we obtain the following result.
Theorem 2.
Let x i [ a , b ] [ α , β ] , a b , u i R ( i = 1 , , n ) be such that U n 0 , and let φ : [ α , β ] R , φ C 2 [ α , β ] . Let p , q R , 1 p , q , be such that 1 p + 1 q = 1 .
Then
b x ¯ b a φ ( a ) + x ¯ a b a φ ( b ) 1 U n i = 1 n u i φ ( x i ) L · φ p
holds, where
L = α β b x ¯ b a G k ( a , s ) + x ¯ a b a G k ( b , s ) 1 U n i = 1 n u i G k ( x i , s ) q d s 1 q , f o r q ; sup s [ α , β ] b x ¯ b a G k ( a , s ) + x ¯ a b a G k ( b , s ) 1 U n i = 1 n u i G k ( x i , s ) , f o r q = .
Let us now see what happens for q = 1 , p = . If the term
b x ¯ b a G k ( a , s ) + x ¯ a b a G k ( b , s ) 1 U n i = 1 n u i G k ( x i , s )
has the same positivity for all s [ α , β ] , then we can calculate L. The following result holds.
Corollary 3.
Let x i [ a , b ] [ α , β ] , a b , u i R ( i = 1 , , n ) be such that U n 0 , and let φ : [ α , β ] R , φ C 2 [ α , β ] . If for any k { 0 , 1 , 2 , 3 , 4 } for all s [ α , β ] the inequality
b x ¯ b a G k ( a , s ) + x ¯ a b a G k ( b , s ) 1 U n i = 1 n u i G k ( x i , s ) 0
holds, or if for any k { 0 , 1 , 2 , 3 , 4 } for all s [ α , β ] the reverse inequality in (5) holds, then
b x ¯ b a φ ( a ) + x ¯ a b a φ ( b ) 1 U n i = 1 n u i φ ( x i ) 1 2 · φ · ( a + b ) x ¯ a b 1 U n i = 1 n u i x i 2 .
In the case when k = 0 , a = α and b = β , the inequality (5) transforms into
1 U n i = 1 n u i G 0 ( x i , s ) 0
and we obtain the following result.
Corollary 4.
Let x i [ α , β ] , α β , u i R ( i = 1 , , n ) be such that U n 0 , and let φ : [ α , β ] R , φ C 2 [ α , β ] . If for all s [ α , β ] the inequality (6) holds, or if for all s [ α , β ] the reverse inequality in (6) holds, then
β x ¯ β α φ ( α ) + x ¯ α β α φ ( β ) 1 U n i = 1 n u i φ ( x i ) 1 2 · φ · ( α + β ) x ¯ α β 1 U n i = 1 n u i x i 2 .

4. Inequalities for Generalized f -Divergences

I. Csiszár in [15] defined the f-divergence
C f ( q , p ) = i = 1 n p i f q i p i
for a function f : R + R and two positive probability distributions p = ( p 1 , , p n ) R + n , q = ( q 1 , , q n ) R + n . He considered the case when the function f is convex. Although several other authors ([16,17]) also introduced and studied this divergence, it is well known as the Csiszár f-divergence.
There exist various kinds of divergences, and all of them measure the differences between probability distributions. We focus here on the f-differences which are generalized using weights (see [18,19]), and we apply our results from the previous section in order to get new results and inequalities for these generalized f-divergences.
The generalized Csiszár f -divergence is defined by
C f ( q , p ; r ) = i = 1 n r i p i f q i p i ,
where f : R + R and p , q , r R + n .
To simplify our results, we use the following notations
P r = i = 1 n r i p i ,
Q ¯ r = 1 P r i = 1 n r i q i .
Theorem 3.
Let p , q , r R + n be such that q i p i [ a , b ] [ α , β ] , ( i = 1 , , n ) , a b , and let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . If f : [ α , β ] R , f C 2 [ α , β ] , then
b Q ¯ r b a f ( a ) + Q ¯ r a b a f ( b ) 1 P r C f ( q , p ; r ) L · f p
holds, where
L = α β b Q ¯ r b a G k ( a , s ) + Q ¯ r a b a G k ( b , s ) 1 P r i = 1 n r i p i G k ( q i p i , s ) q d s 1 q , f o r q ; sup s [ α , β ] b Q ¯ r b a G k ( a , s ) + Q ¯ r a b a G k ( b , s ) 1 P r i = 1 n r i p i G k ( q i p i , s ) , f o r q = .
Proof. 
Substituting φ : = f ,
u i : = r i p i i = 1 n r i p i , x i : = q i p i , i = 1 , , n ,
our result directly follows from Theorem 2. □
The generalized Kullback–Leibler divergence is defined by
K L ( q , p ; r ) = i = 1 n r i q i log q i p i ,
where p , q , r R + n . For this divergence we have the following result.
Theorem 4.
Let p , q , r R + n be such that q i p i [ a , b ] [ α , β ] , ( i = 1 , , n ) , a b , a , b > 0 and let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b Q ¯ r b a a log a + Q ¯ r a b a b log b 1 P r K L ( q , p ; r ) L · ( i d · log ) p
holds, where i d is the identity function and L is as defined in (7).
Proof. 
This result follows directly from Theorem 3 by setting f ( t ) = t log t , t > 0 . □
For the generalized Hellinger divergence defined by
H e ( q , p ; r ) = i = 1 n r i ( q i p i ) 2
the following result holds.
Theorem 5.
Let p , q , r R + n be such that q i p i [ a , b ] [ α , β ] , ( i = 1 , , n ) , a b , a , b > 0 and let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b Q ¯ r b a ( 1 a ) 2 + Q ¯ r a b a ( 1 b ) 2 1 P r H e ( q , p ; r ) L · f p
holds, where f ( t ) = ( 1 t ) 2 and L is as defined in (7).
Proof. 
This result follows directly from Theorem 3 by setting f ( t ) = ( 1 t ) 2 , t > 0 . #x25A1;
The generalized Rényi divergence is defined by
R e γ ( q , p ; r ) = i = 1 n r i q i γ p i 1 γ
where γ 1 , + .
Theorem 6.
Let p , q , r R + n be such that q i p i [ a , b ] [ α , β ] , ( i = 1 , , n ) , a b , a , b > 0 and let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b Q ¯ r b a a γ + Q ¯ r a b a b γ 1 P r R e γ ( q , p ; r ) L · f p
holds, where f ( t ) = t γ ( t > 0 , γ > 1 ) and L is as defined in (7).
Proof. 
This result follows directly from Theorem 3 by setting f ( t ) = t γ ( t > 0 , γ > 1 ). □
The generalized χ 2 -divergence is defined by
D χ 2 ( q , p ; r ) = i = 1 n r i ( q i p i ) 2 p i .
The following result holds.
Theorem 7.
Let p , q , r R + n be such that q i p i [ a , b ] [ α , β ] , ( i = 1 , , n ) , a b , a , b > 0 and let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b Q ¯ r b a ( a 1 ) 2 + Q ¯ r a b a ( b 1 ) 2 1 P r D χ 2 ( q , p ; r ) L · f p
holds, where f ( t ) = ( t 1 ) 2 , t > 0 , and L is as defined in (7).
Proof. 
This result follows directly from Theorem 3 by setting f ( t ) = ( t 1 ) 2 , t > 0 . □
The generalized Shannon entropy of a positive probability distribution p = ( p 1 , , p n ) is defined by
H ( p ; r ) = i = 1 n r i p i log ( p i ) .
It is a special case of the generalized Csiszár f-divergence C f ( q , p ; r ) if we set q = ( 1 , , 1 ) and f ( t ) = log t , t > 0 . We have the following.
Theorem 8.
Let p , r R + n be such that 1 p i [ a , b ] [ α , β ] , ( i = 1 , , n ) , a b , a , b > 0 and let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b 1 P r i = 1 n r i b a log a + 1 P r i = 1 n r i a b a log b 1 P r H ( p ; r ) L · log p
holds, where L is as defined in (7).

5. Applications to Zipf–Mandelbrot Law

Definition 1([20]).
Zipf–Mandelbrot law is a discrete probability distribution, depends on three parameters N 1 , 2 , , t [ 0 , and v > 0 , and it is defined by
ϕ i ; N , t , v : = 1 i + t v H N , t , v , i = 1 , , N ,
where
H N , t , v : = j = 1 N 1 j + t v .
When t = 0 , then Zipf–Mandelbrot law becomes Zipf’s law.
The Zipf–Mandelbrot law got its name after the linguist George Kingsley Zipf, who gave its primary form, and after the mathematician Benoit Mandelbrot, who gave its generalization. The Zipf law goes after the frequency of a certain word in the text, and it is used in bibliometric and in information science. It is used in linguistics, but also in economics (as Pareto’s law) when analysing the distribution of the wealth. Apart from that, this law can be found also in other disciplines like mathematics, physics, biology, computer science, social sciences, demography, etc. Here we are going to concentrate on its mathematical aspect of course. (More about the Zipf–Mandelbrot law in mathematical context can be found in [21].)
As the Zipf–Mandelbrot law is a probability distribution, and f-divergences measure the differences between two probability distributions, we can apply the results from the previous section on the Zipf–Mandelbrot law.
Suppose p , q are two Zipf–Mandelbrot laws with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively. Then
p i = ϕ i ; N , t 1 , v 1 : = 1 i + t 1 v 1 H N , t 1 , v 1 , i = 1 , , N ,
and
q i = ϕ i ; N , t 2 , v 2 : = 1 i + t 2 v 2 H N , t 2 , v 2 , i = 1 , , N ,
where
H N , t k , v k : = j = 1 N 1 j + t k v k , k = 1 , 2 .
The generalized Csiszár divergence for such p , q , and for r R + n is given by
C f ( q , p ; r ) = 1 H N , t 1 , v 1 i = 1 N r i ( i + t 1 ) v 1 f ( i + t 1 ) v 1 H N , t 1 , v 1 ( i + t 2 ) v 2 H N , t 2 , v 2 .
Using (8) and (9), we get the following expressions for P r and Q ¯ r :
P r = i = 1 N r i ( i + t 1 ) v 1 H N , t 1 , v 1 = 1 H N , t 1 , v 1 i = 1 N r i ( i + t 1 ) v 1 ,
Q ¯ r = i = 1 N r i ( i + t 2 ) v 2 H N , t 2 , v 2 i = 1 N r i ( i + t 1 ) v 1 H N , t 1 , v 1 = H N , t 1 , v 1 H N , t 2 , v 2 · i = 1 N r i ( i + t 2 ) v 2 i = 1 N r i ( i + t 1 ) v 1 ,
and we obtain the following result.
Corollary 5.
Let p , q be two Zipf–Mandelbrot laws with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively, and r R + n such that
q i p i : = i + t 1 v 1 H N , t 1 , v 1 i + t 2 v 2 H N , t 2 , v 2 [ a , b ] [ α , β ] f o r i = 1 , , N , ( a b ) .
Let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . If f : [ α , β ] R , f C 2 [ α , β ] , then
b Q ¯ r b a f ( a ) + Q ¯ r a b a f ( b ) 1 P r C f ( q , p ; r ) L · f p
where p i , q i , P r , Q ¯ r , C f ( q , p ; r ) are as defined in (8)–(12), and L is as defined in (7).
The generalized Kullbach-Leibler divergence of two Zipf–Mandelbrot laws p , q with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively, and r R + n , is given by:
K L ( q , p ; r ) = 1 H N , t 2 , v 2 i = 1 N r i ( i + t 2 ) v 2 log ( i + t 1 ) v 1 H N , t 1 , v 1 ( i + t 2 ) v 2 H N , t 2 , v 2 .
The following holds.
Corollary 6.
Let p , q be two Zipf–Mandelbrot laws with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively, and r R + n such that
q i p i : = i + t 1 v 1 H N , t 1 , v 1 i + t 2 v 2 H N , t 2 , v 2 [ a , b ] [ α , β ] f o r i = 1 , , N , ( a , b > 0 , a b ) .
Let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b Q ¯ r b a a log a + Q ¯ r a b a b log b 1 P r K L ( q , p ; r ) L · ( i d · log ) p
holds, where i d is the identity function and p i , q i , P r , Q ¯ r , K L ( q , p ; r ) are as defined in (8), (9), (11)–(13), and L is as defined in (7).
The generalized Hellinger divergence for two Zipf–Mandelbrot laws p , q with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively, and r R + n , has the following representation:
H e ( q , p ; r ) = 1 H N , t 1 , v 1 H N , t 2 , v 2 i = 1 N r i ( i + t 1 ) v 1 H N , t 1 , v 1 ( i + t 2 ) v 2 H N , t 2 , v 2 2 ( i + t 1 ) v 1 ( i + t 2 ) v 2 .
The following result holds.
Corollary 7.
Let p , q be two Zipf–Mandelbrot laws with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively, and r R + n such that
q i p i : = i + t 1 v 1 H N , t 1 , v 1 i + t 2 v 2 H N , t 2 , v 2 [ a , b ] [ α , β ] f o r i = 1 , , N , ( a , b > 0 , a b ) .
Let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b Q ¯ r b a ( 1 a ) 2 + Q ¯ r a b a ( 1 b ) 2 1 P r H e ( q , p ; r ) L · f p
where f ( t ) = ( 1 t ) 2 , and p i , q i , P r , Q ¯ r , H e ( q , p ; r ) are as defined in (8), (9), (11), (12) and (14), respectively, and L is as defined in (7).
The generalized Rényi divergence for two Zipf–Mandelbrot laws p , q with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively, and r R + n , has the following representation:
R e γ ( q , p ; r ) = H N , t 1 , v 1 γ 1 H N , t 2 , v 2 γ i = 1 N r i ( i + t 1 ) ( γ 1 ) v 1 ( i + t 2 ) γ v 2 ,
where γ 1 , + . The following result holds.
Corollary 8.
Let p , q be two Zipf–Mandelbrot laws with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively, and r R + n such that
q i p i : = i + t 1 v 1 H N , t 1 , v 1 i + t 2 v 2 H N , t 2 , v 2 [ a , b ] [ α , β ] f o r i = 1 , , N , ( a , b > 0 , a b ) .
Let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b Q ¯ r b a a γ + Q ¯ r a b a b γ 1 P r R e γ ( q , p ; r ) L · f p
holds, where f ( t ) = t γ ( t > 0 , γ > 1 ), and p i , q i , P r , Q ¯ r , R e ( q , p ; r ) are as defined in (8), (9), (11), (12) and (15), respectively, and L is as defined in (7).
The generalized χ 2 -divergence for two Zipf–Mandelbrot laws p , q with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively, and r R + n , has the following representation:
D χ 2 ( q , p ; r ) = H N , t 1 , v 1 · i = 1 N r i ( i + t 1 ) v 1 1 ( i + t 2 ) v 2 H N , t 2 , v 2 1 ( i + t 1 ) v 1 H N , t 1 , v 1 2 .
We have the following result.
Corollary 9.
Let p , q be two Zipf–Mandelbrot laws with parameters N 1 , 2 , , t 1 , t 2 0 and v 1 , v 2 > 0 , respectively, and r R + n such that
q i p i : = i + t 1 v 1 H N , t 1 , v 1 i + t 2 v 2 H N , t 2 , v 2 [ a , b ] [ α , β ] f o r i = 1 , , N , ( a , b > 0 , a b ) .
Let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b Q ¯ r b a ( a 1 ) 2 + Q ¯ r a b a ( b 1 ) 2 1 P r D χ 2 ( q , p ; r ) L · f p
holds, where f ( t ) = ( t 1 ) 2 , t > 0 , and p i , q i , P r , Q ¯ r , D χ 2 ( q , p ; r ) are as defined in (8), (9), (11), (12) and (16), respectively, and L is as defined in (7).
In addition, at the end, we also give the result for the generalized Shannon entropy of a Zipf–Mandelbrot law p with parameters N 1 , 2 , , t 1 0 , v 1 > 0 , and r R + n , which has the following representation:
H ( p ; r ) = 1 H N , t 1 , v 1 i = 1 N r i ( i + t 1 ) v 1 log ( i + t 1 ) v 1 H N , t 1 , v 1 .
Corollary 10.
Let p be a Zipf–Mandelbrot law with parameters N 1 , 2 , , t 1 0 and v 1 > 0 , and r R + n , such that
1 p i : = i + t 1 v 1 H N , t 1 , v 1 [ a , b ] [ α , β ] f o r i = 1 , , N , ( a , b > 0 , a b ) .
Let p , q R , 1 p , q , be such that 1 p + 1 q = 1 . Then
b 1 P r i = 1 n r i b a log a + 1 P r i = 1 n r i a b a log b 1 P r H ( p ; r ) L · log p
holds, where p i , P r , H ( p ; r ) are as defined in (8), (11) and (17), respectively, and L is as defined in (7).

Funding

The funding for this research and the costs of publication are covered by a lump sum granted by University of Zagreb Research Funding (PP2/21).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Lah, P.; Ribarič, M. Converse of Jensen’s inequality for convex functions. Publ. Elektroteh. Fak. Univ. Beogr. Ser. Mat. Fiz. 1973, 412–460, 201–205. [Google Scholar]
  2. Edmundson, H.P. Bounds on the expectation of a convex function of a random variable. Rand Corp. 1957. Paper No. 982. Available online: https://apps.dtic.mil/sti/citations/AD0605117 (accessed on 23 January 2022).
  3. Butt, S.I.; Agarwal, P.; Yousaf, S.; Guirao, J.L.G. Generalized fractal Jensen and Jensen–Mercer inequalities for harmonic convex function with applications. J. Inequal. Appl. 2022, 1. [Google Scholar] [CrossRef]
  4. Faisal, S.; Adil Khan, M.; Khan, T.U.; Saeed, T.; Alshehri, A.M.; Nwaeze, E.R. New “Conticrete” Hermite–Hadamard–Jensen–Mercer Fractional Inequalities. Symmetry 2022, 14, 294. [Google Scholar] [CrossRef]
  5. Liu, J.-B.; Butt, S.I.; Nasir, J.; Aslam, A.; Fahad, A.; Soontharanon, J. Jensen-Mercer variant of Hermite-Hadamard type inequalities via Atangana-Baleanu fractional operator. AIMS Math. 2022, 7, 2123–2141. [Google Scholar] [CrossRef]
  6. Sahoo, S.K.; Tariq, M.; Ahmad, H.; Kodamasingh, B.; Shaikh, A.A.; Botmart, T.; El-Shorbagy, M.A. Some Novel Fractional Integral Inequalities over a New Class of Generalized Convex Function. Fractal Fract. 2022, 6, 42. [Google Scholar] [CrossRef]
  7. Tariq, M.; Ahmad, H.; Sahoo, S.K.; Aljoufi, L.S.; Awan, S.K. A novel comprehensive analysis of the refinements of Hermite-Hadamard type integral inequalities involving special functions. J. Math. Comput. SCI-JM. 2022, 26, 330–348. [Google Scholar] [CrossRef]
  8. Tariq, M.; Ahmad, H.; Cesarano, C.; Abu-Zinadah, H.; Abouelregal, A.E.; Askar, S. Novel Analysis of Hermite–Hadamard Type Integral Inequalities via Generalized Exponential Type m-Convex Functions. Mathematics 2022, 10, 31. [Google Scholar] [CrossRef]
  9. Ullah, H.; Adil Khan, M.; Saeed, T.; Sayed, Z.M. Some Improvements of Jensen’s Inequality via 4-Convexity and Applications. J. Funct. Spaces 2022, 2020, 2157375. [Google Scholar] [CrossRef]
  10. Xu, F.; Butt, S.I.; Yousaf, S.; Aslam, A.; Zia, T.J. Generalized Fractal Jensen–Mercer and Hermite–Mercer type inequalities via h-convex functions involving Mittag–Leffler kernel. Alex. Eng. J. 2022, 61, 4837–4846. [Google Scholar] [CrossRef]
  11. Costarelli, D.; Spigler, R. How sharp is the Jensen inequality? J. Inequal. Appl. 2015, 2015, 69. [Google Scholar] [CrossRef] [Green Version]
  12. Pečarić, Đ.; Pečarić, J.; Rodić, M. About the sharpness of the Jensen inequality. J. Inequal. Appl. 2018, 2018, 337. [Google Scholar] [CrossRef] [PubMed]
  13. Pečarić, J.; Perić, I.; Rodić Lipanović, M. Uniform treatment of Jensen type inequalities. Math. Rep. 2014, 16, 183–205. [Google Scholar]
  14. Pečarić, J.; Rodić, M. Uniform treatment of Jensen type inequalities II. Math. Rep. 2019, 21, 289–310. [Google Scholar]
  15. Csiszár, I. Information-type measures of difference of probability functions and indirect observations. Studia Sci. Math. Hungar. 1967, 2, 299–318. [Google Scholar]
  16. Ali, S.M.; Silvey, S.D. A general class of coefficients of divergence of one distribution from another. J. Roy. Statist. Soc. Ser. B 1966, 28, 131–142. [Google Scholar] [CrossRef]
  17. Morimoto, T. Markov processes and the H-theorem. J. Phys. Soc. Jap. 1963, 18, 328–331. [Google Scholar] [CrossRef]
  18. Pečarić, Đ.; Pečarić, J.; Pokaz, D. Generalized Csiszár’s f-divergence for Lipschitzian functions. Math. Inequal. Appl. 2021, 24, 13–30. [Google Scholar] [CrossRef]
  19. Pečarić, Đ.; Pečarić, J.; Rodić, M. On a Jensen-type inequality for generalized f-divergences and Zipf–Mandelbrot law. Math. Inequal. Appl. 2019, 22, 1463–1475. [Google Scholar] [CrossRef] [Green Version]
  20. Horváth, L.; Pečarić, Đ.; Pečarić, J. Estimations of f- and Rényi divergences by using a cyclic refinement of the Jensen’s inequality. Bull. Malays. Math. Sci. Soc. 2019, 42, 933–946. [Google Scholar] [CrossRef]
  21. Pečarić, Đ.; Pečarić, J. (Eds.) Inequalities and Zipf–Mandelbrot law/Selected topics in information theory. In Monographs in Inequalities 15; Element: Zagreb, Croatia, 2019; p. 342. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Rodić, M. On the Converse Jensen-Type Inequality for Generalized f-Divergences and Zipf–Mandelbrot Law. Mathematics 2022, 10, 947. https://doi.org/10.3390/math10060947

AMA Style

Rodić M. On the Converse Jensen-Type Inequality for Generalized f-Divergences and Zipf–Mandelbrot Law. Mathematics. 2022; 10(6):947. https://doi.org/10.3390/math10060947

Chicago/Turabian Style

Rodić, Mirna. 2022. "On the Converse Jensen-Type Inequality for Generalized f-Divergences and Zipf–Mandelbrot Law" Mathematics 10, no. 6: 947. https://doi.org/10.3390/math10060947

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop