Next Article in Journal
Local Non-Similar Solutions for Boundary Layer Flow over a Nonlinear Stretching Surface with Uniform Lateral Mass Flux: Utilization of Third Level of Truncation
Next Article in Special Issue
Sharp Estimates for Proximity of Geometric and Related Sums Distributions to Limit Laws
Previous Article in Journal
Enhanced Marine Predators Algorithm for Solving Global Optimization and Feature Selection Problems
Previous Article in Special Issue
Poissonization Principle for a Class of Additive Statistics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Asymptotic Expansions for Symmetric Statistics with Degenerate Kernels

1
Department of Mathematics, Tokyo Gakugei University, 4-1-1 Nukuikita-machi, Tokyo 184-8501, Japan
2
Department of Mathematics, Tokyo City University, 1-28-1 Tamazutsumi, Tokyo 158-8557, Japan
Mathematics 2022, 10(21), 4158; https://doi.org/10.3390/math10214158
Submission received: 21 September 2022 / Revised: 26 October 2022 / Accepted: 29 October 2022 / Published: 7 November 2022
(This article belongs to the Special Issue Limit Theorems of Probability Theory)

Abstract

:
Asymptotic expansions for U-statistics and V-statistics with degenerate kernels are investigated, respectively, and the remainder term O ( n 1 p / 2 ) , for some p 4 , is shown in both cases. From the results, it is obtained that asymptotic expansions for the Cram e ´ r–von Mises statistics of the uniform distribution U ( 0 , 1 ) hold with the remainder term O n 1 p / 2 for any p 4 . The scheme of the proof is based on three steps. The first one is the almost sure convergence in a Fourier series expansion of the kernel function u ( x , y ) . The key condition for the convergence is the nuclearity of a linear operator T u defined by the kernel function. The second one is a representation of U-statistics or V-statistics by single sums of Hilbert space valued random variables. The third one is to apply asymptotic expansions for single sums of Hilbert space valued random variables.

1. Introduction

Asymptotic expansions for symmetric statistics are studied by many people. See, e.g., Callaert–Janssen–Veraverbeke (1980) [1], Withers (1988) [2], Maesono (2004) [3], and so on. They treat U-statistics with non-degenerate kernels. On the other hand, Bentkus—Götze (1999) [4] and Zubayraev (2011) [5] obtained optimal bounds in asymptotic expansions for U-statistics with degenerate kernels. They treat the following modified U-statistics,
W n = 1 n 2 1 i < j n ϕ ξ i , ξ j + 1 n 1 i n ϕ 1 ξ i ,
where ϕ · , · is a symmetric function, ϕ 1 · is a measurable function and ξ i are i.i.d. random variables. Wn coincides with V-statistics when
ϕ 1 x = 1 2 ϕ x , x .
If ϕ 1 x = 0 for any x, then W n coincides with U-statistics. They obtained asymptotic expansions with remainder O ( n 1 ) for the distribution function of W n . In this paper, we investigate asymptotic expansions for the simple U-statistics and the V-statistics with degree two defined by
U n = 2 n 2 1 i < j n u ξ i , ξ j , V n = 1 n 2 1 i , j n u ξ i , ξ j ,
respectively. We obtain asymptotic expansions with remainder O ( n 1 p / 2 ) for some p 4 for the distribution function of U n or V n under some assumptions for ξ i and u ( x , y ) . Our scheme of the proof is based on three steps. The first one is the almost sure convergence in a Fourier series expansion of u ξ i , ξ j . The key condition for the convergence is the nuclearity of a linear operator T u defined by the kernel function u ( x , y ) . The second one is a representation of U-statistics or V-statistics by single sums of Hilbert space valued random variables. The third one is to apply asymptotic expansions for single sums of Hilbert space valued random variables due to Sazonov—Ulyanov (1995) [6].

2. Symmetric Statistics

Let ξ j , j 1 be i.i.d. random variables with a probability distribution μ on an arbitrary measurable space X , B . Suppose that u x 1 , x 2 , , x n is a real valued symmetric function for some k 1 , i.e.,
u x 1 , x 2 , , x k = u x i 1 , x i 2 , , x i k ,
for any permutation i 1 , i 2 , , i k of 1 , 2 , , k . A statistics defined by the kernel function u x 1 , x 2 , , x k is called a symmetric statistics. The followings are the typical examples of the symmetric statistics.
Example 1.
U-statistics with degree k 1 :
U n = n k 1 1 i 1 < i 2 < < i k n u ξ i 1 , ξ i 2 , , ξ i k .
Example 2.
V-statistics with degree k 1 :
V n = n k 1 i 1 , i 2 , , i k n u ξ i 1 , ξ i 2 , , ξ i k .
In this paper, we treat V-statistics V n and U-statistics U n with degree two defined by (3) when the kernel function u x , y is degenerate, i.e.,
E u ξ 1 , x = 0 ,
for any real number x.

3. Non-Central Limit Theorems for U-Statistics with Degenerate Kernels

Assume that ξ i are i.i.d. random variables with a distribution μ . Let u x , y be a real valued symmetric function on R × R and square integrable such that
E u ξ 1 , ξ 2 2 < .
Suppose that u x , y is a degenerate kernel satisfying the condition (7). Let L 2 R , μ be the space of all square integrable functions with respect to μ . Then, according to Serfling (1980) [7], we see that the kernel u ( x , y ) induces a bounded linear operator L 2 R , μ L 2 R , μ (trace class) defined by
T u f = E u ξ 1 , x f ξ 1 = u y , x f y μ d y , f L 2 ,
which has eigenvalues λ i and eigenfunctions g i satisfying for each i 1
E g i ξ 1 = 0 , E g i 2 ξ 1 = 1 E g i ξ 1 g j ξ 1 = 0 i j , E u ξ 1 , x g i ξ 1 = λ i g i x .
With respect to (10), see Serfling (1980) [7], pp. 196 and Dunford and Schwartz (1963), pp. 905, 1009, 1083, 1087 for more details. Then we have
lim n E u ξ i , ξ j k = 1 n λ k g k ξ i g k ξ j 2 = 0 ,
for each i , j 1 . Serfling (1980) [7] showed the non-central limit theorem for U-statistics with degree 2.
Theorem 1.
(Serfling (1980) [7] )
Put θ = E u ξ 1 , ξ 2 . Let U n be a U-statistics with the degenerate kernel u ( x , y ) defined by
U n = 2 n 2 1 i < j n u ξ i , ξ j ,
Let { Z i } be i.i.d. random variables with the standard Normal distribution N ( 0 , 1 ) . Then, as n
n U n j = 1 λ j Z j 2 1 ,
where “⇒” means the weak convergence in R .
It is well known that the rate of convergence in (13) is O ( n 1 / 2 ) (See, e.g., Serfling (1980) [7] for more details). We obtain asymptotic expansions for U n and V n using asymptotic expansions due to Sazonov—Ulyanov (1995) [6] for sums of Hilbert space valued i.i.d. random variables in the next section.

4. Asymptotic Expansions for Single Sums which Hit a Ball in a Hilbert Space

In this section we consider an asymptotic expansions for sums of Hilbert space valued random vectors X i according to Sazonov—Ulyanov (1995) [6]. Let X i be a sequence of i.i.d. random vectors in a separable Hilbert space H with E X 1 = 0 and E X 1 2 = 1 , where x 2 = x , x for x H and · , · is the inner product in H. Define the covariance operator V of X1 by X 1 defined by x , y H
V x , y = E X 1 E X 1 , x X 1 E X 1 , y .
for x, yH. Denote by σ 1 2 σ 2 2 the eigenvalues of V and by e 1 , e 2 , be orthonormal eigenvectors corresponding to the eigenvalues. Put
S n = 1 σ n i = 1 n X i E X i , v k = i = 1 k σ i 1 1 k k , c k V = v k k 1 ,
where σ 2 = E X 1 E X 1 2 . Define the projection K : H H by
K y = i = 1 6 k 5 y , e i e i , y H .
Put
θ k ( L ) = sup { | E [ exp ( 1 y , X 1 ) ] | K y 1 L } .
for any L > 0 , Let Y be the H-valued Gaussian random variables with mean 0 and the covariance operator V. For a , h H , r > 0 , i = 0 , 1 , we put
Φ i ( a , r ) = P 1 i n 1 / 2 Y a < r
d h Φ i a , r = lim t Φ i a t h , r Φ i a , r t .
Define the differential operators d h k by
d h 1 Φ i a , r = d h Φ i a , r , d h k Φ i a , r = d h ( d h k 1 Φ i a , r ) , k 2 .
Put
χ j , L = I X j < L
for the indicator I ·
χ j , t = χ j , n 1 + t
and χ j = χ j , 0 . For positive integers l 1 , l 2 , , l s we put
Q s = d X 1 χ 1 l 1 d Y 1 l 1 d X 1 χ s l s d Y s l s
and for integers k 2 , 1 i k 2 , we put
A i a , r = n i i 2 2 j = 1 n n j n j l j ! 1 E Q j Φ j a , r ,
where l j = l 1 ! l j ! and denotes the summation over all, such that
l 1 3 , l 2 3 , , l j 3 , l 1 + l 2 + , + l j = 2 j + i .
The following theorem is the key result for the proofs of our theorems.
Theorem 2.
Sazonov—Ulyanov (1995) [6]
Suppose that E X 1 p < for some p 4 . For any t 0 and integer k 2 , let L be a positive number, such that
E X 1 2 1 χ j , L σ 6 k 5 2 3
Then, for L n 1 / 2
Δ n ( a , r ) : = | P S n a < r P { Y a < r } i = 1 k 2 A i ( a , r ) |
A p , s , t
+ c ( k ) exp s α c 6 k 5 ( V ) E B 2 a , r 1 χ 1 + 1 + M a , r k 2 E B k + 1 a , r 1 χ 1
+ c 6 k 5 ( V ) 1 + m 3 a , r | a | V a , a k 2 L 2 n ( k 1 ) / 2 + θ k n / ( k   log ( n / L 2 ) ) ( L ) log ( n / L 2 ) ,
where for s = a r and α 1 5 ,
A p , s , t : = n E 1 χ 1 , t + c p 1 s p n 1 p p 2 2 E X 1 p χ 1 , t χ 1 ,
B j , r = n j 2 X 1 j + m j a , r X 1 , a j ,
M a , r = m 2 a , r V a , a
and
m a , r : = min 1 , r a , a > 0 0 , a = 0 .
In addition, the terms in the asymptotic expansion for ε > 0 satisfies the estimates
A i a , r c ε , i exp s 2 2 + ε n i i s s c 6 i + 3 V × E χ i X 1 i + 2 + X 1 , a i + 2 χ i m i + 2 a , r ×
1 + m 2 i + 2 a , r 1 + m 2 i + 2 a , r V a , a i 1 + M a , r 3 i + 2
for even i, and if i is odd, then we have
A i ( a , r ) c ( ε , i ) exp s 2 2 + ε n i / s c 6 i + 3 ( V ) 1 + m 2 ( a , r ) V a , a i 1 × E X 1 , a χ i m ( a , r ) X 1 2 + X 1 i + 1 + X 1 , a i + 1 ( a , r ) m i + 1 ( a , r ) + c 6 i + 3 ( V ) m ( a , r ) V a , a 1 / 2 E χ i X 1 i + 1

5. The Sato–Mercer Theorem

In the proofs of our theorems we use the Fourier series expansion for the kernel function u ξ i , ξ j by eigenvalues and eigenfunction of the linear operator T u defined by (9). Since (11) holds in the sense of the L 2 -convergence, (11) can not be applied to show the asymptotic expansions for U-statistics or V-statistics as it is. We show that u ξ i , ξ j can be represented by the Fourier series expansion in (11) almost surely using the following Sato–Mercer theorem. (See Sato (1992) [8] for more details.)
Theorem 3.
(The Sato–Mercer theorem)
Let X be a separable metric space with a Borel measure ν on X, and K ( x , y ) be a function on X × X such that there exists a Borel-measurable subset X 0 , such that
ν ( X \ X 0 ) = 0 .
Suppose that K ( x , y ) is continuous on X 0 and satisfies
X X K x , y 2 ν d x ν d y <
and
X X K x , y f x f y ¯ ν d x ν d y 0 ,
for any f L 2 ( X , ν ) . Then, the linear operator T K on L 2 ( X , ν ) defined by
T K f x = X K x , y f y ν d y , f L 2 X , ν
is nuclear if, and only if,
X K x , x ν d x <
holds.
From Theorem 3, we have the next result.
Theorem 4.
Let ξ j , j 1 be i.i.d. random variables with the distribution μ. Let u ( x , y ) be a real valued symmetric function on R × R and T u be a linear operator defined by
T u f x = E u ξ 1 , x f ξ 1 = u y , x f y μ d y , f L 2 R , μ .
Suppose that u ( x , y ) is the square integrable degenerate kernel of the linear operator T u , such that
u 2 x , y μ d x μ d y < ,
u x , y f x f y ¯ μ d x μ d y 0 ,
for any f L 2 ( R , μ ) and
E u ξ 1 , x = u y , x μ d y = 0
for any x R . Let λ k and g k be eigenvalues and eingenfunctions of the linear operator T u , respectively. Suppose
λ k 0 , k 1 .
Furthermore assume that there exists a Lebesgue measurable subset X 0 R , such that
μ X 0 = 1
and u ( x , y ) is continuous on X 0 . Then, we have
u ξ i , ξ j = k = 1 λ k g k ξ i g k ξ j a . s . ,
for each i , j 1 .
Proof. 
It is easy to see that from (10)
E k = 1 n λ k g k ξ i g k ξ j = k = 1 n E λ k g k ξ i g k ξ j
= k = 1 n λ k E g k ξ i g k ξ j
k = 1 n λ k E g k ξ i 2 1 / 2 E g k ξ j 2 1 / 2
= k = 1 n λ k ,
for each n 1 . Tending n , (46) implies that
E k = 1 λ k g k ξ i g k ξ j k = 1 λ k .
On the other hand, from (40) and (41), u ( x , y ) satisfies (35) and (36). Therefore, T u is nuclear by Theorem 3. Hence, from (43) and the nuclearity of T u , we have
k = 1 λ k = k = 1 λ k < .
From (47) and (48), we have
E k = 1 λ k g k ξ i g k ξ j < ,
which implies
k = 1 λ k g k ξ i g k ξ j < , a . s .
Therefore, (45) is proved from (11) and (50). □
Remark 1.
If the symmetric function u ( x , y ) is piecewise continuous on R , then there exists X 0 R satisfying (44) such that u ( x , y ) is continuous on X 0 . In the next section, we show a typical example of U- or V-statistics defined by such piecewise continuous function u ( x , y ) as its kernel function.

6. Asymptotic Expansions for Degenerate V-Statistics and U-Statistics with Degree 2

For applying Theorem 2 for Hilbert space valued random variables to the proof of asymptotic expansions for V n , we represent V n by sums of Hilbert space valued random variables G i by the following method.
According to K.—Yoshihara (1994) [9], we introduce a separable Hilbert space H-equipped with the inner product · , · and the norm · as follows,
H = x = x 1 , x 2 , R k = 1 λ k x k 2 < ,
x , y = k = 1 λ k x k y k
and
x = k = 1 λ k x k 2 1 / 2 .
Using the assumptions of Theorem 4, we have from (10) and (48) that
E k = 1 λ k g k 2 ξ i = k = 1 λ k E g k 2 ξ i = k = 1 λ k < ,
which implies that we can define H-valued random variables by
G i = g 1 ξ i , g 2 ξ i , g 3 ξ i ,
for each i 1 . Let U n , n 1 and V n , n 1 be U-statistics and V-statistics with degree 2 defined by (3), respectively.
Theorem 5.
Without loss of generality we assume that θ = 0 . Suppose that ξ j , j 1 is a sequence of i.i.d. random variables with the distribution μ. Assume that u ( x , y ) is a square integrable symmetric function with respect to μ × μ satisfying (40) ∼ (42). Suppose that for some p 4
E G 1 p < .
Furthermore, without loss of generality, assume that
k = 1 λ k = 1 .
Let Y be the H-valued Gaussian random variables with mean 0 and the covariance operator V satisfying (14) with the eigenvalues σ 1 2 σ 2 2 and the orthogonal eigenvectors e 1 , e 2 , . For any t 0 , integer k 2 , let L be a positive number, such that
E G 1 2 1 χ j , L σ 6 k 5 2 3 .
Then, for L n 1 / 2 and α 1 5 ,
| P n V n r P { Y r } i = 1 k 2 A i ( 0 , r ) | A ( p , s , t ) + c ( k ) exp r α c 6 k 5 ( V ) E B 2 ( 0 , r ) 1 χ 1 + E B k + 1 ( 0 , r ) χ 1 + c 6 k 5 ( V ) L 2 n ( k 1 ) / 2 + θ k n / k log n / L 2 ( L ) log n / L 2 ,
where
Y = j = 1 λ j Z j 2 1 ,
A ( p , s , t ) = n E 1 χ 1 , t + c ( p ) ( 1 + r ) p n 1 p / 2 E G 1 p χ 1 , t χ 1
and
B j ( 0 , r ) = n j 2 j 2 2 2 G 1 j .
Proof. 
Put
h x = k = 1 λ k x k
for
x H = x = x 1 , x 2 , k = 1 λ k x k 2 <
Recall that for each i,
1 n i = 1 n G i = 1 n i = 1 n g 1 ξ i , 1 n i = 1 n g 2 ξ i , H .
Then we have
n V n = 1 n 1 i , j n u ξ i , ξ j = 1 n 1 i , j n k = 1 λ k g k ξ i g k ξ j
= 1 n k = 1 λ k 1 i , j n g k ξ i g k ξ j = 1 n k = 1 λ k i = 1 n g k ξ i 2
= 1 n i = 1 n G i .
Thus, we can apply Theorem 2 to show Theorem 5. □
Theorem 6.
Suppose that the i.i.d. random variables ξ i , i 1 obey a continuous distribution. Let v ( x , y ) be a symmetric function defined by
v x , y = u x , y , x y 0 , x = y .
Under the same assumptions in Theorem 5, the Equation (59) holds for U n with the degenerate kernel v x , y .
Proof. 
Since the i.i.d. random variables ξ i , i 1 obey a continuous distribution, we have
P ξ i ξ j = 1 i j .
Therefore, from (67) and (68)
n U n = 2 n 1 i < j n u ξ i , ξ j = 1 n 1 i , j n v ξ i , ξ j a . s .
Since the right hand side of (69) is the V-statistics with the degenerate kernel v ( x , y ) satisfying all assumptions of Theorem 5, Theorem 6 holds from Theorem 5. □
Remark 2.
From (10), E G 1 = 0 and σ 2 = E G 1 2 = 1 in Theorem 5.

7. Cramer–Von Mises Statistics

There are some examples of U-statistics or V-statistics for which the above theorems are applicable under the assumption of nuclearity of the kernel functions where the above theorems are applicable.
Example 3.
(Cram e ´ r-von Mises Statistics, Sato (1992) [8])
Assume that i.i.d. random variables ξ j , j 1 obey the uniform distribution U ( 0 , 1 ) , i.e., μ is the Lebesgue on 0 , 1 . Define a kernel function u ( x , y ) by
u x , y = 0 1 I x , 1 t t I y , 1 t t t 1 t d t , x , y 0 , 1
satisfies the hypothesis of Theorem 5 or Theorem 6. On the other hand, we have
0 1 u x , x d x = 0 1 d x 0 1 I x , 1 t t 2 t 1 t d t
= 0 1 d t t 1 t 0 1 I x , 1 t t 2 d x = 1 < .
Therefore, the integral operator T u defined by
T u f y = 0 1 u x , y f y d x
is nuclear from Theorem 3. Therefore, since the degenerate kernel u ( x , y ) defined by (70) satisfies all assumptions of Theorem 5, Theorem 5 holds for the Cram e ´ r-von Mises Statistics. Furthermore, Theorem 6 also holds for U-statistics with the degenerate kernel v ( x , y ) defined by (67) and (70).

8. Conclusions

Bentkus—Götze (1999) [4] and Zubayraev (2011) [5] obtained the remainder O ( n 1 ) in asymptotic expansions for U-statistics or V-statistics with degenerate kernels. From Theorems 5 and 6, if we assume E G 1 p , p 4 and some conditions, then we obtain the remainder O n 1 p p 2 2 . Applying Theorem 5, we obtain asymptotic expansions for the Cram e ´ r–von Mises statistics of the uniform distribution U ( 0 , 1 ) with the remainder O n 1 p / 2 for any p 4 .

Funding

Grant-in-Aid Scientific Research (C), No.18K03431, Ministry of Education, Science and Culture, Japan.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The author uses no data.

Acknowledgments

The author would like to express his gratitude to the anonymous referees for their useful comments. He also would like to express his gratitude to V.V. Ulyanov for giving the opportunity to present this work.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Callaert, H.; Janssen, P.; Veraverbeke, N. An edgeworth expansion for U-statistics. Ann. Stat. 1980, 8, 299–312. [Google Scholar] [CrossRef]
  2. Withers, C.S. Some asymptotics for U-statistics. Commun. Stat. Theory Methods 1988, 17, 3269–3276. [Google Scholar] [CrossRef]
  3. Maesono, Y. Asymptotic representations of skewness estimators of studentized U-statistics. Bull. Inform. Cybern. 2004, 36, 91–104. [Google Scholar] [CrossRef]
  4. Bentkus, V.; Götze, F. Optimal bounds in non-Gaussian limit theorems for U-statistics. Ann. Probab. 1999, 27, 454–521. [Google Scholar] [CrossRef]
  5. Zubayraev, T. Asymptotic analysis for U-statistics and its application to Von Mises statistics. Open J. Stat. 2011, 1, 139–144. [Google Scholar] [CrossRef] [Green Version]
  6. Sazonov, V.V.; Ul’yanov, V.V. Asymptotic expansions for the probability that the sum of independent random variables hits a ball in a Hilbert space. Russ. Math. Surv. 1995, 50, 1045–1063. [Google Scholar] [CrossRef]
  7. Serfling, R.J. Approximation Theorems of Mathematical Statistics; John Wiley & Sons: Hoboken, NJ, USA, 1980. [Google Scholar] [CrossRef]
  8. Sato, H. Nuclearity of a nonnegative definite integral kernel on a separable metric space. J. Theor. Probab. 1992, 5, 349–353. [Google Scholar] [CrossRef]
  9. Kanagawa, S.; Yoshihara, K. The almost sure invariance principles of degenerate U-statistics of degree two for stationary random variables. Stoch. Process. Their Appl. 1994, 49, 347–356. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kanagawa, S. Asymptotic Expansions for Symmetric Statistics with Degenerate Kernels. Mathematics 2022, 10, 4158. https://doi.org/10.3390/math10214158

AMA Style

Kanagawa S. Asymptotic Expansions for Symmetric Statistics with Degenerate Kernels. Mathematics. 2022; 10(21):4158. https://doi.org/10.3390/math10214158

Chicago/Turabian Style

Kanagawa, Shuya. 2022. "Asymptotic Expansions for Symmetric Statistics with Degenerate Kernels" Mathematics 10, no. 21: 4158. https://doi.org/10.3390/math10214158

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop