Next Article in Journal
The Effect of Jittered Stimulus Onset Interval on Electrophysiological Markers of Attention in a Brain–Computer Interface Rapid Serial Visual Presentation Paradigm
Previous Article in Journal
Automatic Detection of Electrodermal Activity Events during Sleep
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix

Independent Researcher, Austin, TX 78758, USA
Current address: 3001 Esperanza Xing, Austin, TX 787858, USA.
Signals 2024, 5(1), 1-17; https://doi.org/10.3390/signals5010001
Submission received: 21 September 2023 / Revised: 12 November 2023 / Accepted: 28 December 2023 / Published: 4 January 2024

Abstract

:
Banding the inverse of a covariance matrix has become a popular technique for estimating a covariance matrix from a limited number of samples. It is of interest to provide criteria to determine if a matrix is bandable, as well as to test the bandedness of a matrix. In this paper, we pose the bandedness testing problem as a hypothesis testing task in statistical signal processing. We then derive two detectors, namely the complex Rao test and the complex Wald test, to test the bandedness of a Cholesky-factor matrix of a covariance matrix’s inverse. Furthermore, in many signal processing fields, such as radar and communications, the covariance matrix and its parameters are often complex-valued; thus, it is of interest to focus on complex-valued cases. The first detector is based on the complex parameter Rao test theorem. It does not require the maximum likelihood estimates of unknown parameters under the alternative hypothesis. We also develop the complex parameter Wald test theorem for general cases and derive the complex Wald test statistic for the bandedness testing problem. Numerical examples and computer simulations are given to evaluate and compare the two detectors’ performance. In addition, we show that the two detectors and the generalized likelihood ratio test are equivalent for the important complex Gaussian linear models and provide an analysis of the root cause of the equivalence.

1. Introduction

In statistical signal processing applications, such as radar and communications, the sample covariance matrix plays an essential role [1]. It is usually estimated from N sample data vectors [ x 0 x 1 , x N 1 ] , where x n is assumed to be L × 1 identical and independently distributed (IID). The maximum likelihood covariance matrix estimate is [2]
C ^ = 1 N n = 0 N 1 x n x n H ,
where H denotes a Hermitian. A good covariance matrix estimate usually requires the number of samples N to be sufficiently large. For instance, in space–time adaptive processing (STAP), it requires N 2 L to have a good clutter covariance matrix estimate of size L × L  [3]. In practice, however, this is not valid due to the nonstationary environment. For example, the data for a STAP system are often nonstationary due to the heterogeneous clutter [1]. The number of data that are sufficiently IID (homogeneous) can be relatively small N L [3].
Techniques such as thresholding and banding are common ways to achieve better covariance matrix estimation. The thresholding method sets small elements of the sample covariance matrix to zero to obtain better estimators [4,5]. Another approach is to band or taper the sample covariance matrix [6,7]. Rothman et al. [8] proposed a Cholesky-based covariance regularization method to ensure positive definiteness. In practice, the inverse covariance matrix may be of primary interest. When the data are multivariate Gaussian, the inverse of the covariance matrix can be used to infer the conditional dependence structure of random variables [9].
Several researchers have investigated different methods for banding the inverse of the covariance matrix. Wu et al. proposed estimating the covariance matrix by banding the Cholesky-factor matrix and applying kernel smoothing estimation [10]. Bickel demonstrated that within the bandable class of covariance matrices, the estimator C ^ 1 obtained by banding the Cholesky-factor matrix of the covariance matrix’s inverse is consistent [4,6]. Qian et al. explored adaptive banding covariance estimation for high-dimensional multivariate longitudinal data [11]. However, not much work is available to provide a criterion for deciding if a covariance matrix is bandable. Such a criterion would be useful for deciding if the banding technique is a suitable strategy in covariance matrix estimation tasks. Moreover, other covariance estimation methods, such as modeling the covariance matrix as a time-varying autoregressive moving average (ARMA) model [12], also require one to test if the model has a good fit, which is similar to but distinct from model order estimation techniques such as the minimum description length (MDL), AIC, BIC, and Bayesian exponential embedded family [13]. Some recent tests for bandedness can be found in [14], where a method for estimating matrix bandwidth is presented. Peng et al. developed several tests for sparse high-dimensional covariance matrices [15]. In [9], An et al. proposed test statistics for detecting band size and applied them to cancer data analysis. In contrast to these works, we pose the problem as a classical parameter hypothesis testing problem, which allows us to employ well-established detection theorems and algorithms in statistical signal processing.
In many practical fields, such as radar and communications, the data and parameters are often complex-valued [16]; therefore, we consider the bandedness testing problem for a complex-valued covariance matrix herein. Some general topics related to complex-valued signal processing can be found in [17,18]. In [19], Kay and Zhu derived the complex parameter Rao test, which allows one to develop a Rao test for complex parameters in a complex-valued domain directly. Based on [19], Sun et al. extended the complex parameter Rao test to include the case of nuisance parameters and also derived the Wald, Gradient, and Durbin tests for complex-valued parameters in a recently published paper [20]. In the present paper, we also derive the complex parameter Wald test as a parallel task and apply it to the problem of testing the bandedness of a covariance matrix.
The Rao test and Wald test are asymptotically optimal detectors for large data records. The complex parameter Rao test requires a lower computational cost than some other detectors, i.e., the generalized likelihood ratio test (GLRT) and Wald test. This is because it does not require the MLEs of unknown parameters under the alternative hypothesis H 1 . This property can be desirable in high-dimensional multivariate signal processing [19], as low latency is a key performance indicator in such systems. The Rao test strikes a good balance between performance and computational cost. The complex parameter Rao test proposed by Kay and Zhu has been applied to multiple problems of radar and communication signal processing [19].
The Wald test is another very useful detector in addition to the Rao test. It is useful in radar target detection tasks, including but not limited to the detection of point targets, extended targets, and multiple-input/multiple-output radar targets in homogeneous, partially homogeneous, and heterogeneous environments [21,22]. In general, it is an equivalent large-data-record test that has the same asymptotically optimal detection performance as the GLRT and the Rao test. For finite-data records, however, it is not guaranteed to have the same performance as the GLRT [23,24,25,26]. In some cases, compared to the GLRT, the Wald test might be more robust when a mismatch exists and may have a lower computational complexity [22]. An example of its application can be found in adaptive detection for frequency diverse array multiple-input/multiple-output radar [27].
This paper is organized as follows: Section 2 formulates the problem of testing the bandedness of a covariance matrix; Section 3.1 derives the complex parameter Rao test detector for testing the bandedness of the Cholesky-factor matrix; and in Section 3.2, we derive the general complex Wald test for the complex-valued parameter hypothesis testing problem. In Section 3.2.3, the complex Wald test for the bandedness testing problem is derived. Examples and computer simulations for evaluating the Rao and Wald detector’s performance are given in Section 4. In addition, the equivalence between complex Wald and Rao tests for the ubiquitous complex Gaussian linear models is proved and analyzed in Section 4.2. Finally, conclusions are drawn in Section 5.

2. Problem Formulation

Assume that we have N IID observed data vectors, X = [ x 0 T x 1 T x N 1 T ] T , where T denotes transpose and each x n is an L × 1 complex-valued data vector conforming to a zero-mean multivariate complex Gaussian distribution x n CN ( 0 , C ) for n = 0 , 1 , , N 1 , and the x n s are mutually independent. In addition, we assume N L . The L × L covariance matrix C is a Hermitian matrix, so its inverse can be decomposed via the Cholesky decomposition as
C 1 = D H D ,
where D is a lower triangular L × L matrix. And it has a testing model as follows.
D = D B + k = 1 M b k Φ k ,
where D B is a known banded lower triangular matrix, with a bandwidth of m, b k ’s are unknown complex-valued parameters, and Φ k s are known basis matrices.
Specifically,
b 1 = [ D ] m + 2 , 1 , Φ 1 = e m + 2 e 1 T b 2 = [ D ] m + 3 , 2 , Φ 2 = e m + 3 e 2 T b N m 1 = [ D ] N , N m 1 , Φ N m 1 = e N e N m 1 T b N m = [ D ] m + 3 , 1 , Φ N m = e m + 3 e 1 T b N m + 1 = [ D ] m + 4 , 2 , Φ N m + 1 = e m + 4 e 2 T b M = [ D ] N , 1 , Φ M = e N e 1 T
where M = ( N m 1 ) ( N m ) 2 and e k is an L × 1 vector with its k t h element being one and all other elements being zeros. The objective is to test whether the lower triangular Cholesky factor matrix D is equal to the banded lower triangular matrix D B . Let b = [ b 1 b 2 b M ] T , then the detection problem is equivalent to choosing between the following two hypotheses:
H 0 : b = 0 ; H 1 : b 0 ;

3. Methods

In this section, we derive the complex Rao test and the complex Wald test for the hypothesis testing problem stated above.

3.1. The Complex Rao Test for Testing the Bandedness

The Rao test attains asymptotic (as N ) performance as the GLRT, yet it circumvents the necessity of MLEs under the alternative hypothesis H 1 . As a result, so its computation cost can be lower than that of the GLRT, offering a desirable property in high-dimensional signal processing, including real-time STAP. Subsequently, we proceed by applying the complex Rao test theorem introduced in [19] to derive the Rao test statistic. Let b * = [ b 1 * b 2 * b M * ] T , where ∗ denotes conjugate, and b ̲ = [ b T b H ] T , which is a 2 M × 1 complex-valued parameter vector. The complex parameter Rao test detector can be formed according to [19]
T R ( X ) = ln p ( X ; b ̲ ) b ̲ * b = 0 H I 1 ( b ̲ ) b = 0 ln p ( X ; b ̲ ) b ̲ * b = 0
where,
ln p ( X ; b ̲ ) b ̲ = ln p ( X ; b ̲ ) b T ln p ( X ; b ) b * T T ,
ln p ( X ; b ̲ ) b = ln p ( X ; b ̲ ) b 1 ln p ( X ; b ̲ ) b 2 ln p ( X ; b ̲ ) b M T ,
ln p ( X ; b ̲ ) b * = ln p ( X ; b ̲ ) b 1 * ln p ( X ; b ̲ ) b 2 * ln p ( X ; b ̲ ) b M * T ,
are based on Wirtinger derivatives. We can find each element, ln p ( X ; b ) b k , as follows. First.
p ( X ; b ̲ ) = n = 0 N 1 p ( x n ; b ̲ ) = n = 0 N 1 1 π L det ( C ) exp ( x n H C 1 x n ) = 1 π N L n = 0 N 1 det ( C ) exp ( n = 0 N 1 x n H C 1 x n ) = 1 π N L exp ( n = 0 N 1 x n H D H D x n ) n = 0 N 1 det ( D H D ) .
Then,
ln p ( X ; b ̲ ) = ln ( 1 π N L ) n = 0 N 1 x n H D H D x n + N ln det ( D H D ) ,
and
ln p ( X ; b ̲ ) b k = N ln det ( D H D ) b k n = 0 N 1 x n H D H D x n b k = N ln det ( D H D ) b k n = 0 N 1 tr ( D x n x n H D H ) b k ,
for k = 1 , 2 , , M , where
ln det ( D H D ) b k = tr ( D 1 Φ k ) ,
and
tr ( D x n x n H D H ) b k = tr ( x n x n H D H Φ k ) .
Thus,
ln p ( X ; b ̲ ) b k = N tr ( D 1 Φ k ) n = 0 N 1 tr ( x n x n H D H Φ k ) ,
Under H 0 , where b = 0 ,
ln p ( X ; b ̲ ) b k b ̲ = 0 = N tr ( D B 1 Φ k ) n = 0 N 1 tr ( x n x n H D B H Φ k )
Also, we have
ln p ( X ; b ̲ ) b k * = N tr ( D H Φ k H ) n = 0 N 1 tr ( x n x n H D Φ k H ) ,
and its value under H 0
ln p ( X ; b ̲ ) b k * b = 0 = N tr ( D B H Φ k H ) n = 0 N 1 tr ( x n x n H D B Φ k H )
We next compute I ( b ̲ ) .
I ( b ̲ ) = E ln p ( X ; b ̲ ) b ̲ * ln p ( X ; b ̲ ) H b ̲ * = A B * B A * = M × M M × M M × M M × M
where,
A = E ln p ( X ; b ̲ ) b * ln p ( X ; b ̲ ) H b * B = E ln p ( X ; b ̲ ) b ln p ( X ; b ̲ ) T b
For each element [ A ] k , l and [ B ] k , l for 1 k , l M , we can compute as follows,
A k , l = E 2 ln p ( X ; b ̲ ) b k * b l = E n = 0 N 1 tr ( Φ l x n x n H Φ k H ) = N tr ( Φ l D 1 D H Φ k H )
Under H 0 , where b = 0 , we have
A k , l b = 0 = N tr ( Φ l D B 1 D B H Φ k H )
In a similar fashion, we have
B k , l = E 2 ln p ( X ; b ̲ ) b k b l = N tr ( D 1 Φ l D 1 Φ k )
and its value under H 0 can be found as follows
B k , l b = 0 = N tr ( D B 1 Φ l D B 1 Φ k )
Substituting Equations (16), (18), (19), (22) and (24) in the complex parameter Rao test Equation (6) yields the complex Rao test statistic. For each unknown parameter, it necessitates two N × N matrix multiplications along with an inversion operation involving the Fisher Information Matrix (FIM). The computational complexity scales approximately in proportion to the number of parameters under scrutiny. Specifically, if there exist M unknown parameters, the computational load increases by a factor of M. Although opportunities for further optimization to mitigate computational demands may exist, exploring such optimizations lies beyond the scope of this paper.
When the detection problem has M unknown parameters, the complex Rao test statistic under the null hypothesis H 0 can be shown to have a chi-squared distribution with M degrees of freedom [28].

3.2. Complex Parameter Wald Test

In this section, we present a novel detection theorem referred to as the Complex Wald test, which is developed in parallel with [20], addressing the general hypothesis testing problem involving complex-valued parameters. Traditionally, this approach required concatenating the real and imaginary parts of the complex-valued data to create an augmented real vector for conducting Wald test computations in the real-valued domain. However, the derived Complex Wald test enables direct computation with complex-valued quantities. Moreover, when the Fisher Information Matrix (FIM) of the unknown parameters exhibits a specific structure, the Complex Wald Test simplifies to a more streamlined form.
Suppose x = [ u T v T ] T with x R 2 N × 1 , u R N × 1 and v R N × 1 , which is formed from the observed complex-valued vector x ˜ = u + j v , where x ˜ C N × 1 . Let ξ = [ α T β T ] T , formed from the unknown complex-valued parameter vector θ ˜ = α + j β , where ξ R 2 p × 1 , α R p × 1 , β R p × 1 and θ ˜ C p × 1 . We denote the probability density function (PDF) of the data as p x ( x ; ξ ) . Then, we have the PDF p x ˜ ( x ˜ ; θ ˜ ) p x ( x ; ξ ) p u , v ( u , v ; α , β ) . The real Wald test without nuisance parameters (parameters that are unknown yet of no interest) [28] is
T W ( x ) = ( ξ ξ 0 ) T I ( ξ ) ( ξ ξ 0 ) | ξ = ξ ^ 1 ,
where ξ ^ 1 is the MLE of ξ under H 1 , and I ( ξ ) is Fisher Information Matrix (FIM) of ξ and can be partitioned as
I ( ξ ) = I α α I α β I β α I β β
where I α α , I β β , I β α , I α β R p × p I α α T = I α α , I β β T = I β β , and I α β T = I β α . We next derive the complex Wald test for the unknown parameter θ ˜ by carrying out mathematical operations with respect to complex-valued quantities.

3.2.1. Complex Wald Test for General Cases

Observe that p x ˜ ( x ˜ ; θ ˜ ) is a real function of θ ˜ , depending on θ ˜ and θ ˜ * , where ∗ denotes complex conjugate. Denote p x ˜ ( x ˜ ; θ ˜ ) as p x ˜ ( x ˜ ; Θ ˜ ) p x ˜ ( x ˜ ; θ ˜ , θ ˜ * ) , where Θ ˜ = θ ˜ T θ ˜ H T , Θ ˜ C 2 p × 1 , and ( · ) H represents Hermitian. Also, the complex partial derivatives of a real scalar function g ( z ˜ , z ˜ * ) f ( x , y ) are
g ( z ˜ , z ˜ * ) z ˜ = 1 2 ( f x j f y )
and
g ( z ˜ , z ˜ * ) z ˜ * = 1 2 ( f x + j f y ) ,
where z ˜ = x + j y . And, for a real function g of complex vectors z ˜ and z ˜ * .
g ( z ˜ , z ˜ * ) z ˜ i = g ( z ˜ , z ˜ * ) z ˜ i
g ( z ˜ , z ˜ * ) z ˜ * i = g ( z ˜ , z ˜ * ) z ˜ i * .
With these definitions, we have
Theorem 1.
Complex Wald Test
T W ˜ ( x ˜ ) = ( Θ ˜ Θ ˜ 0 ) H I ˜ ( Θ ˜ ) ( Θ ˜ Θ ˜ 0 ) | Θ ˜ = Θ ˜ ^ 1 = T W ( x ) ,
where
I ˜ ( Θ ˜ ) = E ln p x ˜ ( x ˜ ; Θ ˜ ) Θ ˜ * ln p x ˜ ( x ˜ ; Θ ˜ ) Θ ˜ * H ,
T W ˜ ( x ˜ ) is a complex Wald test statistic, and T W ( x ) is real Wald test statistic. Note that Θ ˜ ^ 1 is the MLE of Θ ˜ under H 1 , and Θ ˜ ^ 1 = θ ˜ ^ 1 θ ˜ ^ 1 * , and θ ˜ ^ 1 is the MLE of θ ˜ under H 1 ; Θ ˜ 0 = θ ˜ 0 θ ˜ 0 * , and θ ˜ 0 is the true value of the unknown parameter under H 0 . Note that I ˜ ( Θ ˜ ) can be used as a FIM for an unbiased estimation of Θ ˜ . Also, I ˜ ( Θ ˜ ) is a 2 p × 2 p complex hermitian matrix. Hence, T W ˜ ( x ˜ ) is real.
Note that no assumption has been imposed on the form of I ( ξ ) .
Proof. 
Let
T = 1 2 I p j 2 I p 1 2 I p j 2 I p
where I p is a p × p identity matrix. Then,
I ˜ ( Θ ˜ ) = T I ( ξ ) T H
Hence, we have
T W ˜ ( x ˜ ) = Θ ˜ Θ ˜ 0 H T I ( ξ ) T H ( Θ ˜ Θ ˜ 0 ) = I p j I p I p j I p ( ξ ξ 0 ) H T I ( ξ ) T H I p j I p I p j I p ( ξ ξ 0 ) = 2 T ( ξ ξ 0 ) H T I ( ξ ) T H 2 T ( ξ ξ 0 ) = ( ξ ξ 0 ) T 2 T H T I ( ξ ) 2 T H T ( ξ ξ 0 ) = ( ξ ξ 0 ) T I ( ξ ) ( ξ ξ 0 ) = T W ( x )

3.2.2. Complex Wald Test for Special Fisher Information Matrix

Subsequently, we examine a commonly encountered special form of the FIM in practical applications.
Theorem 2.
If the real FIM given by (26) has the special form
I ( ξ ) = 2 E F F E ,
then
T W ( x ) = T W ˜ ( x ˜ ) = 2 ( θ ˜ θ ˜ 0 ) H I ˜ ( θ ˜ ) ( θ ˜ θ ˜ 0 ) | θ ˜ = θ ˜ ^ 1 ,
where
I ˜ ( θ ˜ ) = E ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * H .
Note that I ˜ ( θ ˜ ) is hermitian and hence the expression is a real number.
Proof. 
When the real FIM attains the special form, we have
I ˜ ( Θ ˜ ) = I ˜ ( θ ˜ ) 0 0 I ˜ * ( θ ˜ ) .
And with (31), we have
T W ˜ ( x ˜ ) = θ ˜ θ ˜ 0 θ ˜ * θ ˜ 0 * H I ˜ ( θ ˜ ) 0 0 I ˜ ( θ ˜ ) θ ˜ θ ˜ 0 θ ˜ * θ ˜ 0 * = 2 R e { ( θ ˜ θ ˜ 0 ) H I ˜ ( θ ˜ ) ( θ ˜ θ ˜ 0 ) } ,
Also, note that ( θ ˜ θ ˜ 0 ) H I ˜ ( θ ˜ ) ( θ ˜ θ ˜ 0 ) is real. Therefore,
T W ˜ ( x ˜ ) = 2 ( θ ˜ θ ˜ 0 ) H I ˜ ( θ ˜ ) ( θ ˜ θ ˜ 0 ) = T W ( x )

3.2.3. The Complex Wald Test for Testing Bandedness

This section delves into deriving the complex Wald test statistic for the aforementioned problem of testing bandedness. Recall that
ln p ( X ; b ̲ ) = ln ( 1 π N L ) n = 0 N 1 x n H D H D x n + N ln det ( D H D ) ,
and
ln p ( X ; b ̲ ) b k = N ln det ( D H D ) b k n = 0 N 1 x n H D H D x n b k = N ln det ( D H D ) b k n = 0 N 1 tr ( D x n x n H D H ) b k ,
for k = 1 , 2 , , M , where
ln det ( D H D ) b k = tr ( D 1 Φ k ) ,
and
tr ( D x n x n H D H ) b k = tr ( x n x n H D H Φ k ) .
We aim to determine the Maximum Likelihood Estimates (MLEs) of b k . Considering that b k are relatively small, we are interested in testing whether they are equal to zero, and we approximate D 1 as D B 1 .
ln p ( X ; b ̲ ) b k = N tr ( D 1 Φ k ) n = 0 N 1 tr ( x n x n H D H Φ k ) = N tr ( D b 1 Φ k ) n = 0 N 1 tr ( x n x n H ( D b + i = 1 M b i Φ i ) H Φ k ) = N tr ( D b 1 Φ k ) n = 0 N 1 tr ( x n x n H D b H Φ k ) b k * n = 0 N 1 tr ( x n x n H Φ k H Φ k )
therefore
b k * ^ = N tr ( D b 1 Φ k ) n = 0 N 1 tr ( x n x n H D b H Φ k ) n = 0 N 1 tr ( x n x n H Φ k H Φ k )
Up to this point, we have obtained the MLEs of unknown parameters under the alternative hypothesis. Computation of these MLEs constitutes extra computational cost in the Wald test compared to the Rao test. To put it simply, for each additional unknown parameter, the Wald test requires roughly three times the workload of N × N matrix multiplications in contrast to the Rao test.
Upon substituting the MLEs into both the complex FIM equation and the complex Wald test equation in (31), we derive the Wald test statistic. It is noteworthy that given the presence of the unknown parameter within the covariance matrix, the conditions required for the application of the specialized complex Wald test theorem, as delineated in [19], are not satisfied in this case due to the absence of the requisite special form in the FIM.

4. Simulations, Results and Discussion

4.1. Simulations and Result Discussion on Complex Rao and Wald Tests for Bandedness Testing

Consider an illustrative example, where we have the N = 4 observed data set X = [ x 0 T x 1 T x 2 T x 3 T ] T , each x n ’s is a 4 × 1 complex-valued IID Gaussian vector, x n CN ( 0 , C ) . Also, C 1 = D H D , and D = D B + b 1 Φ 1 with Φ 1 = e 4 e 1 T and
D B = 0.53 0 0 0 0.26 + 0.25 j 0.53 0 0 0.12 + 0.1 j 0.33 + 0.28 j 0.51 0 0 0.17 0.10 j 0.2 0.27 j 0.5
We are testing whether the Cholesky factor matrix D is banded and equal to the known D B . It is equivalent to testing between b 1 = 0 versus b 1 0 . The complex Rao test for this example can be shown to be (49)
T R ( X ) = Re { 4 tr ( D B 1 Φ 1 ) n = 0 3 tr ( x n x n H D B H Φ 1 ) 2 tr * ( D B 1 Φ 1 D B H Φ 1 H ) } 2 | tr ( Φ l D B 1 D B H Φ k H ) | 2 | tr ( D B 1 Φ l D B 1 Φ k ) | 2 Re { 4 tr ( D B 1 Φ 1 ) n = 0 3 tr ( x n x n H D B H Φ 1 ) 2 tr ( D B 1 Φ 1 D B 1 Φ 1 ) } 2 | tr ( Φ l D B 1 D B H Φ k H ) | 2 | t r ( D B 1 Φ l D B 1 Φ k ) | 2
The complex Wald test for this illustrative example can be obtained by using Equations (31) and (47).
To evaluate the performance of the complex Rao and Wald tests for this example, we set b 1 = 0.5 + 0.5 j under the alternative hypothesis H 1 .
All simulations were conducted using Matlab by MathWorks, Inc.,(Portola Valley, CA, USA). The Receiver Operating Characteristic (ROC) curves, depicting the relationship between the Probability of Detection ( P d ) and the Probability of False Alarm ( P f a ), were generated to characterize the performance of the detectors. The construction of the ROC curve involved the following steps: for each simulation setup, a substantial number of Monte Carlo simulation trials were executed. The test statistics were computed under both scenarios, H 1 and H 0 . Specifically, simulations were run 50,000 times, resulting in a vector of size 50,000 for the test statistic under H 1 , denoted as T 1 , and similarly, another vector of size 50,000 for the test statistic under H 0 , denoted as T 0 . Varying a threshold γ , any element in T 0 greater than γ signified a false alarm trial, while any element in T 1 greater than γ represented a correct detection trial. By systematically varying the threshold γ , a series of ( P f a , P d ) pairs were obtained, constituting the ROC curve. This curve, encapsulating the trade-off between P d and P f a , was generated as a vector of such pairs by varying the threshold γ . Figure 1 presents a high-level flowchart outlining the steps involved in the Monte Carlo simulations for generating the ROC curves for both the Wald and Rao tests.
The ROC curves of the derived complex Rao and Wald test is given in Figure 2.
The results demonstrate that both the complex Rao and Wald tests exhibit commendable performance when confronted with limited data records. Notably, the complex Wald test outperforms the complex Rao test, aligning with expectations given that the latter generally demonstrates suboptimal performance. This performance disparity is expected considering that the complex Rao test, while less computationally intensive, as it does not necessitate the MLEs of the unknown parameters under H 1 , inherently delivers slightly inferior performance.
A second illustrative example is presented, where b 1 = 0.3 + 0.3 j , representing a more challenging scenario compared to the earlier instance. Figure 3 showcases the performance of both proposed detectors in this setting. It is evident that in comparison to the prior example, the detectors’ performance declines due to the smaller magnitude of b 1 . Notably, the complex Wald test exhibits a slight edge over the complex Rao test in this more challenging scenario.
Next, we increase the number of IID samples from N = 4 to N = 10 and set the b 1 = 0.3 + 0.3 j . The comparative performance of the two detectors is depicted in Figure 4. A comparative analysis with Figure 3 reveals a conspicuous enhancement in the performance of both detectors, attributable to the increased availability of data samples. Notably, it is discernible that in this scenario, the performance of the two detectors converges significantly, indicating that the Complex Rao and Wald tests exhibit asymptotic equivalence as the dataset size grows.
Both the Complex Rao test statistic and Complex Wald test under the null hypothesis H 0 are chi-squared distributed with one degree of freedom, T R ( X ) χ 1 2 [28]. The performance of the Rao test and Wald test can be found asymptotically or as N . Estimated probability density function (PDFs), shown as bar plots, for both the Rao test and the Wald test, and the theoretical asymptotic χ 1 2 PDF are shown in Figure 5. Notably, even with a limited data record of N = 4 , the estimated PDFs remarkably align with the theoretical distribution.
The detectors’ performance is also dependent on the base matrix D B . Next, we double the magnitude of each element of D B ; that is,
D B = 1.06 0 0 0 0.52 + 0.5 j 1.06 0 0 0.24 + 0.2 j 0.66 + 0.56 j 1.02 0 0 0.34 0.20 j 0.4 0.54 j 1.0
and keep the rest of the setup unchanged with N = 4 and b 1 = 0.5 + 0.5 j . The two detectors’ performances can be found in Figure 6.
In comparison with Figure 2, it is evident that both detectors’ performances have degraded due to the larger base matrix. Next, we show the detectors’ performance change when the base matrix become smaller. We half each element of D B ; that is,
D B = 0.265 0 0 0 0.13 + 0.125 j 0.265 0 0 0.06 + 0.05 j 0.165 + 0.14 j 0.255 0 0 0.085 0.05 j 0.1 0.135 j 0.25
and keep the rest of the setup unchanged with N = 4 and b 1 = 0.5 + 0.5 j . The two detectors’ performances can be found in Figure 7. Compared with Figure 2, clearly both detectors’ performances have improved due to the smaller base matrix.

4.2. Equivalence among Complex Wald, Rao Test and GlRT for Linear Model

When the unknown parameter is present in the covariance matrix, the structure of the FIM does not attain the special form that allows one to use the Reduced Complex Wald Test and the Reduced Complex Rao Test [19]. One such case is the problem of testing the bandedness of the covariance matrix discussed above. As a counter example, in this section, we show the equivalence between the Complex Rao Test and the Complex Wald Test for an important case of general practical interest—the complex Gaussian linear model. A large number of signal processing, detection and estimation problems like radar signal processing and communications can be represented by a linear model, and hence it is of practical importance to discuss the topic.

4.2.1. Complex Classical Linear Model Testing Problem

First, we apply complex Wald test to the complex linear model problem. Assume the data are modeled according to [23]:
x ˜ = H ˜ θ ˜ + w ˜ ,
where H ˜ C N × p is a known matrix with N > p and full rank, θ ˜ is an unknown complex p × 1 parameter vector, and w ˜ is a complex N × 1 random vector with PDF w ˜ CN ( 0 , C ˜ ) , with C ˜ C N × N . The testing problem is equivalent to deciding between two hypotheses:
H 0 : θ ˜ = 0 H 1 : θ ˜ 0
Then, based on the properties of the complex Gaussian PDF
x ˜ CN ( μ ˜ , C ˜ )
so that μ ˜ = H ˜ θ ˜ and C ˜ ( θ ˜ ) = C ˜ (not dependent on θ ˜ or θ ˜ * ). The PDF is
p ( x ˜ ; θ ˜ ) = 1 π N det ( C ˜ ) exp [ ( x ˜ H ˜ θ ˜ ) H C ˜ 1 ( x ˜ H ˜ θ ˜ ) ] .
We have
ln p ( x ˜ ; θ ˜ ) θ ˜ * = ( x ˜ H ˜ θ ˜ ) H C ˜ 1 ( x ˜ H ˜ θ ˜ ) θ ˜ * = H ˜ H C ˜ 1 ( x ˜ H ˜ θ ˜ )
Therefore, the MLE of θ ˜ under H 1 is θ ˜ ^ 1 = ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 x ˜ . Also we have, as shown in [23],
I ˜ ( θ ˜ ) = H ˜ H C ˜ 1 H ˜ ,
and the real FIM has the special form
I 1 ( ξ ) = 1 2 Re { ( H ˜ H C ˜ 1 H ˜ ) 1 } Im { ( H ˜ H C ˜ 1 H ˜ ) 1 } Im { ( H ˜ H C ˜ 1 H ˜ ) 1 } Re { ( H ˜ H C ˜ 1 H ˜ ) 1 } .

4.2.2. Generalized Likelihood Ratio Test (GlRT)

The GLRT decides H 1 if
L G ( x ˜ ) = p ( x ˜ ; θ ˜ ^ 1 ) p ( x ˜ ; θ ˜ 0 = 0 ) > γ
We have
2 ln L G ( x ˜ ) = 2 ln p ( x ˜ ; θ ˜ ^ 1 ) p ( x ˜ ; θ ˜ 0 = 0 ) = 2 x ˜ H C ˜ 1 x ˜ ( x ˜ H ˜ θ ˜ ^ 1 ) H C ˜ 1 ( x ˜ H ˜ θ ˜ ^ 1 ) = 2 ( x ˜ H C ˜ 1 x ˜ x ˜ H C ˜ 1 x ˜ + x ˜ H C ˜ 1 H ˜ ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 x ˜ + x ˜ H C ˜ 1 H ˜ ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 x ˜ x ˜ H C ˜ 1 H ˜ ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 H ˜ ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 x ˜ ) = 2 x ˜ H C ˜ 1 H ˜ ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 x ˜

4.2.3. Complex Rao Test

The complex Rao test is [19]
T R ˜ ( x ˜ ) = 2 ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * H I ˜ 1 ( θ ˜ ) ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * | θ ˜ = θ ˜ 0 = 2 [ H ˜ H C ˜ 1 ( x ˜ H ˜ θ ˜ ) ] H ( H ˜ H C ˜ 1 H ˜ ) 1 [ H ˜ H C ˜ 1 ( x ˜ H ˜ θ ˜ ) ] | θ ˜ = 0 = 2 x ˜ H C ˜ 1 H ˜ ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 x ˜

4.2.4. Complex Wald Test

The real FIM of this problem has a special form, so Theorem 2 applies. The complex Wald test is
T W ˜ ( x ˜ ) = 2 ( θ ˜ θ ˜ 0 ) H I ˜ ( θ ˜ ) ( θ ˜ θ ˜ 0 ) | θ ˜ = θ ˜ ^ 1 = 2 [ ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 x ˜ ] H ( H ˜ H C ˜ 1 H ˜ ) ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 x ˜ = 2 x ˜ H C ˜ 1 H ˜ ( H ˜ H C ˜ 1 H ˜ ) 1 H ˜ H C ˜ 1 x ˜
Compared with the results obtained by using GLRT and the complex Rao Test for the same problem, all three detectors are equivalent in this case. Next, we show the root cause of this equivalence.

4.2.5. The Root Cause of the Equivalence

First,
ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * = I ˜ ( θ ˜ ) ( θ ˜ ^ θ ˜ )
where θ ˜ is the true value. Because the complex FIM is not dependent on the true value of the unknown parameters, I ˜ ( θ ˜ ) = I ˜ ( θ ˜ ^ 1 ) , we have
ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * = I ˜ ( θ ˜ ^ 1 ) ( θ ˜ ^ θ ˜ )
So, by integrating with respect to θ ˜ * , it produces
ln p x ˜ ( x ˜ ; θ ˜ ) = ( θ ˜ ^ θ ˜ ) H I ˜ ( θ ˜ ^ ) ( θ ˜ ^ θ ˜ ) + c ( θ ˜ ^ )
since the constant of integration must be c ( θ ˜ ^ ) = ln p x ˜ ( x ˜ ; θ ˜ ^ ) .
The GLRT becomes
L G ( x ˜ ) = p ( x ˜ ; θ ˜ ^ 1 ) p ( x ˜ ; θ ˜ 0 ) = p ( x ˜ ; θ ˜ ^ 1 ) p ( x ˜ ; θ ˜ ^ 1 ) exp ( θ ˜ ^ 1 θ ˜ 0 ) H I ˜ ( θ ˜ ^ 1 ) ( θ ˜ ^ 1 θ ˜ 0 )
Thus,
2 ln L G ( x ˜ ) = 2 ( θ ˜ ^ 1 θ ˜ 0 ) H I ˜ ( θ ˜ ^ 1 ) ( θ ˜ ^ 1 θ ˜ 0 )
This explains why the GLRT and the complex Wald test coincide for the complex linear model. Furthermore, to establish the equivalence between the complex Rao test and the Generalized Likelihood Ratio Test (GLRT), we use Equation (56)
( θ ˜ ^ θ ˜ ) = I ˜ 1 ( θ ˜ ) ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ *
Specially,
( θ ˜ ^ 1 θ ˜ 0 ) = I ˜ 1 ( θ ˜ 0 ) ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * | θ ˜ = θ ˜ 0
By substituting (62) to (60), we have
2 ln L G ( x ˜ ) = 2 I ˜ 1 ( θ ˜ 0 ) ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * | θ ˜ = θ ˜ 0 H I ˜ ( θ ˜ ^ 1 ) I ˜ 1 ( θ ˜ 0 ) ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * | θ ˜ = θ ˜ 0 = 2 ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * | θ ˜ = θ ˜ 0 H I ˜ 1 ( θ ˜ 0 ) I ˜ ( θ ˜ ^ 1 ) I ˜ 1 ( θ ˜ 0 ) ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * | θ ˜ = θ ˜ 0 = 2 ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * | θ ˜ = θ ˜ 0 H I ˜ 1 ( θ ˜ 0 ) ln p x ˜ ( x ˜ ; θ ˜ ) θ ˜ * | θ ˜ = θ ˜ 0
where we have used the property of I ˜ ( θ ˜ ) = I ˜ ( θ ˜ 0 ) = I ˜ ( θ ˜ ^ 1 ) . This completes the proof that the complex Rao test and the GLRT are equivalent for the complex linear model. Consequently, it also proves the equivalence among the aforementioned three detectors for the complex linear model. In summary, the equivalence stems from the fact that the complex FIM of the linear model I ˜ ( θ ˜ ) = H ˜ H C ˜ 1 H ˜ does not depend on the true value of θ ˜ . In particular, we have I ˜ ( θ ˜ ) = I ˜ ( θ ˜ 0 ) = I ˜ ( θ ˜ ^ 1 ) , and hence the complex Rao test and Wald test are both equivalent to the GLRT without extra constraints.

5. Conclusions

The utilization of banding techniques has gained prominence in the estimation of covariance matrices, particularly in scenarios with limited sample sizes within high-dimensional signal processing. Before adopting such techniques, assessing the matrix’s ‘bandability’ becomes crucial. To this end, we have derived both the complex Rao test and the complex Wald test, specifically tailored for evaluating the bandedness of the Cholesky factor matrix within the inverse of the covariance matrix. The computational cost of the Rao test is comparatively lower, while implementing the complex Wald test demands obtaining maximum likelihood estimates under the alternative hypothesis. Consequently, the latter proofs more challenging to derive and incur higher computational expenses. We present examples and simulations to assess the performance of these proposed detectors. In our evaluations, the Wald test exhibits slightly superior performance in cases with smaller ‘signal’ magnitudes and significantly outperforms the complex parameter Rao test as the tested parameter grows larger. However, as the sample size increases, the performance gap between the two detectors diminishes. Notably, both detectors demonstrate asymptotic optimality with a substantial volume of available data.These derived detectors can serve as a preparatory step before implementing banding techniques for covariance matrix estimation. Furthermore, they extend applicability beyond bandedness assessment by enabling tests for zero elements within a matrix, achieved by appropriately modifying the basis matrix Φ k . Moreover, our investigation reveals the equivalence between the complex Rao test, Wald test, and GLRT within the general complex Gaussian linear model, shedding light on the underlying mechanisms of this equivalence. In our forthcoming research, we aim to delve deeper into the computational costs associated with these two detectors.

Funding

This research received no external funding. This work was partially presented at the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Melvin, W.L. A STAP Overview. IEEE AES Syst. Mag.—Spec. Tutor. Issue 2004, 19, 19–35. [Google Scholar] [CrossRef]
  2. Anderson, T. An Introduction to Multivariate Statistical Analysis, 3rd ed.; Wiley: Hoboken, NJ, USA, 2003. [Google Scholar]
  3. Melvin, W.L.; Showman, G.A. An Approach to Knowledge-Aided Covariance Estimation. IEEE Trans. Aerosp. Electron. Syst. 2006, 42, 1021–1042. [Google Scholar] [CrossRef]
  4. Bickel, P.J.; Levina, E. Covariance regularization by thresholding. Ann. Statist. 2008, 36, 2577–2604. [Google Scholar] [CrossRef] [PubMed]
  5. Rothman, A.; Levina, L.; Zhu, J. Generalized thresholding of large covariance matrices. J. Am. Statist. Assoc. 2009, 104, 177–186. [Google Scholar] [CrossRef]
  6. Bickel, P.J.; Levina, E. Regularized estimation of large covariance matrices. Ann. Statist. 2008, 36, 199–227. [Google Scholar] [CrossRef]
  7. Cai, T.T.; Zhang, C.; Zhou, H. Optimal rates of convergence for covariance matrix estimation. Ann. Statist. 2010, 38, 2118–2144. [Google Scholar] [CrossRef]
  8. Rothman, A.; Levina, L.; Zhu, J. A new approach to Cholesky-based covariance regularization in high dimensions. Biometrika 2010, 97, 539–550. [Google Scholar] [CrossRef]
  9. An, B.; Guo, J.; Liu, Y. Hypothesis testing for band size detection of high-dimensional banded precision matrices. Biometrika 2014, 101, 477–483. [Google Scholar] [CrossRef]
  10. Wu, W.B.; Pourahmadi, M. Nonparametric estimation of large covariance matrices of longitudinal data. Biometrika 2003, 90, 831–844. [Google Scholar] [CrossRef]
  11. Qian, F.; Zhang, W.; Chen, Y. Adaptive banding covariance estimation for high-dimensional multivariate longitudinal data. Can. J. Stat. 2021, 49, 906–938. [Google Scholar] [CrossRef]
  12. Wiesel, A.; Bibi, O.; Globerson, A. Time varying autoregressive moving average models for covariance estimation. IEEE Trans. Signal Process. 2013, 61, 2791–2801. [Google Scholar] [CrossRef]
  13. Zhu, Z.; Kay, S. On Bayesian Exponentially Embedded Family for model order selection. IEEE Trans. Signal Process. 2018, 66, 933–943. [Google Scholar] [CrossRef]
  14. Qiu, Y.-M.; Chen, S.X. Test for bandedness of high dimensional covariance matrices with bandwidth estimation. Ann. Stat. 2002, 40, 1285–1314. [Google Scholar] [CrossRef]
  15. Peng, L.; Chen, S.X.; Zhou, W. More powerful tests for sparse high-dimensional covariances matrices. J. Multivar. Anal. 2016, 149, 124–143. [Google Scholar] [CrossRef]
  16. Zhu, Z.; Kay, S.; Raghavan, R.S. Information-theoretic optimal radar waveform design. IEEE Signal Process. Lett. 2017, 24, 274–278. [Google Scholar] [CrossRef]
  17. Schreier, P.J.; Scharf, L.L. Statistical Signal Processing of Complex-Valued Data: The Theory of Improper and Noncircular Signals; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar]
  18. Adali, T.; Schreier, P.J.; Scharf, L.L. Complex-valued signal processing: The proper way to deal with impropriety. IEEE Trans. Signal Process. 2011, 59, 5101–5125. [Google Scholar] [CrossRef]
  19. Kay, S.; Zhu, Z. The complex parameter Rao test. IEEE Trans. Signal Process. 2016, 94, 6580–6588. [Google Scholar] [CrossRef]
  20. Sun, M.; Liu, W.; Liu, J.; Hao, C. Complex parameter Rao, Wald, Gradient, and Durbin Tests for multichannel signal detection. IEEE Trans. Signal Process. 2021, 70, 117–131. [Google Scholar] [CrossRef]
  21. Liu, W.; Xie, W.; Wang, Y. Rao and Wald tests for distributed targets detection with unknown signal steering. IEEE Signal Process. Lett. 2013, 20, 1086–1089. [Google Scholar]
  22. De Maio, A.; Iommelli, S. Coincidence of the Rao Test, Wald Test, and GLRT in partially homogeneous environment. IEEE Signal Process. Lett. 2008, 15, 385–388. [Google Scholar] [CrossRef]
  23. Kay, S. Fundamentals of Statistical Signal Processing: Estimation Theory; Prentice-Hall: Englewood Cliffs, NJ, USA, 1993. [Google Scholar]
  24. Liu, W.; Wang, Y.; Xie, W. Fisher information matrix, Rao test, and Wald test for complex-valued signals and their applications. Signal Process. 2014, 94, 1–5. [Google Scholar] [CrossRef]
  25. Zhu, Z.; Kay, S.; Cogun, F.; Raghavan, R.S. On detection of nonstationarity in radar signal processing. In Proceedings of the 2016 IEEE Radar Conference (RadarConf), Philadelphia, PA, USA, 2–6 May 2016; pp. 1–4. [Google Scholar]
  26. Zhu, Z.; Kay, S. The Rao test for testing handedness of complex-valued covariance matrix. In Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 20–25 March 2016; pp. 3960–3963. [Google Scholar]
  27. Huang, B.; Basit, A.; Wang, W.; Zhang, S. Adaptive detection with Bayesian framework for FDA-MIMO radar. IEEE Geosci. Remote Sens. Lett. 2021, 19, 3509505. [Google Scholar] [CrossRef]
  28. Kay, S. Fundamentals of Statistical Signal Processing: Detection Theory; Prentice-Hall: Englewood Cliffs, NJ, USA, 1998. [Google Scholar]
Figure 1. Illustration of Monte Carlo Simulation Steps for each simulation setup.
Figure 1. Illustration of Monte Carlo Simulation Steps for each simulation setup.
Signals 05 00001 g001
Figure 2. ROC curve of the complex Rao and Wald test detector with b 1 = 0.5 + 0.5 j .
Figure 2. ROC curve of the complex Rao and Wald test detector with b 1 = 0.5 + 0.5 j .
Signals 05 00001 g002
Figure 3. ROC curve of the complex Rao and Wald test detector with b 1 = 0.3 + 0.3 j .
Figure 3. ROC curve of the complex Rao and Wald test detector with b 1 = 0.3 + 0.3 j .
Signals 05 00001 g003
Figure 4. ROC curve of the complex Rao and Wald test detector with b 1 = 0.3 + 0.3 j and N = 10 .
Figure 4. ROC curve of the complex Rao and Wald test detector with b 1 = 0.3 + 0.3 j and N = 10 .
Signals 05 00001 g004
Figure 5. Estimated and theoretical PDFs of the test statistic for N = 4.
Figure 5. Estimated and theoretical PDFs of the test statistic for N = 4.
Signals 05 00001 g005
Figure 6. Complex Rao and Wald test performance with “larger” D B .
Figure 6. Complex Rao and Wald test performance with “larger” D B .
Signals 05 00001 g006
Figure 7. Complex Rao and Wald test performance with “smaller” D B .
Figure 7. Complex Rao and Wald test performance with “smaller” D B .
Signals 05 00001 g007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhu, Z. Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix. Signals 2024, 5, 1-17. https://doi.org/10.3390/signals5010001

AMA Style

Zhu Z. Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix. Signals. 2024; 5(1):1-17. https://doi.org/10.3390/signals5010001

Chicago/Turabian Style

Zhu, Zhenghan. 2024. "Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix" Signals 5, no. 1: 1-17. https://doi.org/10.3390/signals5010001

Article Metrics

Back to TopTop