Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix

Zhu, Zhenghan

doi:10.3390/signals5010001

Open AccessArticle

Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix

by

Zhenghan Zhu

^†

Independent Researcher, Austin, TX 78758, USA

^†

Current address: 3001 Esperanza Xing, Austin, TX 787858, USA.

Signals 2024, 5(1), 1-17; https://doi.org/10.3390/signals5010001

Submission received: 21 September 2023 / Revised: 12 November 2023 / Accepted: 28 December 2023 / Published: 4 January 2024

Download

Browse Figures

Versions Notes

Abstract

:

Banding the inverse of a covariance matrix has become a popular technique for estimating a covariance matrix from a limited number of samples. It is of interest to provide criteria to determine if a matrix is bandable, as well as to test the bandedness of a matrix. In this paper, we pose the bandedness testing problem as a hypothesis testing task in statistical signal processing. We then derive two detectors, namely the complex Rao test and the complex Wald test, to test the bandedness of a Cholesky-factor matrix of a covariance matrix’s inverse. Furthermore, in many signal processing fields, such as radar and communications, the covariance matrix and its parameters are often complex-valued; thus, it is of interest to focus on complex-valued cases. The first detector is based on the complex parameter Rao test theorem. It does not require the maximum likelihood estimates of unknown parameters under the alternative hypothesis. We also develop the complex parameter Wald test theorem for general cases and derive the complex Wald test statistic for the bandedness testing problem. Numerical examples and computer simulations are given to evaluate and compare the two detectors’ performance. In addition, we show that the two detectors and the generalized likelihood ratio test are equivalent for the important complex Gaussian linear models and provide an analysis of the root cause of the equivalence.

Keywords:

complex parameter Wald test; complex parameter Rao test; bandedness; complex-valued covariance matrix; Gaussian linear model

1. Introduction

In statistical signal processing applications, such as radar and communications, the sample covariance matrix plays an essential role [1]. It is usually estimated from N sample data vectors

[x_{0} x_{1} \dots, x_{N - 1}]

, where

x_{n}

is assumed to be

L \times 1

identical and independently distributed (IID). The maximum likelihood covariance matrix estimate is [2]

\hat{C} = \frac{1}{N} \sum_{n = 0}^{N - 1} x_{n} x_{n}^{H},

(1)

where H denotes a Hermitian. A good covariance matrix estimate usually requires the number of samples N to be sufficiently large. For instance, in space–time adaptive processing (STAP), it requires

N \geq 2 L

to have a good clutter covariance matrix estimate of size

L \times L

[3]. In practice, however, this is not valid due to the nonstationary environment. For example, the data for a STAP system are often nonstationary due to the heterogeneous clutter [1]. The number of data that are sufficiently IID (homogeneous) can be relatively small

N \leq L

[3].

Techniques such as thresholding and banding are common ways to achieve better covariance matrix estimation. The thresholding method sets small elements of the sample covariance matrix to zero to obtain better estimators [4,5]. Another approach is to band or taper the sample covariance matrix [6,7]. Rothman et al. [8] proposed a Cholesky-based covariance regularization method to ensure positive definiteness. In practice, the inverse covariance matrix may be of primary interest. When the data are multivariate Gaussian, the inverse of the covariance matrix can be used to infer the conditional dependence structure of random variables [9].

Several researchers have investigated different methods for banding the inverse of the covariance matrix. Wu et al. proposed estimating the covariance matrix by banding the Cholesky-factor matrix and applying kernel smoothing estimation [10]. Bickel demonstrated that within the bandable class of covariance matrices, the estimator

{\hat{C}}^{- 1}

obtained by banding the Cholesky-factor matrix of the covariance matrix’s inverse is consistent [4,6]. Qian et al. explored adaptive banding covariance estimation for high-dimensional multivariate longitudinal data [11]. However, not much work is available to provide a criterion for deciding if a covariance matrix is bandable. Such a criterion would be useful for deciding if the banding technique is a suitable strategy in covariance matrix estimation tasks. Moreover, other covariance estimation methods, such as modeling the covariance matrix as a time-varying autoregressive moving average (ARMA) model [12], also require one to test if the model has a good fit, which is similar to but distinct from model order estimation techniques such as the minimum description length (MDL), AIC, BIC, and Bayesian exponential embedded family [13]. Some recent tests for bandedness can be found in [14], where a method for estimating matrix bandwidth is presented. Peng et al. developed several tests for sparse high-dimensional covariance matrices [15]. In [9], An et al. proposed test statistics for detecting band size and applied them to cancer data analysis. In contrast to these works, we pose the problem as a classical parameter hypothesis testing problem, which allows us to employ well-established detection theorems and algorithms in statistical signal processing.

In many practical fields, such as radar and communications, the data and parameters are often complex-valued [16]; therefore, we consider the bandedness testing problem for a complex-valued covariance matrix herein. Some general topics related to complex-valued signal processing can be found in [17,18]. In [19], Kay and Zhu derived the complex parameter Rao test, which allows one to develop a Rao test for complex parameters in a complex-valued domain directly. Based on [19], Sun et al. extended the complex parameter Rao test to include the case of nuisance parameters and also derived the Wald, Gradient, and Durbin tests for complex-valued parameters in a recently published paper [20]. In the present paper, we also derive the complex parameter Wald test as a parallel task and apply it to the problem of testing the bandedness of a covariance matrix.

The Rao test and Wald test are asymptotically optimal detectors for large data records. The complex parameter Rao test requires a lower computational cost than some other detectors, i.e., the generalized likelihood ratio test (GLRT) and Wald test. This is because it does not require the MLEs of unknown parameters under the alternative hypothesis

H_{1}

. This property can be desirable in high-dimensional multivariate signal processing [19], as low latency is a key performance indicator in such systems. The Rao test strikes a good balance between performance and computational cost. The complex parameter Rao test proposed by Kay and Zhu has been applied to multiple problems of radar and communication signal processing [19].

The Wald test is another very useful detector in addition to the Rao test. It is useful in radar target detection tasks, including but not limited to the detection of point targets, extended targets, and multiple-input/multiple-output radar targets in homogeneous, partially homogeneous, and heterogeneous environments [21,22]. In general, it is an equivalent large-data-record test that has the same asymptotically optimal detection performance as the GLRT and the Rao test. For finite-data records, however, it is not guaranteed to have the same performance as the GLRT [23,24,25,26]. In some cases, compared to the GLRT, the Wald test might be more robust when a mismatch exists and may have a lower computational complexity [22]. An example of its application can be found in adaptive detection for frequency diverse array multiple-input/multiple-output radar [27].

This paper is organized as follows: Section 2 formulates the problem of testing the bandedness of a covariance matrix; Section 3.1 derives the complex parameter Rao test detector for testing the bandedness of the Cholesky-factor matrix; and in Section 3.2, we derive the general complex Wald test for the complex-valued parameter hypothesis testing problem. In Section 3.2.3, the complex Wald test for the bandedness testing problem is derived. Examples and computer simulations for evaluating the Rao and Wald detector’s performance are given in Section 4. In addition, the equivalence between complex Wald and Rao tests for the ubiquitous complex Gaussian linear models is proved and analyzed in Section 4.2. Finally, conclusions are drawn in Section 5.

2. Problem Formulation

Assume that we have N IID observed data vectors,

X = {[x_{0}^{T} x_{1}^{T} \dots x_{N - 1}^{T}]}^{T}

, where T denotes transpose and each

x_{n}

is an

L \times 1

complex-valued data vector conforming to a zero-mean multivariate complex Gaussian distribution

x_{n} \sim CN (0, C)

for

n = 0, 1, \dots, N - 1

, and the

x_{n}

s are mutually independent. In addition, we assume

N \leq L

. The

L \times L

covariance matrix

C

is a Hermitian matrix, so its inverse can be decomposed via the Cholesky decomposition as

C^{- 1} = D^{H} D,

(2)

where

D

is a lower triangular

L \times L

matrix. And it has a testing model as follows.

D = D_{B} + \sum_{k = 1}^{M} b_{k} Φ_{k},

(3)

where

D_{B}

is a known banded lower triangular matrix, with a bandwidth of m,

b_{k}

’s are unknown complex-valued parameters, and

Φ_{k}

s are known basis matrices.

Specifically,

\begin{matrix} b_{1} = {[D]}_{m + 2, 1}, & Φ_{1} = e_{m + 2} e_{1}^{T} \\ b_{2} = {[D]}_{m + 3, 2}, & Φ_{2} = e_{m + 3} e_{2}^{T} \\ ⋮ & ⋮ \\ b_{N - m - 1} = {[D]}_{N, N - m - 1}, & Φ_{N - m - 1} = e_{N} e_{N - m - 1}^{T} \\ b_{N - m} = {[D]}_{m + 3, 1}, & Φ_{N - m} = e_{m + 3} e_{1}^{T} \\ b_{N - m + 1} = {[D]}_{m + 4, 2}, & Φ_{N - m + 1} = e_{m + 4} e_{2}^{T} \\ ⋮ & ⋮ \\ b_{M} = {[D]}_{N, 1}, & Φ_{M} = e_{N} e_{1}^{T} \end{matrix}

(4)

where

M = \frac{(N - m - 1) (N - m)}{2}

and

e_{k}

is an

L \times 1

vector with its

k^{t h}

element being one and all other elements being zeros. The objective is to test whether the lower triangular Cholesky factor matrix

D

is equal to the banded lower triangular matrix

D_{B}

. Let

b = {[b_{1} b_{2} \dots b_{M}]}^{T}

, then the detection problem is equivalent to choosing between the following two hypotheses:

\begin{matrix} H_{0} : b & = 0; \\ H_{1} : b & \neq 0; \end{matrix}

(5)

3. Methods

In this section, we derive the complex Rao test and the complex Wald test for the hypothesis testing problem stated above.

3.1. The Complex Rao Test for Testing the Bandedness

The Rao test attains asymptotic (as

N \to \infty

) performance as the GLRT, yet it circumvents the necessity of MLEs under the alternative hypothesis

H_{1}

. As a result, so its computation cost can be lower than that of the GLRT, offering a desirable property in high-dimensional signal processing, including real-time STAP. Subsequently, we proceed by applying the complex Rao test theorem introduced in [19] to derive the Rao test statistic. Let

b^{*} = {[b_{1}^{*} b_{2}^{*} \dots b_{M}^{*}]}^{T}

, where ∗ denotes conjugate, and

\underset{̲}{b} = {[b^{T} b^{H}]}^{T}

, which is a

2 M \times 1

complex-valued parameter vector. The complex parameter Rao test detector can be formed according to [19]

T_{R} (X) = {\frac{\partial ln p (X; \underset{̲}{b})}{\partial {\underset{̲}{b}}^{*}}|}_{b = 0}^{H} {I^{- 1} (\underset{̲}{b})|}_{b = 0} {\frac{\partial ln p (X; \underset{̲}{b})}{\partial {\underset{̲}{b}}^{*}}|}_{b = 0}

(6)

where,

\frac{\partial ln p (X; \underset{̲}{b})}{\partial \underset{̲}{b}} = {[{\frac{\partial ln p (X; \underset{̲}{b})}{\partial b}}^{T} {\frac{\partial ln p (X; b)}{\partial b^{*}}}^{T}]}^{T},

(7)

\frac{\partial ln p (X; \underset{̲}{b})}{\partial b} = {[\frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{1}} \frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{2}} \dots \frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{M}}]}^{T},

(8)

\frac{\partial ln p (X; \underset{̲}{b})}{\partial b^{*}} = {[\frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{1}^{*}} \frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{2}^{*}} \dots \frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{M}^{*}}]}^{T},

(9)

are based on Wirtinger derivatives. We can find each element,

\frac{\partial ln p (X; b)}{\partial b_{k}}

, as follows. First.

\begin{matrix} p (X; \underset{̲}{b}) & = \prod_{n = 0}^{N - 1} p (x_{n}; \underset{̲}{b}) \\ = \prod_{n = 0}^{N - 1} \frac{1}{π^{L} det (C)} exp (- x_{n}^{H} C^{- 1} x_{n}) \\ = \frac{1}{π^{N L} \prod_{n = 0}^{N - 1} det (C)} exp (- \sum_{n = 0}^{N - 1} x_{n}^{H} C^{- 1} x_{n}) \\ = \frac{1}{π^{N L}} exp (- \sum_{n = 0}^{N - 1} x_{n}^{H} D^{H} D x_{n}) \prod_{n = 0}^{N - 1} det (D^{H} D) . \end{matrix}

(10)

Then,

\begin{matrix} ln p (X; \underset{̲}{b}) = ln (\frac{1}{π^{N L}}) - \sum_{n = 0}^{N - 1} x_{n}^{H} D^{H} D x_{n} + N ln det (D^{H} D), \end{matrix}

(11)

and

\begin{matrix} \frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{k}} & = N \frac{\partial ln det (D^{H} D)}{\partial b_{k}} - \sum_{n = 0}^{N - 1} \frac{\partial x_{n}^{H} D^{H} D x_{n}}{\partial b_{k}} \\ = N \frac{\partial ln det (D^{H} D)}{\partial b_{k}} - \sum_{n = 0}^{N - 1} \frac{\partial tr (D x_{n} x_{n}^{H} D^{H})}{\partial b_{k}}, \end{matrix}

(12)

for

k = 1, 2, \dots, M

, where

\begin{matrix} \frac{\partial ln det (D^{H} D)}{\partial b_{k}} & = tr (D^{- 1} Φ_{k}), \end{matrix}

(13)

and

\begin{matrix} \frac{\partial tr (D x_{n} x_{n}^{H} D^{H})}{\partial b_{k}} = tr (x_{n} x_{n}^{H} D^{H} Φ_{k}) . \end{matrix}

(14)

Thus,

\begin{matrix} \frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{k}} & = N tr (D^{- 1} Φ_{k}) - \sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} D^{H} Φ_{k}), \end{matrix}

(15)

Under

H_{0}

, where

b = 0

,

\begin{matrix} {\frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{k}}|}_{\underset{̲}{b} = 0} = N tr (D_{B}^{- 1} Φ_{k}) - \sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} D_{B}^{H} Φ_{k}) \end{matrix}

(16)

Also, we have

\begin{matrix} \frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{k}^{*}} & = N tr (D^{- H} Φ_{k}^{H}) - \sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} D Φ_{k}^{H}), \end{matrix}

(17)

and its value under

H_{0}

\begin{matrix} {\frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{k}^{*}}|}_{b = 0} = N tr (D_{B}^{- H} Φ_{k}^{H}) - \sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} D_{B} Φ_{k}^{H}) \end{matrix}

(18)

We next compute

I (\underset{̲}{b})

.

\begin{matrix} I (\underset{̲}{b}) & = E (\frac{\partial ln p (X; \underset{̲}{b})}{\partial {\underset{̲}{b}}^{*}} \frac{\partial ln p {(X; \underset{̲}{b})}^{H}}{\partial {\underset{̲}{b}}^{*}}) \\ = [\begin{matrix} A & B^{*} \\ B & A^{*} \end{matrix}] \\ = [\begin{matrix} M \times M & M \times M \\ M \times M & M \times M \end{matrix}] \end{matrix}

(19)

where,

\begin{matrix} A & = E (\frac{\partial ln p (X; \underset{̲}{b})}{\partial b^{*}} \frac{\partial ln p {(X; \underset{̲}{b})}^{H}}{\partial b^{*}}) \\ B & = E (\frac{\partial ln p (X; \underset{̲}{b})}{\partial b} \frac{\partial ln p {(X; \underset{̲}{b})}^{T}}{\partial b}) \end{matrix}

(20)

For each element

{[A]}_{k, l}

and

{[B]}_{k, l}

for

1 \leq k, l \leq M

, we can compute as follows,

\begin{matrix} A_{k, l} & = - E (\frac{\partial^{2} ln p (X; \underset{̲}{b})}{\partial b_{k}^{*} \partial b_{l}}) \\ = E (\sum_{n = 0}^{N - 1} tr (Φ_{l} x_{n} x_{n}^{H} Φ_{k}^{H})) \\ = N tr (Φ_{l} D^{- 1} D^{- H} Φ_{k}^{H}) \end{matrix}

(21)

Under

H_{0}

, where

b = 0

, we have

\begin{matrix} {A_{k, l}|}_{b = 0} = N tr (Φ_{l} D_{B}^{- 1} D_{B}^{- H} Φ_{k}^{H}) \end{matrix}

(22)

In a similar fashion, we have

\begin{matrix} B_{k, l} & = - E (\frac{\partial^{2} ln p (X; \underset{̲}{b})}{\partial b_{k} \partial b_{l}}) \\ = N tr (D^{- 1} Φ_{l} D^{- 1} Φ_{k}) \end{matrix}

(23)

and its value under

H_{0}

can be found as follows

\begin{matrix} {B_{k, l}|}_{b = 0} & = N tr (D_{B}^{- 1} Φ_{l} D_{B}^{- 1} Φ_{k}) \end{matrix}

(24)

Substituting Equations (16), (18), (19), (22) and (24) in the complex parameter Rao test Equation (6) yields the complex Rao test statistic. For each unknown parameter, it necessitates two

N \times N

matrix multiplications along with an inversion operation involving the Fisher Information Matrix (FIM). The computational complexity scales approximately in proportion to the number of parameters under scrutiny. Specifically, if there exist M unknown parameters, the computational load increases by a factor of M. Although opportunities for further optimization to mitigate computational demands may exist, exploring such optimizations lies beyond the scope of this paper.

When the detection problem has M unknown parameters, the complex Rao test statistic under the null hypothesis

H_{0}

can be shown to have a chi-squared distribution with M degrees of freedom [28].

3.2. Complex Parameter Wald Test

In this section, we present a novel detection theorem referred to as the Complex Wald test, which is developed in parallel with [20], addressing the general hypothesis testing problem involving complex-valued parameters. Traditionally, this approach required concatenating the real and imaginary parts of the complex-valued data to create an augmented real vector for conducting Wald test computations in the real-valued domain. However, the derived Complex Wald test enables direct computation with complex-valued quantities. Moreover, when the Fisher Information Matrix (FIM) of the unknown parameters exhibits a specific structure, the Complex Wald Test simplifies to a more streamlined form.

Suppose

x = {[u^{T} v^{T}]}^{T}

with

x \in R^{2 N \times 1}

,

u \in R^{N \times 1}

and

v \in R^{N \times 1}

, which is formed from the observed complex-valued vector

\tilde{x} = u + j v

, where

\tilde{x} \in C^{N \times 1}

. Let

ξ = {[α^{T} β^{T}]}^{T}

, formed from the unknown complex-valued parameter vector

\tilde{θ} = α + j β

, where

ξ \in R^{2 p \times 1}

,

α \in R^{p \times 1}

,

β \in R^{p \times 1}

and

\tilde{θ} \in C^{p \times 1}

. We denote the probability density function (PDF) of the data as

p_{x} (x; ξ)

. Then, we have the PDF

p_{\tilde{x}} (\tilde{x}; \tilde{θ}) \equiv p_{x} (x; ξ) \equiv p_{u, v} (u, v; α, β)

. The real Wald test without nuisance parameters (parameters that are unknown yet of no interest) [28] is

T_{W} (x) = {(ξ - ξ_{0})}^{T} I (ξ) (ξ - ξ_{0}) |_{ξ = {\hat{ξ}}_{1}},

(25)

where

{\hat{ξ}}_{1}

is the MLE of

ξ

under

H_{1}

, and

I (ξ)

is Fisher Information Matrix (FIM) of

ξ

and can be partitioned as

I (ξ) = [\begin{matrix} I_{α α} & I_{α β} \\ I_{β α} & I_{β β} \end{matrix}]

(26)

where

I_{α α}, I_{β β}, I_{β α}, I_{α β} \in R^{p \times p}

,

I_{α α}^{T} = I_{α α}

,

I_{β β}^{T} = I_{β β}

, and

I_{α β}^{T} = I_{β α}

. We next derive the complex Wald test for the unknown parameter

\tilde{θ}

by carrying out mathematical operations with respect to complex-valued quantities.

3.2.1. Complex Wald Test for General Cases

Observe that

p_{\tilde{x}} (\tilde{x}; \tilde{θ})

is a real function of

\tilde{θ}

, depending on

\tilde{θ}

and

{\tilde{θ}}^{*}

, where ∗ denotes complex conjugate. Denote

p_{\tilde{x}} (\tilde{x}; \tilde{θ})

as

p_{\tilde{x}} (\tilde{x}; \tilde{Θ}) \equiv p_{\tilde{x}} (\tilde{x}; \tilde{θ}, {\tilde{θ}}^{*})

, where

\tilde{Θ} = {[{\tilde{θ}}^{T} {\tilde{θ}}^{H}]}^{T}

,

\tilde{Θ} \in C^{2 p \times 1}

, and

{(\cdot)}^{H}

represents Hermitian. Also, the complex partial derivatives of a real scalar function

g (\tilde{z}, {\tilde{z}}^{*}) \equiv f (x, y)

are

\frac{\partial g (\tilde{z}, {\tilde{z}}^{*})}{\partial \tilde{z}} = \frac{1}{2} (\frac{\partial f}{\partial x} - j \frac{\partial f}{\partial y})

(27)

and

\frac{\partial g (\tilde{z}, {\tilde{z}}^{*})}{\partial {\tilde{z}}^{*}} = \frac{1}{2} (\frac{\partial f}{\partial x} + j \frac{\partial f}{\partial y}),

(28)

where

\tilde{z} = x + j y

. And, for a real function g of complex vectors

\tilde{z}

and

{\tilde{z}}^{*}

.

{[\frac{\partial g (\tilde{z}, {\tilde{z}}^{*})}{\partial \tilde{z}}]}_{i} = \frac{\partial g (\tilde{z}, {\tilde{z}}^{*})}{\partial {\tilde{z}}_{i}}

(29)

{[\frac{\partial g (\tilde{z}, {\tilde{z}}^{*})}{\partial {\tilde{z}}^{*}}]}_{i} = \frac{\partial g (\tilde{z}, {\tilde{z}}^{*})}{\partial {\tilde{z}}_{i}^{*}} .

(30)

With these definitions, we have

Theorem 1.

Complex Wald Test

T_{\tilde{W}} (\tilde{x}) = {(\tilde{Θ} - {\tilde{Θ}}_{0})}^{H} \tilde{I} (\tilde{Θ}) (\tilde{Θ} - {\tilde{Θ}}_{0}) |_{\tilde{Θ} = {\hat{\tilde{Θ}}}_{1}} = T_{W} (x),

(31)

where

\begin{matrix} \tilde{I} (\tilde{Θ}) & = E (\frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{Θ})}{\partial {\tilde{Θ}}^{*}} {\frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{Θ})}{\partial {\tilde{Θ}}^{*}}}^{H}), \end{matrix}

(32)

T_{\tilde{W}} (\tilde{x})

is a complex Wald test statistic, and

T_{W} (x)

is real Wald test statistic. Note that

{\hat{\tilde{Θ}}}_{1}

is the MLE of

\tilde{Θ}

under

H_{1}

, and

{\hat{\tilde{Θ}}}_{1} = [\begin{matrix} {\hat{\tilde{θ}}}_{1} \\ {\hat{\tilde{θ}}}_{1}^{*} \end{matrix}]

, and

{\hat{\tilde{θ}}}_{1}

is the MLE of

\tilde{θ}

under

H_{1}

;

{\tilde{Θ}}_{0} = [\begin{matrix} {\tilde{θ}}_{0} \\ {\tilde{θ}}_{0}^{*} \end{matrix}]

, and

{\tilde{θ}}_{0}

is the true value of the unknown parameter under

H_{0}

. Note that

\tilde{I} (\tilde{Θ})

can be used as a FIM for an unbiased estimation of

\tilde{Θ}

. Also,

\tilde{I} (\tilde{Θ})

is a

2 p \times 2 p

complex hermitian matrix. Hence,

T_{\tilde{W}} (\tilde{x})

is real.

Note that no assumption has been imposed on the form of

I (ξ)

.

Proof.

Let

T = [\begin{matrix} \frac{1}{2} I_{p} & \frac{j}{2} I_{p} \\ \frac{1}{2} I_{p} & - \frac{j}{2} I_{p} \end{matrix}]

(33)

where

I_{p}

is a

p \times p

identity matrix. Then,

\begin{matrix} \tilde{I} (\tilde{Θ}) = T I (ξ) T^{H} \end{matrix}

(34)

Hence, we have

\begin{matrix} T_{\tilde{W}} (\tilde{x}) & = {(\tilde{Θ} - {\tilde{Θ}}_{0})}^{H} (T I (ξ) T^{H}) (\tilde{Θ} - {\tilde{Θ}}_{0}) \\ = {([\begin{matrix} I_{p} & j I_{p} \\ I_{p} & - j I_{p} \end{matrix}] (ξ - ξ_{0}))}^{H} (T I (ξ) T^{H}) ([\begin{matrix} I_{p} & j I_{p} \\ I_{p} & - j I_{p} \end{matrix}] (ξ - ξ_{0})) \\ = {(2 T (ξ - ξ_{0}))}^{H} (T I (ξ) T^{H}) (2 T (ξ - ξ_{0})) \\ = {(ξ - ξ_{0})}^{T} 2 T^{H} T I (ξ) 2 T^{H} T (ξ - ξ_{0}) \\ = {(ξ - ξ_{0})}^{T} I (ξ) (ξ - ξ_{0}) \\ = T_{W} (x) \end{matrix}

(35)

□

3.2.2. Complex Wald Test for Special Fisher Information Matrix

Subsequently, we examine a commonly encountered special form of the FIM in practical applications.

Theorem 2.

If the real FIM given by (26) has the special form

\begin{matrix} I (ξ) & = 2 [\begin{matrix} E & - F \\ F & E \end{matrix}], \end{matrix}

(36)

then

T_{W} (x) = T_{\tilde{W}} (\tilde{x}) = 2 {(\tilde{θ} - {\tilde{θ}}_{0})}^{H} \tilde{I} (\tilde{θ}) (\tilde{θ} - {\tilde{θ}}_{0}) |_{\tilde{θ} = {\hat{\tilde{θ}}}_{1}},

(37)

where

\begin{matrix} \tilde{I} (\tilde{θ}) & = E (\frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} {\frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}}}^{H}) . \end{matrix}

(38)

Note that

\tilde{I} (\tilde{θ})

is hermitian and hence the expression is a real number.

Proof.

When the real FIM attains the special form, we have

\begin{matrix} \tilde{I} (\tilde{Θ}) = [\begin{matrix} \tilde{I} (\tilde{θ}) & 0 \\ 0 & {\tilde{I}}^{*} (\tilde{θ}) \end{matrix}] . \end{matrix}

(39)

And with (31), we have

\begin{matrix} T_{\tilde{W}} (\tilde{x}) & = {[\begin{matrix} \tilde{θ} - {\tilde{θ}}_{0} \\ {\tilde{θ}}^{*} - {\tilde{θ}}_{0}^{*} \end{matrix}]}^{H} [\begin{matrix} \tilde{I} (\tilde{θ}) & 0 \\ 0 & \tilde{I} (\tilde{θ}) \end{matrix}] [\begin{matrix} \tilde{θ} - {\tilde{θ}}_{0} \\ {\tilde{θ}}^{*} - {\tilde{θ}}_{0}^{*} \end{matrix}] \\ = 2 R e {{(\tilde{θ} - {\tilde{θ}}_{0})}^{H} \tilde{I} (\tilde{θ}) (\tilde{θ} - {\tilde{θ}}_{0})}, \end{matrix}

(40)

Also, note that

{(\tilde{θ} - {\tilde{θ}}_{0})}^{H} \tilde{I} (\tilde{θ}) (\tilde{θ} - {\tilde{θ}}_{0})

is real. Therefore,

\begin{matrix} T_{\tilde{W}} (\tilde{x}) = 2 {(\tilde{θ} - {\tilde{θ}}_{0})}^{H} \tilde{I} (\tilde{θ}) (\tilde{θ} - {\tilde{θ}}_{0}) = T_{W} (x) \end{matrix}

(41)

□

3.2.3. The Complex Wald Test for Testing Bandedness

This section delves into deriving the complex Wald test statistic for the aforementioned problem of testing bandedness. Recall that

\begin{matrix} ln p (X; \underset{̲}{b}) = ln (\frac{1}{π^{N L}}) - \sum_{n = 0}^{N - 1} x_{n}^{H} D^{H} D x_{n} + N ln det (D^{H} D), \end{matrix}

(42)

and

\begin{matrix} \frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{k}} & = N \frac{\partial ln det (D^{H} D)}{\partial b_{k}} - \sum_{n = 0}^{N - 1} \frac{\partial x_{n}^{H} D^{H} D x_{n}}{\partial b_{k}} \\ = N \frac{\partial ln det (D^{H} D)}{\partial b_{k}} - \sum_{n = 0}^{N - 1} \frac{\partial tr (D x_{n} x_{n}^{H} D^{H})}{\partial b_{k}}, \end{matrix}

(43)

for

k = 1, 2, \dots, M

, where

\begin{matrix} \frac{\partial ln det (D^{H} D)}{\partial b_{k}} & = tr (D^{- 1} Φ_{k}), \end{matrix}

(44)

and

\begin{matrix} \frac{\partial tr (D x_{n} x_{n}^{H} D^{H})}{\partial b_{k}} = tr (x_{n} x_{n}^{H} D^{H} Φ_{k}) . \end{matrix}

(45)

We aim to determine the Maximum Likelihood Estimates (MLEs) of

b_{k}

. Considering that

b_{k}

are relatively small, we are interested in testing whether they are equal to zero, and we approximate

D^{- 1}

as

D_{B}^{- 1}

.

\begin{matrix} \frac{\partial ln p (X; \underset{̲}{b})}{\partial b_{k}} & = N tr (D^{- 1} Φ_{k}) - \sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} D^{H} Φ_{k}) \\ = N tr (D_{b}^{- 1} Φ_{k}) - \sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} {(D_{b} + \sum_{i = 1}^{M} b_{i} Φ_{i})}^{H} Φ_{k}) \\ = N tr (D_{b}^{- 1} Φ_{k}) - \sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} D_{b}^{H} Φ_{k}) - b_{k}^{*} \sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} Φ_{k}^{H} Φ_{k}) \end{matrix}

(46)

therefore

\begin{matrix} \hat{b_{k}^{*}} & = \frac{N tr (D_{b}^{- 1} Φ_{k}) - \sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} D_{b}^{H} Φ_{k})}{\sum_{n = 0}^{N - 1} tr (x_{n} x_{n}^{H} Φ_{k}^{H} Φ_{k})} \end{matrix}

(47)

Up to this point, we have obtained the MLEs of unknown parameters under the alternative hypothesis. Computation of these MLEs constitutes extra computational cost in the Wald test compared to the Rao test. To put it simply, for each additional unknown parameter, the Wald test requires roughly three times the workload of

N \times N

matrix multiplications in contrast to the Rao test.

Upon substituting the MLEs into both the complex FIM equation and the complex Wald test equation in (31), we derive the Wald test statistic. It is noteworthy that given the presence of the unknown parameter within the covariance matrix, the conditions required for the application of the specialized complex Wald test theorem, as delineated in [19], are not satisfied in this case due to the absence of the requisite special form in the FIM.

4. Simulations, Results and Discussion

4.1. Simulations and Result Discussion on Complex Rao and Wald Tests for Bandedness Testing

Consider an illustrative example, where we have the

N = 4

observed data set

X = {[x_{0}^{T} x_{1}^{T} x_{2}^{T} x_{3}^{T}]}^{T}

, each

x_{n}

’s is a

4 \times 1

complex-valued IID Gaussian vector,

x_{n} \sim CN (0, C)

. Also,

C^{- 1} = D^{H} D

, and

D = D_{B} + b_{1} Φ_{1}

with

Φ_{1} = e_{4} e_{1}^{T}

and

D_{B} = [\begin{matrix} 0.53 & 0 & 0 & 0 \\ - 0.26 + 0.25 j & 0.53 & 0 & 0 \\ - 0.12 + 0.1 j & - 0.33 + 0.28 j & 0.51 & 0 \\ 0 & - 0.17 - 0.10 j & 0.2 - 0.27 j & 0.5 \end{matrix}]

(48)

We are testing whether the Cholesky factor matrix

D

is banded and equal to the known

D_{B}

. It is equivalent to testing between

b_{1} = 0

versus

b_{1} \neq 0

. The complex Rao test for this example can be shown to be (49)

\begin{matrix} T_{R} (X) & = & \frac{Re {{(4 tr (D_{B}^{- 1} Φ_{1}) - \sum_{n = 0}^{3} tr (x_{n} x_{n}^{H} D_{B}^{H} Φ_{1}))}^{2} {tr}^{*} (D_{B}^{- 1} Φ_{1} D_{B}^{- H} Φ_{1}^{H})}}{2 [| tr (Φ_{l} D_{B}^{- 1} D_{B}^{- H} Φ_{k}^{H}) |^{2} - {| tr (D_{B}^{- 1} Φ_{l} D_{B}^{- 1} Φ_{k}) |}^{2}]} \\ - \frac{Re {{(4 tr (D_{B}^{- 1} Φ_{1}) - \sum_{n = 0}^{3} tr (x_{n} x_{n}^{H} D_{B}^{H} Φ_{1}))}^{2} tr (D_{B}^{- 1} Φ_{1} D_{B}^{- 1} Φ_{1})}}{2 [| tr (Φ_{l} D_{B}^{- 1} D_{B}^{- H} Φ_{k}^{H}) |^{2} - {| t r (D_{B}^{- 1} Φ_{l} D_{B}^{- 1} Φ_{k}) |}^{2}]} \end{matrix}

(49)

The complex Wald test for this illustrative example can be obtained by using Equations (31) and (47).

To evaluate the performance of the complex Rao and Wald tests for this example, we set

b_{1} = 0.5 + 0.5 j

under the alternative hypothesis

H_{1}

.

All simulations were conducted using Matlab by MathWorks, Inc.,(Portola Valley, CA, USA). The Receiver Operating Characteristic (ROC) curves, depicting the relationship between the Probability of Detection (

P_{d}

) and the Probability of False Alarm (

P_{f a}

), were generated to characterize the performance of the detectors. The construction of the ROC curve involved the following steps: for each simulation setup, a substantial number of Monte Carlo simulation trials were executed. The test statistics were computed under both scenarios,

H_{1}

and

H_{0}

. Specifically, simulations were run 50,000 times, resulting in a vector of size 50,000 for the test statistic under

H_{1}

, denoted as

T_{1}

, and similarly, another vector of size 50,000 for the test statistic under

H_{0}

, denoted as

T_{0}

. Varying a threshold

γ

, any element in

T_{0}

greater than

γ

signified a false alarm trial, while any element in

T_{1}

greater than

γ

represented a correct detection trial. By systematically varying the threshold

γ

, a series of

(P_{f a}, P_{d})

pairs were obtained, constituting the ROC curve. This curve, encapsulating the trade-off between

P_{d}

and

P_{f a}

, was generated as a vector of such pairs by varying the threshold

γ

. Figure 1 presents a high-level flowchart outlining the steps involved in the Monte Carlo simulations for generating the ROC curves for both the Wald and Rao tests.

The ROC curves of the derived complex Rao and Wald test is given in Figure 2.

The results demonstrate that both the complex Rao and Wald tests exhibit commendable performance when confronted with limited data records. Notably, the complex Wald test outperforms the complex Rao test, aligning with expectations given that the latter generally demonstrates suboptimal performance. This performance disparity is expected considering that the complex Rao test, while less computationally intensive, as it does not necessitate the MLEs of the unknown parameters under

H_{1}

, inherently delivers slightly inferior performance.

A second illustrative example is presented, where

b_{1} = 0.3 + 0.3 j

, representing a more challenging scenario compared to the earlier instance. Figure 3 showcases the performance of both proposed detectors in this setting. It is evident that in comparison to the prior example, the detectors’ performance declines due to the smaller magnitude of

b_{1}

. Notably, the complex Wald test exhibits a slight edge over the complex Rao test in this more challenging scenario.

Next, we increase the number of IID samples from

N = 4

to

N = 10

and set the

b_{1} = 0.3 + 0.3 j

. The comparative performance of the two detectors is depicted in Figure 4. A comparative analysis with Figure 3 reveals a conspicuous enhancement in the performance of both detectors, attributable to the increased availability of data samples. Notably, it is discernible that in this scenario, the performance of the two detectors converges significantly, indicating that the Complex Rao and Wald tests exhibit asymptotic equivalence as the dataset size grows.

Both the Complex Rao test statistic and Complex Wald test under the null hypothesis

H_{0}

are chi-squared distributed with one degree of freedom,

T_{R} (X) \sim χ_{1}^{2}

[28]. The performance of the Rao test and Wald test can be found asymptotically or as

N \to \infty

. Estimated probability density function (PDFs), shown as bar plots, for both the Rao test and the Wald test, and the theoretical asymptotic

χ_{1}^{2}

PDF are shown in Figure 5. Notably, even with a limited data record of

N = 4

, the estimated PDFs remarkably align with the theoretical distribution.

The detectors’ performance is also dependent on the base matrix

D_{B}

. Next, we double the magnitude of each element of

D_{B}

; that is,

D_{B} = [\begin{matrix} 1.06 & 0 & 0 & 0 \\ - 0.52 + 0.5 j & 1.06 & 0 & 0 \\ - 0.24 + 0.2 j & - 0.66 + 0.56 j & 1.02 & 0 \\ 0 & - 0.34 - 0.20 j & 0.4 - 0.54 j & 1.0 \end{matrix}]

(50)

and keep the rest of the setup unchanged with

N = 4

and

b_{1} = 0.5 + 0.5 j

. The two detectors’ performances can be found in Figure 6.

In comparison with Figure 2, it is evident that both detectors’ performances have degraded due to the larger base matrix. Next, we show the detectors’ performance change when the base matrix become smaller. We half each element of

D_{B}

; that is,

D_{B} = [\begin{matrix} 0.265 & 0 & 0 & 0 \\ - 0.13 + 0.125 j & 0.265 & 0 & 0 \\ - 0.06 + 0.05 j & - 0.165 + 0.14 j & 0.255 & 0 \\ 0 & - 0.085 - 0.05 j & 0.1 - 0.135 j & 0.25 \end{matrix}]

(51)

and keep the rest of the setup unchanged with

N = 4

and

b_{1} = 0.5 + 0.5 j

. The two detectors’ performances can be found in Figure 7. Compared with Figure 2, clearly both detectors’ performances have improved due to the smaller base matrix.

4.2. Equivalence among Complex Wald, Rao Test and GlRT for Linear Model

When the unknown parameter is present in the covariance matrix, the structure of the FIM does not attain the special form that allows one to use the Reduced Complex Wald Test and the Reduced Complex Rao Test [19]. One such case is the problem of testing the bandedness of the covariance matrix discussed above. As a counter example, in this section, we show the equivalence between the Complex Rao Test and the Complex Wald Test for an important case of general practical interest—the complex Gaussian linear model. A large number of signal processing, detection and estimation problems like radar signal processing and communications can be represented by a linear model, and hence it is of practical importance to discuss the topic.

4.2.1. Complex Classical Linear Model Testing Problem

First, we apply complex Wald test to the complex linear model problem. Assume the data are modeled according to [23]:

\tilde{x} = \tilde{H} \tilde{θ} + \tilde{w},

where

\tilde{H} \in C^{N \times p}

is a known matrix with

N > p

and full rank,

\tilde{θ}

is an unknown complex

p \times 1

parameter vector, and

\tilde{w}

is a complex

N \times 1

random vector with PDF

\tilde{w} \sim CN (0, \tilde{C})

, with

\tilde{C} \in C^{N \times N}

. The testing problem is equivalent to deciding between two hypotheses:

\begin{matrix} H_{0} : \tilde{θ} = 0 \\ H_{1} : \tilde{θ} \neq 0 \end{matrix}

Then, based on the properties of the complex Gaussian PDF

\tilde{x} \sim CN (\tilde{μ}, \tilde{C})

so that

\tilde{μ} = \tilde{H} \tilde{θ}

and

\tilde{C} (\tilde{θ}) = \tilde{C}

(not dependent on

\tilde{θ}

or

{\tilde{θ}}^{*}

). The PDF is

p (\tilde{x}; \tilde{θ}) = \frac{1}{π^{N} det (\tilde{C})} exp [- {(\tilde{x} - \tilde{H} \tilde{θ})}^{H} {\tilde{C}}^{- 1} (\tilde{x} - \tilde{H} \tilde{θ})] .

We have

\begin{matrix} \frac{\partial ln p (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} & = - \frac{\partial {(\tilde{x} - \tilde{H} \tilde{θ})}^{H} {\tilde{C}}^{- 1} (\tilde{x} - \tilde{H} \tilde{θ})}{\partial {\tilde{θ}}^{*}} \\ = {\tilde{H}}^{H} {\tilde{C}}^{- 1} (\tilde{x} - \tilde{H} \tilde{θ}) \end{matrix}

Therefore, the MLE of

\tilde{θ}

under

H_{1}

is

{\hat{\tilde{θ}}}_{1} = {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{x}

. Also we have, as shown in [23],

\tilde{I} (\tilde{θ}) = {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H},

and the real FIM has the special form

I^{- 1} (ξ) = \frac{1}{2} [\begin{matrix} Re {{({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1}} & - Im {{({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1}} \\ Im {{({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1}} & Re {{({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1}} \end{matrix}] .

4.2.2. Generalized Likelihood Ratio Test (GlRT)

The GLRT decides

H_{1}

if

L_{G} (\tilde{x}) = \frac{p (\tilde{x}; {\hat{\tilde{θ}}}_{1})}{p (\tilde{x}; {\tilde{θ}}_{0} = 0)} > γ

(52)

We have

\begin{matrix} 2 ln L_{G} (\tilde{x}) & = & 2 ln \frac{p (\tilde{x}; {\hat{\tilde{θ}}}_{1})}{p (\tilde{x}; {\tilde{θ}}_{0} = 0)} \\ = & 2 ({\tilde{x}}^{H} {\tilde{C}}^{- 1} \tilde{x} - {(\tilde{x} - \tilde{H} {\hat{\tilde{θ}}}_{1})}^{H} {\tilde{C}}^{- 1} (\tilde{x} - \tilde{H} {\hat{\tilde{θ}}}_{1})) \\ = & 2 ({\tilde{x}}^{H} {\tilde{C}}^{- 1} \tilde{x} - {\tilde{x}}^{H} {\tilde{C}}^{- 1} \tilde{x} \\ + {\tilde{x}}^{H} {\tilde{C}}^{- 1} \tilde{H} {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{x} \\ + {\tilde{x}}^{H} {\tilde{C}}^{- 1} \tilde{H} {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{x} \\ - {\tilde{x}}^{H} {\tilde{C}}^{- 1} \tilde{H} {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H} {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{x}) \\ = & 2 {\tilde{x}}^{H} {\tilde{C}}^{- 1} \tilde{H} {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{x} \end{matrix}

(53)

4.2.3. Complex Rao Test

The complex Rao test is [19]

\begin{matrix} T_{\tilde{R}} (\tilde{x}) & = & 2 {\frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}}}^{H} {\tilde{I}}^{- 1} (\tilde{θ}) \frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} |_{\tilde{θ} = {\tilde{θ}}_{0}} \\ = & 2 {[{\tilde{H}}^{H} {\tilde{C}}^{- 1} (\tilde{x} - \tilde{H} \tilde{θ})]}^{H} {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} [{\tilde{H}}^{H} {\tilde{C}}^{- 1} (\tilde{x} - \tilde{H} \tilde{θ})] |_{\tilde{θ} = 0} \\ = & 2 {\tilde{x}}^{H} {\tilde{C}}^{- 1} \tilde{H} {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{x} \end{matrix}

(54)

4.2.4. Complex Wald Test

The real FIM of this problem has a special form, so Theorem 2 applies. The complex Wald test is

\begin{matrix} T_{\tilde{W}} (\tilde{x}) & = & 2 {(\tilde{θ} - {\tilde{θ}}_{0})}^{H} \tilde{I} (\tilde{θ}) (\tilde{θ} - {\tilde{θ}}_{0}) |_{\tilde{θ} = {\hat{\tilde{θ}}}_{1}} \\ = & 2 {[{({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{x}]}^{H} ({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H}) {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{x} \\ = & 2 {\tilde{x}}^{H} {\tilde{C}}^{- 1} \tilde{H} {({\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H})}^{- 1} {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{x} \end{matrix}

(55)

Compared with the results obtained by using GLRT and the complex Rao Test for the same problem, all three detectors are equivalent in this case. Next, we show the root cause of this equivalence.

4.2.5. The Root Cause of the Equivalence

First,

\frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} = \tilde{I} (\tilde{θ}) (\hat{\tilde{θ}} - \tilde{θ})

(56)

where

\tilde{θ}

is the true value. Because the complex FIM is not dependent on the true value of the unknown parameters,

\tilde{I} (\tilde{θ}) = \tilde{I} ({\hat{\tilde{θ}}}_{1})

, we have

\frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} = \tilde{I} ({\hat{\tilde{θ}}}_{1}) (\hat{\tilde{θ}} - \tilde{θ})

(57)

So, by integrating with respect to

{\tilde{θ}}^{*}

, it produces

ln p_{\tilde{x}} (\tilde{x}; \tilde{θ}) = - {(\hat{\tilde{θ}} - \tilde{θ})}^{H} \tilde{I} (\hat{\tilde{θ}}) (\hat{\tilde{θ}} - \tilde{θ}) + c (\hat{\tilde{θ}})

(58)

since the constant of integration must be

c (\hat{\tilde{θ}}) = ln p_{\tilde{x}} (\tilde{x}; \hat{\tilde{θ}})

.

The GLRT becomes

L_{G} (\tilde{x}) = \frac{p (\tilde{x}; {\hat{\tilde{θ}}}_{1})}{p (\tilde{x}; {\tilde{θ}}_{0})} = \frac{p (\tilde{x}; {\hat{\tilde{θ}}}_{1})}{p (\tilde{x}; {\hat{\tilde{θ}}}_{1}) exp [- {({\hat{\tilde{θ}}}_{1} - {\tilde{θ}}_{0})}^{H} \tilde{I} ({\hat{\tilde{θ}}}_{1}) ({\hat{\tilde{θ}}}_{1} - {\tilde{θ}}_{0})]}

(59)

Thus,

2 ln L_{G} (\tilde{x}) = 2 {({\hat{\tilde{θ}}}_{1} - {\tilde{θ}}_{0})}^{H} \tilde{I} ({\hat{\tilde{θ}}}_{1}) ({\hat{\tilde{θ}}}_{1} - {\tilde{θ}}_{0})

(60)

This explains why the GLRT and the complex Wald test coincide for the complex linear model. Furthermore, to establish the equivalence between the complex Rao test and the Generalized Likelihood Ratio Test (GLRT), we use Equation (56)

(\hat{\tilde{θ}} - \tilde{θ}) = {\tilde{I}}^{- 1} (\tilde{θ}) \frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}}

(61)

Specially,

({\hat{\tilde{θ}}}_{1} - {\tilde{θ}}_{0}) = {\tilde{I}}^{- 1} ({\tilde{θ}}_{0}) \frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} |_{\tilde{θ} = {\tilde{θ}}_{0}}

(62)

By substituting (62) to (60), we have

\begin{matrix} \begin{matrix} 2 ln L_{G} (\tilde{x}) & = 2 {({\tilde{I}}^{- 1} ({\tilde{θ}}_{0}) \frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} |_{\tilde{θ} = {\tilde{θ}}_{0}})}^{H} \tilde{I} ({\hat{\tilde{θ}}}_{1}) ({\tilde{I}}^{- 1} ({\tilde{θ}}_{0}) \frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} |_{\tilde{θ} = {\tilde{θ}}_{0}}) \\ = 2 \frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} |_{\tilde{θ} = {\tilde{θ}}_{0}}^{H} {\tilde{I}}^{- 1} ({\tilde{θ}}_{0}) \tilde{I} ({\hat{\tilde{θ}}}_{1}) {\tilde{I}}^{- 1} ({\tilde{θ}}_{0}) \frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} |_{\tilde{θ} = {\tilde{θ}}_{0}} \\ = 2 \frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} |_{\tilde{θ} = {\tilde{θ}}_{0}}^{H} {\tilde{I}}^{- 1} ({\tilde{θ}}_{0}) \frac{\partial ln p_{\tilde{x}} (\tilde{x}; \tilde{θ})}{\partial {\tilde{θ}}^{*}} |_{\tilde{θ} = {\tilde{θ}}_{0}} \end{matrix} \end{matrix}

(63)

where we have used the property of

\tilde{I} (\tilde{θ}) = \tilde{I} ({\tilde{θ}}_{0}) = \tilde{I} ({\hat{\tilde{θ}}}_{1})

. This completes the proof that the complex Rao test and the GLRT are equivalent for the complex linear model. Consequently, it also proves the equivalence among the aforementioned three detectors for the complex linear model. In summary, the equivalence stems from the fact that the complex FIM of the linear model

\tilde{I} (\tilde{θ}) = {\tilde{H}}^{H} {\tilde{C}}^{- 1} \tilde{H}

does not depend on the true value of

\tilde{θ}

. In particular, we have

\tilde{I} (\tilde{θ}) = \tilde{I} ({\tilde{θ}}_{0}) = \tilde{I} ({\hat{\tilde{θ}}}_{1})

, and hence the complex Rao test and Wald test are both equivalent to the GLRT without extra constraints.

5. Conclusions

The utilization of banding techniques has gained prominence in the estimation of covariance matrices, particularly in scenarios with limited sample sizes within high-dimensional signal processing. Before adopting such techniques, assessing the matrix’s ‘bandability’ becomes crucial. To this end, we have derived both the complex Rao test and the complex Wald test, specifically tailored for evaluating the bandedness of the Cholesky factor matrix within the inverse of the covariance matrix. The computational cost of the Rao test is comparatively lower, while implementing the complex Wald test demands obtaining maximum likelihood estimates under the alternative hypothesis. Consequently, the latter proofs more challenging to derive and incur higher computational expenses. We present examples and simulations to assess the performance of these proposed detectors. In our evaluations, the Wald test exhibits slightly superior performance in cases with smaller ‘signal’ magnitudes and significantly outperforms the complex parameter Rao test as the tested parameter grows larger. However, as the sample size increases, the performance gap between the two detectors diminishes. Notably, both detectors demonstrate asymptotic optimality with a substantial volume of available data.These derived detectors can serve as a preparatory step before implementing banding techniques for covariance matrix estimation. Furthermore, they extend applicability beyond bandedness assessment by enabling tests for zero elements within a matrix, achieved by appropriately modifying the basis matrix

Φ_{k}

. Moreover, our investigation reveals the equivalence between the complex Rao test, Wald test, and GLRT within the general complex Gaussian linear model, shedding light on the underlying mechanisms of this equivalence. In our forthcoming research, we aim to delve deeper into the computational costs associated with these two detectors.

Funding

This research received no external funding. This work was partially presented at the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The author declares no conflicts of interest.

References

Melvin, W.L. A STAP Overview. IEEE AES Syst. Mag.—Spec. Tutor. Issue 2004, 19, 19–35. [Google Scholar] [CrossRef]
Anderson, T. An Introduction to Multivariate Statistical Analysis, 3rd ed.; Wiley: Hoboken, NJ, USA, 2003. [Google Scholar]
Melvin, W.L.; Showman, G.A. An Approach to Knowledge-Aided Covariance Estimation. IEEE Trans. Aerosp. Electron. Syst. 2006, 42, 1021–1042. [Google Scholar] [CrossRef]
Bickel, P.J.; Levina, E. Covariance regularization by thresholding. Ann. Statist. 2008, 36, 2577–2604. [Google Scholar] [CrossRef] [PubMed]
Rothman, A.; Levina, L.; Zhu, J. Generalized thresholding of large covariance matrices. J. Am. Statist. Assoc. 2009, 104, 177–186. [Google Scholar] [CrossRef]
Bickel, P.J.; Levina, E. Regularized estimation of large covariance matrices. Ann. Statist. 2008, 36, 199–227. [Google Scholar] [CrossRef]
Cai, T.T.; Zhang, C.; Zhou, H. Optimal rates of convergence for covariance matrix estimation. Ann. Statist. 2010, 38, 2118–2144. [Google Scholar] [CrossRef]
Rothman, A.; Levina, L.; Zhu, J. A new approach to Cholesky-based covariance regularization in high dimensions. Biometrika 2010, 97, 539–550. [Google Scholar] [CrossRef]
An, B.; Guo, J.; Liu, Y. Hypothesis testing for band size detection of high-dimensional banded precision matrices. Biometrika 2014, 101, 477–483. [Google Scholar] [CrossRef]
Wu, W.B.; Pourahmadi, M. Nonparametric estimation of large covariance matrices of longitudinal data. Biometrika 2003, 90, 831–844. [Google Scholar] [CrossRef]
Qian, F.; Zhang, W.; Chen, Y. Adaptive banding covariance estimation for high-dimensional multivariate longitudinal data. Can. J. Stat. 2021, 49, 906–938. [Google Scholar] [CrossRef]
Wiesel, A.; Bibi, O.; Globerson, A. Time varying autoregressive moving average models for covariance estimation. IEEE Trans. Signal Process. 2013, 61, 2791–2801. [Google Scholar] [CrossRef]
Zhu, Z.; Kay, S. On Bayesian Exponentially Embedded Family for model order selection. IEEE Trans. Signal Process. 2018, 66, 933–943. [Google Scholar] [CrossRef]
Qiu, Y.-M.; Chen, S.X. Test for bandedness of high dimensional covariance matrices with bandwidth estimation. Ann. Stat. 2002, 40, 1285–1314. [Google Scholar] [CrossRef]
Peng, L.; Chen, S.X.; Zhou, W. More powerful tests for sparse high-dimensional covariances matrices. J. Multivar. Anal. 2016, 149, 124–143. [Google Scholar] [CrossRef]
Zhu, Z.; Kay, S.; Raghavan, R.S. Information-theoretic optimal radar waveform design. IEEE Signal Process. Lett. 2017, 24, 274–278. [Google Scholar] [CrossRef]
Schreier, P.J.; Scharf, L.L. Statistical Signal Processing of Complex-Valued Data: The Theory of Improper and Noncircular Signals; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar]
Adali, T.; Schreier, P.J.; Scharf, L.L. Complex-valued signal processing: The proper way to deal with impropriety. IEEE Trans. Signal Process. 2011, 59, 5101–5125. [Google Scholar] [CrossRef]
Kay, S.; Zhu, Z. The complex parameter Rao test. IEEE Trans. Signal Process. 2016, 94, 6580–6588. [Google Scholar] [CrossRef]
Sun, M.; Liu, W.; Liu, J.; Hao, C. Complex parameter Rao, Wald, Gradient, and Durbin Tests for multichannel signal detection. IEEE Trans. Signal Process. 2021, 70, 117–131. [Google Scholar] [CrossRef]
Liu, W.; Xie, W.; Wang, Y. Rao and Wald tests for distributed targets detection with unknown signal steering. IEEE Signal Process. Lett. 2013, 20, 1086–1089. [Google Scholar]
De Maio, A.; Iommelli, S. Coincidence of the Rao Test, Wald Test, and GLRT in partially homogeneous environment. IEEE Signal Process. Lett. 2008, 15, 385–388. [Google Scholar] [CrossRef]
Kay, S. Fundamentals of Statistical Signal Processing: Estimation Theory; Prentice-Hall: Englewood Cliffs, NJ, USA, 1993. [Google Scholar]
Liu, W.; Wang, Y.; Xie, W. Fisher information matrix, Rao test, and Wald test for complex-valued signals and their applications. Signal Process. 2014, 94, 1–5. [Google Scholar] [CrossRef]
Zhu, Z.; Kay, S.; Cogun, F.; Raghavan, R.S. On detection of nonstationarity in radar signal processing. In Proceedings of the 2016 IEEE Radar Conference (RadarConf), Philadelphia, PA, USA, 2–6 May 2016; pp. 1–4. [Google Scholar]
Zhu, Z.; Kay, S. The Rao test for testing handedness of complex-valued covariance matrix. In Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 20–25 March 2016; pp. 3960–3963. [Google Scholar]
Huang, B.; Basit, A.; Wang, W.; Zhang, S. Adaptive detection with Bayesian framework for FDA-MIMO radar. IEEE Geosci. Remote Sens. Lett. 2021, 19, 3509505. [Google Scholar] [CrossRef]
Kay, S. Fundamentals of Statistical Signal Processing: Detection Theory; Prentice-Hall: Englewood Cliffs, NJ, USA, 1998. [Google Scholar]

Figure 1. Illustration of Monte Carlo Simulation Steps for each simulation setup.

Figure 2. ROC curve of the complex Rao and Wald test detector with

b_{1} = 0.5 + 0.5 j

.

Figure 2. ROC curve of the complex Rao and Wald test detector with

b_{1} = 0.5 + 0.5 j

.

Figure 3. ROC curve of the complex Rao and Wald test detector with

b_{1} = 0.3 + 0.3 j

.

Figure 3. ROC curve of the complex Rao and Wald test detector with

b_{1} = 0.3 + 0.3 j

.

Figure 4. ROC curve of the complex Rao and Wald test detector with

b_{1} = 0.3 + 0.3 j

and

N = 10

.

Figure 4. ROC curve of the complex Rao and Wald test detector with

b_{1} = 0.3 + 0.3 j

and

N = 10

.

Figure 5. Estimated and theoretical PDFs of the test statistic for N = 4.

Figure 6. Complex Rao and Wald test performance with “larger”

D_{B}

.

Figure 6. Complex Rao and Wald test performance with “larger”

D_{B}

.

Figure 7. Complex Rao and Wald test performance with “smaller”

D_{B}

.

Figure 7. Complex Rao and Wald test performance with “smaller”

D_{B}

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhu, Z. Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix. Signals 2024, 5, 1-17. https://doi.org/10.3390/signals5010001

AMA Style

Zhu Z. Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix. Signals. 2024; 5(1):1-17. https://doi.org/10.3390/signals5010001

Chicago/Turabian Style

Zhu, Zhenghan. 2024. "Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix" Signals 5, no. 1: 1-17. https://doi.org/10.3390/signals5010001

Article Menu

Complex Parameter Rao and Wald Tests for Assessing the Bandedness of a Complex-Valued Covariance Matrix

Abstract

1. Introduction

2. Problem Formulation

3. Methods

3.1. The Complex Rao Test for Testing the Bandedness

3.2. Complex Parameter Wald Test

3.2.1. Complex Wald Test for General Cases

3.2.2. Complex Wald Test for Special Fisher Information Matrix

3.2.3. The Complex Wald Test for Testing Bandedness

4. Simulations, Results and Discussion

4.1. Simulations and Result Discussion on Complex Rao and Wald Tests for Bandedness Testing

4.2. Equivalence among Complex Wald, Rao Test and GlRT for Linear Model

4.2.1. Complex Classical Linear Model Testing Problem

4.2.2. Generalized Likelihood Ratio Test (GlRT)

4.2.3. Complex Rao Test

4.2.4. Complex Wald Test

4.2.5. The Root Cause of the Equivalence

5. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI