Tensor Rank Regularization with Bias Compensation for Millimeter Wave Channel Estimation

He, Fei; Harms, Andrew; Yang, Lamar Yaoqing

doi:10.3390/signals3040040

Open AccessArticle

Tensor Rank Regularization with Bias Compensation for Millimeter Wave Channel Estimation

by

Fei He

,

Andrew Harms

and

Lamar Yaoqing Yang

^*

Department of Electrical and Computer Engineering, University of Nebraska-Lincoln, Omaha, NE 68182, USA

^*

Author to whom correspondence should be addressed.

Signals 2022, 3(4), 664-681; https://doi.org/10.3390/signals3040040

Submission received: 7 August 2022 / Revised: 10 September 2022 / Accepted: 19 September 2022 / Published: 24 September 2022

Download

Browse Figures

Versions Notes

Abstract

:

This paper presents a novel method of tensor rank regularization with bias compensation for channel estimation in a hybrid millimeter wave MIMO-OFDM system. Channel estimation is challenging due to the unknown number of multipath components that determines the channel rank. In general, finding the intrinsic rank of a tensor is a non-deterministic polynomial-time (NP) hard problem. However, by leveraging the sparse characteristics of millimeter wave channels, we propose a modified CANDECOMP/PARAFAC (CP) decomposition-based method that jointly estimates the tensor rank and channel component matrices. Our approach differs from most existing works that assume the number of channel paths is known and the proposed method is able to estimate channel parameters accurately without the prior knowledge of number of multipaths. The objective of this work is to estimate the tensor rank by a novel sparsity-promoting prior that is incorporated into a standard alternating least squares (ALS) function. We introduce a weighting parameter to control the impact of the previous estimate and the tensor rank estimation bias compensation in the regularized ALS. The channel information is then extracted from the estimated component matrices. Simulation results show that the proposed scheme outperforms the baseline l1 strategy in terms of accuracy and robustness. It also shows that this method significantly improves rank estimation success at the expense of slightly more iterations.

Keywords:

tensor rank; sparsity; CP tensor decomposition; channel estimation; millimeter wave; hybrid-MIMO

1. Introduction

Millimeter wave (mmWave) transmission technology is an attractive candidate technology for next-generation wireless communication systems [1]. The main benefit of utilizing mmWave carrier frequencies is the larger spectral bandwidth available for higher data rates facilitating the ever-increasing communication traffic [2]. To compensate for the significant path-loss at such high frequencies, large antenna arrays are common at both the base station (BS) and the mobile station (MS) to provide sufficient beamforming gain [3]. Compared to conventional multiple-input multiple-output (MIMO) systems, the large antenna arrays in mmWave systems make it unlikely to have a dedicated radio frequency (RF) chain for each antenna without costly hardware [3]. A hybrid analog/digital structure uses fewer RF chains to reduce the number of power-consuming devices, such as analog-to-digital converters (ADCs) or digital-to-analog converters (DACs) [3]. To achieve considerable beamforming gain in the precoding stage, accurate channel state information (CSI) is required through channel estimation.

Channel estimation in mmWave hybrid systems is challenging due to the hybrid precoding structure and the large number of antennas. The mmWave channel can be modeled in parametric form based on the path angles, i.e., directions of arrival and departure (DoA/DoD), and the corresponding complex path gains. As a result, mmWave channel estimation is a problem of estimating the path angles and gains rather than estimating the conventional MIMO channel matrix [4,5]. An attractive approach to directly estimate the mmWave channel parameters is to use compressive sensing (CS) based methods [6,7,8,9,10,11,12]. Due to the limited propagation range of mmWave channels, the parametric form of the channel description generally has few active paths. Therefore, these CS-based methods leverage the sparsity of the channel to cast estimation as a sparse recovery problem. The virtual angular representation [6] or angle grids [7] are used to describe the path angles. There is a gain coefficient for each angle grid point that represents a pair of DoA/DoD. Because of the spatial sparsity of the mmWave channel, most of the gain coefficients are negligible. The sparse recovery approach intends to recover the significant gain coefficients providing estimates of the path directions and reducing training overhead. A compressive architecture for estimating and tracking sparse spatial channels in mmWave picocellular networks with large arrays was proposed and investigated in [8]. In [9], it considered multi-user massive MIMO systems and deployed CS-based techniques to reduce the training and feedback overhead in the channel state information at the transmitter estimation. In [10,11], the authors showed that significant reductions in training overhead can be achieved via CS-based methods in mmWave MIMO communication systems. [12] developed a system that uses CS-based estimation on the uplink to configure precoders and combiners for the downlink in a hybrid mmWave MIMO system.

In addition to CS-based methods, tensor-based methods for mmWave MIMO communication have been studied in [13,14,15,16,17,18], which formulate a third order tensor with multiple dimensions containing information about the channel parameters. A two-stage tensor decomposition-based method for mmWave time-varying channel estimation was proposed in [13]. The CP decomposition aided method is used to estimate the DOAs/DODs in the first stage. The estimated angles are then used to estimate the path gains and doppler shifts. A spatial channel covariance estimation method, proposed in [14], is based on higher-order tensor decomposition for the hybrid single-input multiple-output (SIMO) architecture over uplink time-varying frequency-selective channels. The Vandermonde structure of the component matrix of a third-order low-rank tensor model was proposed in [15], which developed a non-iterative structured CP decomposition-aided channel estimation algorithm and showed the advantages of avoiding random initialization and iterations. The authors in [15] also proposed a tensor rank estimation method by applying singular value decomposition (SVD) to a tensor unfolding matrix, which is suitable in a noiseless channel. A CP decomposition-based method for mmWave channel parameter estimation with frequency selectivity in a hybrid orthogonal frequency division multiplexing (OFDM)-MIMO scenario was proposed in [16], which showed advantages over a compressed sensing (CS)-based method. A novel tensor-decomposition channel estimation method for IRS-assisted mmWave OFDM systems was proposed in [17]. By exploiting the inherent sparse structure of the cascade channel, the authors formulate the received signal as a low-rank third-order tensor. Only a very small amount of training overhead is used to obtain a reliable estimate of the cascade channel because of the low rank structure. A sparse Bayes tensor and DOA tracking inspired channel estimation for V2X millimeter wave massive MIMO system was proposed in [18]. This paper dealt with accurate channel estimation in highly mobile scenarios. The sparse Bayes tensor is used to calculate the angle offset. Then a direction of arrival (DOA) tracking method is developed to acquire the angle value at the next moment by exploiting the sparse Bayes tensor.

These methods rely on knowing, or estimating, the rank of the tensors. Tensor rank is analogous to matrix rank, but the properties of matrix and tensor ranks are quite different. It is not clear how to define a nuclear norm surrogate for tensors because singular values defined by the Tucker decomposition are not generally related to the rank of a tensor. Usually, finding the tensor rank is NP-hard [19], and even very simple tensors stubbornly resist rank determination, such as for the

3^{2}

×

3^{2}

×

3^{2}

tensor whose rank is only known to be between 19 and 23 [20]. While the tensor rank is difficult to compute precisely, it is not known whether the rank can be approximated. A regularization term of the CP decomposition factors capturing the tensor’s rank was proposed in [21]. The proposed regularization term relies on an alternative characterization of the nuclear norm based on a low-rank factorization of its matrix argument. A method to simultaneously perform CP decomposition and tensor rank approximation was proposed in [22], which divided a third-order low rank CP tensor into 4 blocks, i.e., 3 component matrices and 1 weighting vector. In this paper, we propose a novel CP decomposition based method to jointly estimate the tensor rank and channel component matrices for the hybrid MIMO architecture. We construct the received signal into a third-order tensor and exploit the low rank characteristics of mmWave channel.

The main contributions of the paper are summarized as follows:

First, we propose a novel CP decomposition-based method to jointly estimate both the tensor rank and component matrices of the received signal tensor. We formulate the received signals into a third-order tensor in the form of the CP structure of a hybrid MIMO-OFDM system. Unlike the conventional tensor signal analysis assumed a-priori knowledge of the rank, we focus on determining the tensor rank which is often unknown in practice. We also develop a novel sparsity-promoting prior to determine tensor rank, and then estimate channel information from low rank component matrix representations.
Second, we analyze the effectiveness of the weighting parameter in our proposed new prior. Based on the numerical results, we show that the proposed scheme outperforms the conventional strategies in [16,21] in terms of estimation accuracy and robustness.
Third, we discuss the trade-off between convergence and rank estimation accuracy for our proposed rank regularization method. Through numerical experiments, we find that our method significantly improves rank estimation success at the expense of slightly more iterations.

The rest of the paper is organized as follows. Section 2 briefly introduces some tensor basics. The system model is discussed in Section 3. In Section 4, we propose a novel CP decomposition-based joint method for mmWave channel estimation. The computational complexity of the proposed method and the conventional CP decomposition method without known rank are analyzed in Section 5. Simulation results are shown in Section 6. Conclusions are provided in Section 7.

Notations: We use the following notations throughout this paper: y is a scalar, y is a vector, Y is a matrix, and

Y

is a tensor.

Y^{T}

,

Y^{*}

,

Y^{H}

, and

Y^{†}

are transpose, conjugate, conjugate transpose, and Moore-Penrose pseudoinverse, respectively.

{[Y]}_{i, :}

and

{[Y]}_{:, j}

are the i-th row and the j-th column of the matrix Y. A⊗B, A⊛B, and A⊙B denote the Kronecker product, the Hadamard product, and the column-wise Khatri-Rao product. a∘b denotes the outer product, which is also known as the tensor product. Let Rank (

Y

) and

k_{Y}

denote the rank and Kruskal-rank of a matrix

Y

, respectively. Let Re(Y) and Im(Y) denote the real part and the imaginary part of Y. d(Y) denotes a vector of diagonal entries of Y, and D(y) denotes a diagonal matrix constructed from y.

2. Tensor Preliminaries

In this section, we briefly introduce the foundations of tensor algebra that will be used in this paper. Readers who are interested in more details about tensors can refer to [23,24].

2.1. Tensor Basics

Simply speaking, a tensor is a multi-dimensional array. The order of a tensor is defined as the number of dimensions of the tensor. Vectors and matrices can be viewed as special cases of tensors with one and two orders, respectively. Let an N-th order tensor be denoted as

Y \in ℂ^{I_{1} \times I_{2} \times I_{3} \dots I_{N}}

with (

i_{1}, i_{2}, \dots, i_{N}

)-th entry

y_{i_{1} \dots i_{N}}

. Fibers are the higher-order analogue of matrix rows and columns. The mode-n fibers of

Y

are defined as

I_{n}

—dimensional vectors obtained by fixing all but one index

i_{n}

. Slices are two-dimensional sections of a tensor, defined by fixing all but two indices. The mode-n unfolding (i.e., matricization) is an operation that transforms a tensor into a matrix. The mode-n unfolding of a tensor

Y

, denoted as

Y_{(n)}

, arranges the mode-n fibers as the columns of the resulting unfolding matrix.

2.2. CP Tensor Decomposition

A tensor

Y \in ℂ^{I_{1} \times I_{2} \times I_{3} \dots I_{N}}

is called a rank-one tensor if it can be written as the outer product of vectors

Y = y^{(1)} \circ y^{(2)} \circ \dots \circ y^{(N)},

(1)

where

y^{(n)} \in ℂ^{I_{n} \times 1}

,

\forall n

.

The canonical polyadic decomposition (CPD), which is also known as CANDECOMP/PARAFAC decomposition, factorizes a tensor into a sum of component rank-one tensors. The CPD of

Y \in ℂ^{I_{1} \times I_{2} \times I_{3} \dots I_{N}}

is defined as

Y = \sum_{r = 1}^{R} y_{r}^{(1)} \circ y_{r}^{(2)} \circ \dots \circ y_{r}^{(N)},

(2)

where

y_{r}^{(n)} \in ℂ^{I_{n} \times 1}

for

r = 1, 2, \dots R

. The minimum achievable value

R

is referred to as the rank of the tensor.

3. Signal Model

We consider a massive MIMO-OFDM system consisting of a base station and multiple mobile stations operating at mmWave frequencies. To facilitate the hardware implementation, hybrid analog/digital architectures are employed by both the BS and the MS, shown in Figure 1. We assume that the BS is equipped with

N_{b s}

antennas and

M_{b s}

RF chains (

N_{b s} \geq M_{b s}

), and each MS is equipped with

N_{m s}

antennas and

M_{m s}

RF chains (

N_{m s} \geq M_{m s}

). The OFDM system occupies

K_{0}

subcarriers for data transmission, among which

K

subcarriers are selected for training purposes. In the downlink scenario, we only need to consider a single user system because the channel estimation is conducted by each mobile user [25]. For the

k

-th subcarrier, the BS utilizes an analog RF precoder

F_{A} \in ℂ^{N_{b s} \times M_{b s}}

. Similarly, the MS employs an analog combiner

W_{A} \in ℂ^{N_{m s} \times M_{m s}}

for analog signal processing. The digital processors follow

F_{A}

and

W_{A}

in BS, and MS performs channel estimation and digital precoding [26]. The elements of

F_{A}

and

W_{A}

are assumed to be random phases with a unit amplitude.

Both the BS and MS employ uniform linear arrays (ULA) with antenna element spacing of half the signal wavelength. The extended Saleh-Valenzuela model [15,16], is adopted to characterize the sparse mmWave channel with

R

multipath between the MS and the BS. The frequency-domain channel matrix

H_{k}

associated with the k-th subcarrier can be obtained as

H_{k} = \sum_{r = 1}^{R} α_{r} e^{- j 2 π \frac{k f_{s}}{K_{0}} τ_{r}} a_{M S} (φ_{r}) a_{B S}^{T} (θ_{r}),

(3)

where

f_{s}

is the system sampling rate,

α_{r}

is the complex gain of the rth path in the frequency domain,

τ_{r}

is the time delay, the DoD—θ_r and the DoA—φ_r.

a_{M S} (φ_{r}) = {[1, e^{j π \sin (φ_{r})}, \dots, e^{j π (N_{m s} - 1) \sin (φ_{r})}]}^{T}

and

a_{B S} (θ_{r}) = {[1, e^{j π \sin (θ_{r})}, \dots, e^{j π (N_{b s} - 1) \sin (θ_{r})}]}^{T}

are the steering vectors of the MS and BS, respectively.

The MS employs

M_{m s}

combining vectors with all the RF chains to simultaneously combine the n-th pilot

s_{n}

precoded by a beamforming vector

f_{n} \in ℂ^{N_{b s}}

. The BS switches beamforming vectors at

M_{b s}

successive time slots, while the MS maintains the combining matrix

W_{A}

. Then, adopting the same strategy of [15], the received signals for the

k

-th subcarrier can be represented as

Y_{k} = W_{A}^{T} H_{k} F_{A} S + N_{k},

(4)

where

Y_{k} \in ℂ^{M_{m s} \times M_{b s}}

,

S ≜ D ({[s_{1}, \dots s_{M_{b s}}]}^{T}) \in ℂ^{M_{b s} \times M_{b s}}

is the transmitted symbol matrix for

M_{b s}

successive time slots, and

N_{k} \in ℂ^{M_{m s} \times M_{b s}}

is the equivalent noise after the combining process. We assume that the pilot matrix is

S = I_{M_{b s}}

for simplicity [15].

Then substituting the frequency domain channel matrix

H_{k}

(3) into

Y_{k}

(4), we obtain

Y_{k} = \sum_{r = 1}^{R} α_{r} e^{- j 2 π \frac{k f_{s}}{K_{0}} τ_{r}} W_{A}^{T} a_{M S} (φ_{r}) a_{B S}^{T} (θ_{r}) F_{A} + N_{k}

= \sum_{r = 1}^{R} α_{r} e^{- j 2 π \frac{k f_{s}}{K_{0}} τ_{r}} {\tilde{a}}_{M S} (φ_{r}) {\tilde{a}}_{B S}^{T} + N_{k},

(5)

where

{\tilde{a}}_{M S} (φ_{r}) ≜ W_{A}^{T} a_{M S} (φ_{r}) \in ℂ^{M_{m s}}

and

{\tilde{a}}_{B S} (θ_{r}) ≜ F_{A}^{T} a_{M S} (θ_{r}) \in ℂ^{M_{b s}}

are the equivalent array response vectors.

By combining

Y_{k}

of

K

selected subcarriers, the received signals can be expressed by a third-order tensor

Y \in ℂ^{M_{m s} \times M_{b s} \times K}

. We notice that each frontal slice

Y_{k}

of the tensor

Y

is a weighted sum of a common set of rank-one outer products, which makes the tensor

Y

admit the definition of the CP model as

Y = \sum_{r = 1}^{R} {\tilde{a}}_{MS} (φ_{r}) \circ {\tilde{a}}_{BS} (θ_{r}) \circ (α_{r} g (τ_{r})) + N,

(6)

where we assume subcarriers

\{1, 2, \dots, K\}

are assigned for the training process for simplicity and

g (τ_{r}) ≜ {[e^{- j 2 π \frac{1 f_{s}}{K_{0}} τ_{r}}, \dots, e^{- j 2 π \frac{K f_{s}}{K_{0}} τ_{r}}]}^{T} \in ℂ^{K}

is the phase vector caused by delay.

N \in ℂ^{M_{m s} \times M_{b s} \times K}

is the equivalent noise tensor.

Due to the sparse nature of mmWave channels, the number of paths

R

is typically small relative to the dimension of the tensor

Y

[15,16]. Therefore,

Y

inherently has a low-rank nature, which ensures that the CP decomposition is unique up to scaling and permutation ambiguities. An estimation of channel parameters {

φ_{r}

,

θ_{r}

,

τ_{r}

,

α_{r}

} can be obtained by processing the CP decomposition results from

Y

. Define

A ≜ [{\tilde{a}}_{M S} (φ_{1}), \dots {\tilde{a}}_{M S} (φ_{R})] \in ℂ^{M_{m s} \times R},

(7)

B ≜ [{\tilde{a}}_{B S} (θ_{1}), \dots {\tilde{a}}_{B S} (θ_{R})] \in ℂ^{M_{b s} \times R},

(8)

C ≜ [α_{1} g (τ_{1}), \dots α_{R} g (τ_{R})] \in ℂ^{K \times R},

(9)

These three component matrices

A

,

B

, and

C

are associated with a noiseless version of

Y

.

4. The Proposed Algorithm

4.1. Joint CP Tensor Decomposition

If the number of active paths, or rank

R

, is known, the CP decomposition of

Y

can be accomplished by a standard alternating least squares (ALS) method [16].

It is likely that the rank

R

is unknown in practice. In this case, the CP decomposition of

Y

can be accomplished by minimizing the cost function

\min_{\hat{A}, \hat{B}, \hat{C}} {‖ Y - \sum_{r = 1}^{\hat{R}} {\hat{a}}_{r} \circ {\hat{b}}_{r} \circ {\hat{c}}_{r} ‖}_{F}^{2}

(10)

where

\hat{R}

is an estimation of the true rank

R

. Instead of estimating tensor rank individually, we seek a more comprehensive CP decomposition-based technique to estimate component matrices and rank jointly. A method of combining CP decomposition and sparsity-promoting prior is presented as an l1 regularization LS method [21] by solving

\min_{\hat{A}, \hat{B}, \hat{C}} {‖ Y - X ‖}_{F}^{2} + λ ({‖ \hat{A} ‖}_{F}^{2} + {‖ \hat{B} ‖}_{F}^{2} + {‖ \hat{C} ‖}_{F}^{2})

(11)

s . t . X = \sum_{r = 1}^{\hat{R}} {\hat{a}}_{r} \circ {\hat{b}}_{r} \circ {\hat{c}}_{r}

where

λ

is a tuning parameter for the rank regularization term. The regularizer relies on an alternative characterization of the nuclear norm based on a low-rank factorization of its matrix argument.

The optimization problem (11) can also be solved by ALS. However, because the rank

R

is unknown, an overestimated CP rank,

\tilde{R}

, must be chosen initially. The objective function (11) can be divided into three sub-problems:

A^{(n + 1)} = \underset{\hat{A}}{argmin} {‖ Y_{(1)} - \hat{A} {(C^{(n)} ⊙ B^{(n)})}^{T} ‖}_{F}^{2} + λ {‖ \hat{A} ‖}_{F}^{2}

(12)

B^{(n + 1)} = \underset{\hat{B}}{argmin} {‖ Y_{(2)} - \hat{B} {(C^{(n)} ⊙ A^{(n + 1)})}^{T} ‖}_{F}^{2} + λ {‖ \hat{B} ‖}_{F}^{2}

(13)

C^{(n + 1)} = \underset{\hat{C}}{argmin} {‖ Y_{(3)} - \hat{C} {(B^{(n + 1)} ⊙ A^{(n + 1)})}^{T} ‖}_{F}^{2} + λ {‖ \hat{C} ‖}_{F}^{2}

(14)

Sub-problems (12)–(14) do not contain sparsity-promoting prior. However, the entire ALS procedure promotes sparse results because the restraint in (11) is incorporated in the three sub-problems.

\hat{R}

can be estimated by removing all negligible rank-one tensor components after convergence. The least squares solutions are derived as follows:

A^{(n + 1)^{T}} = {(({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I))}^{- 1} ({(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T})

(15)

B^{(n + 1)^{T}} = {(({(C^{(n)} ⊙ A^{(n + 1)})}^{H} (C^{(n)} ⊙ A^{(n + 1)}) + λ I))}^{- 1} ({(C^{(n)} ⊙ A^{(n + 1)})}^{H} Y_{(2)}^{T})

(16)

C^{(n + 1)^{T}} = {(({(B^{(n + 1)} ⊙ A^{(n + 1)})}^{H} (B^{(n + 1)} ⊙ A^{(n + 1)}) + λ I))}^{- 1} ({(B^{(n + 1)} ⊙ A^{(n + 1)})}^{H} Y_{(3)}^{T})

(17)

According to [27], (15)–(17) are also special cases of the iteratively regularized Gauss-Newton method with a linear bounded operator and initial guess set as 0. Equations (15)–(17) have corresponding iterative sequences. Consider (15) as an example and transform to its iterative sequence as follows:

A^{(n + 1)^{T}} = A^{(n)^{T}} - {({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I)}^{- 1} (({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I) A^{(n)^{T}} - {(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T})

(18)

From the above iterative sequence, one can easily get back to (15) when (18) approaches the convergence.

Equations (15)–(17) are the well-known ridge regression solutions. However, there is a bias issue according to [28,29]. For example, the bias can be demonstrated by transforming the least squares solution (15) as follows,

{(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) A^{(n + 1)^{T}} + λ I A^{(n + 1)^{T}} = {(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T}

(19)

When the solution converges,

A^{(n)} \to \hat{A}

,

B^{(n)} \to \hat{B}

,

C^{(n)} \to \hat{C}

,

Y_{(1)}^{T} \approx (\hat{C} ⊙ \hat{B}) {\hat{A}}^{T}

(with noise in

Y

). Substitute

\hat{B}

,

\hat{C}

,

Y_{(1)}^{T}

into the right-hand side of (19), we can get

{(\hat{C} ⊙ \hat{B})}^{H} (\hat{C} ⊙ \hat{B}) {\hat{A}}^{T}

. Substitute

\hat{A}

,

\hat{B}

,

\hat{C}

into the left-hand side of (19), we can get

{(\hat{C} ⊙ \hat{B})}^{H} (\hat{C} ⊙ \hat{B}) {\hat{A}}^{T} + λ I {\hat{A}}^{T}

. If we compare two sides, it leads to (19) invalid due to common term

{(\hat{C} ⊙ \hat{B})}^{H} (\hat{C} ⊙ \hat{B}) {\hat{A}}^{T}

exists in both sides. Therefore,

λ I A^{(n + 1)^{T}}

in (19) is called the bias term. It is desirable to compensate for the effect of the bias term by introducing an additional term as the algorithm converges to the desired solution. To compensate the regularization term

λ I A^{(n + 1)^{T}}

we can add to the right-hand side the similar term

λ I A^{(n)^{T}}

which gradually removes the bias as

A^{(n + 1)} \to \hat{A}

, where

\hat{A}

is the estimate of

A

{(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) A^{(n + 1)^{T}} + λ I A^{(n + 1)^{T}} = {(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T} + λ I {(A^{(n)})}^{T}

(20)

After some manipulation we get

A^{(n + 1)^{T}} = {(({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I))}^{- 1} ({(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T} + λ I {(A^{(n)})}^{T})

(21)

Likewise, we can compensate the estimation bias of

{\hat{B}}^{T}

and

{\hat{C}}^{T}

by using

B^{(n + 1)^{T}} = {(({(C^{(n)} ⊙ A^{(n + 1)})}^{H} (C^{(n)} ⊙ A^{(n + 1)}) + λ I))}^{- 1} ({(C^{(n)} ⊙ A^{(n + 1)})}^{H} Y_{(2)}^{T} + λ I {(B^{(n)})}^{T})

(22)

C^{(n + 1)^{T}} = {(({(B^{(n + 1)} ⊙ A^{(n + 1)})}^{H} (B^{(n + 1)} ⊙ A^{(n + 1)}) + λ I))}^{- 1} ({(B^{(n + 1)} ⊙ A^{(n + 1)})}^{H} Y_{(3)}^{T} + λ I {(C^{(n)})}^{T})

(23)

According to [30,31], (21)–(23) are least square solutions of the regularized alternating least squares (RALS) objective function

\min_{\hat{A}, \hat{B}, \hat{C}} {‖ Y - \sum_{r = 1}^{R} {\hat{a}}_{r} \circ {\hat{b}}_{r} \circ {\hat{c}}_{r} ‖}_{F}^{2} + λ ({‖ \hat{A} - A^{(n)} ‖}_{F}^{2} + {‖ \hat{B} - B^{(n)} ‖}_{F}^{2} + {‖ \hat{C} - C^{(n)} ‖}_{F}^{2})

(24)

The second term in (24) penalizes the difference between the current values and previous iterates. However, the limit points of (24) are the critical points of a standard ALS-the first term in (24) without the regularization term. Equation (24) can be divided into three sub-problems:

A^{(n + 1)} = \underset{\hat{A}}{argmin} {‖ Y_{(1)} - \hat{A} {(C^{(n)} ⊙ B^{(n)})}^{T} ‖}_{F}^{2} + λ {‖ \hat{A} - A^{(n)} ‖}_{F}^{2}

(25)

B^{(n + 1)} = \underset{\hat{B}}{argmin} {‖ Y_{(2)} - \hat{B} {(C^{(n)} ⊙ A^{(n + 1)})}^{T} ‖}_{F}^{2} + λ {‖ \hat{B} - B^{(n)} ‖}_{F}^{2}

(26)

C^{(n + 1)} = \underset{\hat{C}}{argmin} {‖ Y_{(3)} - \hat{C} {(B^{(n + 1)} ⊙ A^{(n + 1)})}^{T} ‖}_{F}^{2} + λ {‖ \hat{C} - C^{(n)} ‖}_{F}^{2}

(27)

The iteration approaches (25)–(27) work well for CP tensor decomposition with faster convergence if the rank

R

is known. If the rank

R

is unknown, objective function (24) is incapable of obtaining tensor rank because its solutions are not critical points of the regularized version. According to [32,33], the sub-problems (25)–(27) are variational characterizations of the Levenberg–Marquardt method with linear bounded operator. We need to seek a compromise between estimation bias and sparsity promotion.

4.2. Proposed CP Tensor Decomposition with Weighted Bias

We propose a novel regularized alternating least squares (RALS) with sparsity promoting and weighted bias compensation as follows,

\min_{\hat{A}, \hat{B}, \hat{C}} {‖ Y - \sum_{r = 1}^{\hat{R}} {\hat{a}}_{r} \circ {\hat{b}}_{r} \circ {\hat{c}}_{r} ‖}_{F}^{2} + λ ({‖ \hat{A} - γ A^{(n)} ‖}_{F}^{2} + {‖ \hat{B} - γ B^{(n)} ‖}_{F}^{2} + {‖ \hat{C} - γ C^{(n)} ‖}_{F}^{2})

(28)

where

γ

is the weighting parameter. The objective function (28) can be divided into three sub-problems:

A^{(n + 1)} = \underset{\hat{A}}{argmin} {‖ Y_{(1)} - \hat{A} {(C^{(n)} ⊙ B^{(n)})}^{T} ‖}_{F}^{2} + λ {‖ \hat{A} - γ A^{(n)} ‖}_{F}^{2}

(29)

B^{(n + 1)} = \underset{\hat{B}}{argmin} {‖ Y_{(2)} - \hat{B} {(C^{(n)} ⊙ A^{(n + 1)})}^{T} ‖}_{F}^{2} + λ {‖ \hat{B} - γ B^{(n)} ‖}_{F}^{2}

(30)

C^{(n + 1)} = \underset{\hat{C}}{argmin} {‖ Y_{(3)} - \hat{C} {(B^{(n + 1)} ⊙ A^{(n + 1)})}^{T} ‖}_{F}^{2} + λ {‖ \hat{C} - γ C^{(n)} ‖}_{F}^{2}

(31)

The least square solutions of (29)–(31) as follows

A^{(n + 1)^{T}} = {({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I)}^{- 1} ({(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T} + λ γ I {(A^{(n)})}^{T})

(32)

B^{(n + 1)^{T}} = {(({(C^{(n)} ⊙ A^{(n + 1)})}^{H} (C^{(n)} ⊙ A^{(n + 1)}) + λ I))}^{- 1} ({(C^{(n)} ⊙ A^{(n + 1)})}^{H} Y_{(2)}^{T} + λ γ I {(B^{(n)})}^{T})

(33)

C^{(n + 1)^{T}} = {(({(B^{(n + 1)} ⊙ A^{(n + 1)})}^{H} (B^{(n + 1)} ⊙ A^{(n + 1)}) + λ I))}^{- 1} ({(B^{(n + 1)} ⊙ A^{(n + 1)})}^{H} Y_{(3)}^{T} + λ γ I {(C^{(n)})}^{T})

(34)

To explore the role of the weighting parameter

γ

, we will consider the iteration of matrix A from (32). First, (32) can be transformed to an iterative sequence as follows

A^{(n + 1)^{T}} = A^{(n)^{T}} - {(({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I))}^{- 1} \cdot (({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I (1 - γ)) A^{(n)^{T}} - {(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T})

(35)

When the solution in (35) approaches convergence:

A^{(n + 1)} \to A^{(n)}

, we have

A^{(n + 1)^{T}} = {({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I (1 - γ))}^{- 1} {(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T}

(36)

According to [34], (36) is a modified Levenberg-Marquardt function with variable decay rate when

0 < γ < 1

. We will therefore restrict the range of

γ

to (0, 1). Sub-problems (29)–(31) can be considered modified Levenberg–Marquardt variational characterizations. Compared to

l 1

LS solution (15) which is a Levenberg-Marquardt function, we know that

λ I (1 - γ)

will affect both the direction and step size of each iteration [35]. The scaled identity matrix

λ I (1 - γ)

in the modified Levenberg-Marquardt method in (36) also controls the direction of each iteration to favor low-rank solutions through the modified pseudoinverse calculation

({({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I (1 - γ))}^{- 1} \cdot {(C^{(n)} ⊙ B^{(n)})}^{H}

) of (36). Compared to that of the

l 1

LS solution (15), the added term 1

-

γ

will slightly change the direction of (15) and still favor low rank solutions because we restrict

0 < γ < 1

.

Moreover, compared to the

l 1

LS solution (15) and RALS solution (21), the weighting parameter

γ

in (32) controls the bias compensation. The analyses to

B^{(n + 1)^{T}}

and

C^{(n + 1)^{T}}

are similar.

The estimated component matrices from the novel joint CP tensor decomposition-based method are associated with the weighting parameter. The novel joint CP tensor decomposition-based method is summarized in Algorithm 1.

Algorithm 1. The Proposed Joint CP Tensor Decomposition-Based Estimation Method.

Input

: Observation signal tensor Y \in ℂ^{M_{m s} \times M_{b s} \times K}

, an initial selection

\tilde{R}

for rank(

Y

),
and the regularization parameters

λ

, weighting parameter

γ

, threshold

β

, number of
Iterations iter = 0,
Output: Estimated Rank

\hat{R}

, Estimated Component Matrices

\hat{A}

,

\hat{B}

,

\hat{C}

, iter
1. Derive unfolding matrices:

Y_{(1)}

,

Y_{(2)}

,

Y_{(3)}

2. Generate normally distributed pseudorandom initial matrices

A^{(0)} \in ℂ^{M_{m s} \times \tilde{R}}

,

B^{(0)} \in ℂ^{M_{b s} \times \tilde{R}}

,

C^{(0)} \in ℂ^{K \times \tilde{R}}

3. The initialization of the cost function is

c o s t^{0}

:

‖ Y - ⟦ A^{(0)}, B^{(0)}, C^{(0)} ⟧ ‖_{F}^{2}

4. while

|c o s t^{i t e r + 1} - c o s t^{i t e r}| > ε

do
5. iter = iter + 1
6. Calculate

{\hat{A}}^{T} \leftarrow {({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I)}^{- 1}

\cdot ({(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T} + λ γ I {(A^{(n)})}^{T})

7. Calculate

{\hat{B}}^{T} \leftarrow {(({(C^{(n)} ⊙ A^{(n + 1)})}^{H} (C^{(n)} ⊙ A^{(n + 1)}) + λ I))}^{- 1}

\cdot ({(C^{(n)} ⊙ A^{(n + 1)})}^{H} Y_{(2)}^{T} + λ γ I {(B^{(n)})}^{T})

8. Calculate

{\hat{C}}^{T} \leftarrow {(({(B^{(n + 1)} ⊙ A^{(n + 1)})}^{H} (B^{(n + 1)} ⊙ A^{(n + 1)}) + λ I))}^{- 1}

\cdot ({(B^{(n + 1)} ⊙ A^{(n + 1)})}^{H} Y_{(3)}^{T} + λ γ I {(C^{(n)})}^{T})

9. Recalculate cost in Equation (10)
10. End while
11. Calculate column power of

\hat{C} \in ℂ^{K \times \tilde{R}}

12. Set the number of columns whose power >

β

as

\hat{R}

and construct the new

\hat{C}

by
using these columns
13. Based on the index number obtained from 11, we select the columns from

\hat{A}

and

\hat{B}

to construct new

\hat{A}

and new

\hat{B}

14. Return

\hat{A} \in ℂ^{M_{m s} \times \hat{R}}

,

\hat{B} \in ℂ^{M_{b s} \times \hat{R}}

,

\hat{C} \in ℂ^{K \times \hat{R}}

After obtaining the estimated component matrices:

\hat{A}

,

\hat{B}

and

\hat{C}

, we turn to the channel parameters estimation by using the correlation-based scheme adopted by [16].

5. Computational Complexity Analysis

We analyze the computational complexity of the proposed CP decomposition-based joint estimation method and the baseline l1 regularization LS method.

The major computational task of the proposed joint rank and component matrices estimation method involves solving the three least squares problems (32)–(34) at each iteration. Considering the calculation of

A^{(n + 1)^{T}}

, we have (32). The complexity of calculating

A^{(n + 1)^{T}}

consists of three steps:

(1): To calculate ${({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I)}^{- 1}$ , the complexity is $O (M_{b s} K {\tilde{R}}^{2} + {\tilde{R}}^{3})$ ,
(2): To calculate $({(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T} + λ γ I A^{(n)^{T}})$ , the complexity is $O (M_{m s} M_{b s} K \tilde{R})$ ,
(3): To calculate ${({(C^{(n)} ⊙ B^{(n)})}^{H} (C^{(n)} ⊙ B^{(n)}) + λ I)}^{- 1} ({(C^{(n)} ⊙ B^{(n)})}^{H} Y_{(1)}^{T} + λ γ I A^{(n)^{T}})$ , the complexity is $O (M_{m s} {\tilde{R}}^{2})$ .

Thus, the total number of flops required to compute

A^{(n + 1)^{T}}

is of order

O (M_{m s} M_{b s} K \tilde{R} + M_{b s} K {\tilde{R}}^{2} + {\tilde{R}}^{3} + M_{m s} {\tilde{R}}^{2})

.

The major computational task of l1 regularization LS joint rank and component matrices estimation method involves solving three least squares problems (15)–(17) at each iteration. The complexity of the

A^{(n + 1)^{T}}

calculation in (15) is the same as the computational complexity (32) above because the calculations of matrix sum, matrix transpose, matrix conjugate, and matrix multiplication by a constant can be ignored in the complexity evaluation.

Similarly, we can analyze the complexity of

B^{(n + 1)^{T}}

(33) and

C^{(n + 1)^{T}}

(34) as above.

6. Numerical Experiments

We present simulation results to demonstrate the performance of the proposed method. We consider a scenario where the BS employs a uniform linear array with

N_{b s}

= 64 antennas and RF chains

M_{b s}

= 6, the MS employs a uniform linear array with

N_{m s}

= 32 antennas and RF chains

M_{m s}

= 6. The separation between neighboring antenna elements is assumed to be half the signal wavelength. In our simulations, the mmWave MIMO channel is generated according to the wideband geometric channel model, in which the DoAs and DoDs are randomly distributed in [0, 2π], the delay spread

τ_{r}

for each path is uniformly distributed between 0 and 100 nanoseconds, and the complex gain

α_{r}

is a random variable following a circularly symmetric Gaussian distribution (0, 1/R). We set

f_{c}

= 28 GHz, the total number of subcarriers

K_{0}

= 128, the number of subcarriers selected for training K = 6, and the sampling rate

f_{s}

= 0.32 GHz.

The beamforming matrix

W_{A}

and the combining matrix

F_{A}

are randomly generated with their entries uniformly chosen from a unit circle. Because the sparse scattering nature of mmWave channels, the number of paths R is usually small [15,16]. Thus, we select the number of paths equal to R = 4, 5, and 6 to form a low rank channel scenario in our simulations. We assume the initial rank

\tilde{R}

to be an overestimation of the true rank. To compare the rank estimation performance, the signal-to-noise ratio (SNR) is defined as the ratio of the signal component to the noise component, i.e.,

SNR ≜ \frac{{‖ Y - N ‖}_{F}^{2}}{{‖ N ‖}_{F}^{2}}

Figure 2 shows the estimated ranks versus regularization tuning parameter

λ

in the interval

10^{- 5}

to 1 based on the baseline l1 regularization LS method (11) when R = 4 with the selected SNR values. It is observed that tensor rank can be approximately recovered when

λ

=

10^{- 4}

.

To evaluate the rank estimation performance of the proposed method as a function of weighting parameter

γ

, Figure 3 depicts the performance comparison of rank estimation versus SNR with

λ

=

10^{- 4}

and real rank

R

= 4. It also indicates the proposed method leads to smaller rank estimation error when

γ

is no larger than 0.5 with SNRs between 20 dB and 30 dB than that of the l1 regularization LS method.

The comparison between the proposed method and the l1 regularization LS in terms of robustness is shown in Figure 4. The simulation is based on the true rank R = 4,

λ

=

10^{- 4}

,

γ

= 0.3, SNR = 20 dB. We select the initial rank

\tilde{R}

as some integers between 4 and 32 and calculate the estimated rank with the same stop criterion. The selection of the upper bound of the initial rank is bounded by the characteristics of

C^{(n)} ⊙ B^{(n)}

,

C^{(n)} ⊙ A^{(n + 1)}

and

B^{(n + 1)} ⊙ A^{(n + 1)}

. We make these three Khatri-Rao products as overdetermined matrices to guarantee the ALS-based method has a unique solution. So,

\tilde{R} < M_{b s} K = 36 .

It can be observed from Figure 4 that the proposed method (the top red line) converges to the real rank R = 4 under the different initializations, while l1 regularization LS method only converges at a large initial

\tilde{R}

. Thus, the rank estimation performance of the proposed method is much more robust than that of the l1 regularization LS in terms of initialization.

The performance comparisons between the proposed method and the l1 regularization LS in terms of robustness in success rate is shown in Figure 5. The simulation settings are the same as that of Figure 4. In Figure 5, the success rate is defined as the ratio of the number of the correctly estimated ranks over the total number of trails. This result shows the proposed method can converge to the true rank value under different initial rank selections with a success rate of over 90%.

The performance comparisons between the proposed method and the l1 regularization LS (

γ = 0

) in terms of success rate for different values of

γ

with same initial rank 24 are shown in Figure 6. It shows the proposed method with some values of

γ

can achieve much better rank estimation success rate than that of the baseline l1 regularization method when SNR = 20 dB. For example, the rank estimation success rate is above 98% when

γ

= 0.3.

The plots in Figure 7a–d show the relative error versus the number of iterations converging to the threshold when

γ =

0, 0.3, 0.5, 0.7, respectively. For example, the proposed method takes about 300 iterations to reach the threshold in Figure 7b, compared to the baseline l1 regularization with roughly 200 iterations in Figure 7a. Rank estimation is crucial for channel estimation when the number of multipath is unknown. It is worthy to get a significantly improved rank estimation success rate at the expense of slightly more iterations.

We then turn to the overall channel estimation performance of our proposed scheme, which is measured by the normalized mean squared error: Channel NMSE =

\sum_{k = 1}^{K} {‖ {\hat{H}}_{k} - H_{k} ‖}_{F}^{2} / \sum_{k = 1}^{K} {‖ H_{k} ‖}_{F}^{2}

. Figure 8 depicts the channel NMSE performance versus the system SNR derived by the proposed scheme, which is compared with that of l1 regularization LS method (

γ = 0

). It is observed that some values of

γ

(i.e.,

γ = 0.3

at SNR = 20 dB) give us smaller NMSE or better channel estimation performance.

The robustness of the proposed method for different true rank R in terms of success rate is shown in Table 1. The simulation is based on the true rank R = 4, 5, and 6,

λ

=

10^{- 4}

,

γ

= 0.3, SNR = 20 dB. It can be observed that our proposed method can converge to the true ranks with high probability for different true rank R.

The rank estimation performance versus different

γ

and ranks is shown in Figure 9. In this simulation, we assume the true rank R = 4, 5, and 6,

λ

=

10^{- 4}

, SNR = 20 dB. When

γ = 0

in (28) the proposed algorithm is equivalent to l1 regularization LS (11). It can be observed that the average estimated ranks are very close to the true ranks with weighting parameter

γ

from 0.1 to 0.4. Otherwise, we get large rank estimation errors.

Figure 10 depicts the number of iterations that the proposed method converges for each trail as a function of weighting parameter

γ

with real rank R = 4, 5, and 6, SNR = 20 dB,

λ

=

10^{- 4}

. Our simulation shows fast convergence for the weighting parameter

γ <

0.5. It can be observed that the number of iterations to converge to the real ranks is much smaller than that of

γ >

0.5.

7. Conclusions

In this paper, we presented a novel CP decomposition-based method to jointly estimate both tensor rank and component matrices of the received signal for the mmWave hybrid MIMO-OFDM systems. The proposed method is able to estimate channel parameters accurately without the prior knowledge of number of multipaths. Our approach differs from most existing works that assume the number of channel paths is known, instead, we aimed to determine the tensor rank or the number of multipath in the mmWave channel. We proposed a novel rank regularization method with a weighting parameter that can control the impact of the estimates from the previous iteration and compensate the tensor estimation bias. Compared to the baseline l1 regularization LS method, our proposed method shows that we can significantly increase the rank estimation success by reducing rank estimation bias. Compared to the RALS, our proposed method is capable of promoting sparsity for low rank CP tensor with selected values of the weighting parameter. Numerical experiments show that the proposed method outperforms the conventional l1 strategy in terms of accuracy and robustness in high SNR situation with a marginal cost to computation.

Author Contributions

Conceptualization, F.H., A.H. and L.Y.Y.; formal analysis, F.H., methodology, F.H., A.H. and L.Y.Y.; writing—original draft preparation, F.H.; writing—review and editing, F.H., A.H. and L.Y.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data sharing not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Pi, Z.; Khan, F. An introduction to millimeter-wave mobile broad-band systems. IEEE Commun. Mag. 2011, 49, 101–107. [Google Scholar] [CrossRef]
Rangan, S.; Rappaport, T.S.; Erkip, E. Millimeter-Wave Cellular Wireless Networks: Potentials and Challenges. Proc. IEEE. 2014, 102, 366–385. [Google Scholar] [CrossRef]
Heath, R.W., Jr.; Gonzalez-Prelcic, N.; Rangan, S.; Roh, W.; Sayeed, A.M. An Overview of Signal Processing Techniques for Millimeter Wave MIMO Systems. IEEE J. Sel. Top. Signal Process. 2016, 10, 436–453. [Google Scholar] [CrossRef]
Andrews, J.G.; Bai, T.; Kulkarni, M.N.; Alkhateeb, A.; Gupta, A.K.; Heath, R.W., Jr. Modeling and Analyzing Millimeter Wave Cellular Systems. IEEE Trans. Commun. 2017, 65, 403–430. [Google Scholar] [CrossRef]
Mo, J.; Schniter, P.; Prelcic, N.G.; Heath, R.W. Channel estimation in millimeter wave MIMO systems with one-bit quantization. In Proceedings of the Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 2–5 November 2014; pp. 957–961. [Google Scholar]
Bajwa, W.U.; Haupt, J.; Sayeed, A.M.; Nowak, R. Compressed Channel Sensing: A New Approach to Estimating Sparse Multipath Channels. Proc. IEEE 2010, 98, 1058–1076. [Google Scholar] [CrossRef]
Lee, J.; Gil, G.-T.; Lee, Y.H. Exploiting spatial sparsity for estimating channels of hybrid MIMO systems in millimeter wave communications. In Proceedings of the IEEE Global Communications Conference, Austin, TX, USA, 8–12 December 2014; pp. 3326–3331. [Google Scholar]
Marzi, Z.; Ramasamy, D.; Madhow, U. Compressive Channel Estimation and Tracking for Large Arrays in mm-Wave Picocells. IEEE J. Sel. Top. Signal Process. 2016, 10, 514–527. [Google Scholar] [CrossRef]
Rao, X.; Lau, V.K.N. Distributed Compressive CSIT Estimation and Feedback for FDD Multi-User Massive MIMO Systems. IEEE Trans. Signal Process. 2014, 62, 3261–3271. [Google Scholar]
Alkhateeb, A.; El Ayach, O.; Leus, G.; Heath, R.W. Channel Estimation and Hybrid Precoding for Millimeter Wave Cellular Systems. IEEE J. Sel. Top. Signal Process. 2014, 8, 831–846. [Google Scholar] [CrossRef]
Alkhateeb, A.; Leus, G.; Heath, R.W. Compressed sensing based multi-user millimeter wave systems: How many measurements are needed? In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’15), South Brisbane, QLD, Australia, 19–24 April 2015; pp. 2909–2913. [Google Scholar]
González-Coma, J.P.; Rodriguez-Fernandez, J.; González-Prelcic, N.; Castedo, L.; Heath, R.W. Channel estimation and hybrid precoding for frequency selective multiuser mmWave MIMO systems. IEEE J. Sel. Top. Signal Proc. 2018, 12, 353–367. [Google Scholar] [CrossRef]
Cheng, L.; Yue, G.; Xiong, X.; Liang, Y.; Li, S. Tensor Decomposition-Aided Time-Varying Channel Estimation for Millimeter Wave MIMO Systems. IEEE Wirel. Commun. Lett. 2019, 8, 1216–1219. [Google Scholar] [CrossRef]
Park, S.; Ali, A.; Gonzalez-Prelcic, N.; Heath, R.W. Spatial Channel Covariance Estimation for Hybrid Architectures Based on Tensor Decompositions. IEEE Trans. Wirel. Commun. 2019, 19, 1084–1097. [Google Scholar] [CrossRef] [Green Version]
Lin, Y.; Jin, S.; Matthaiou, M.; You, X. Structured Tensor Decomposition-Based Channel Estimation for Wideband Millimeter Wave MIMO. In Proceedings of the Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 3–6 November 2019; pp. 421–426. [Google Scholar]
Zhou, Z.; Fang, J.; Yang, L.; Li, H.; Chen, Z.; Blum, R.S. Low-Rank Tensor Decomposition-Aided Channel Estimation for Millimeter Wave MIMO-OFDM Systems. IEEE J. Sel. Areas Commun. 2017, 35, 1524–1538. [Google Scholar] [CrossRef]
Zheng, X.; Wang, P.; Fang, J.; Li, H. Compressed Channel Estimation for IRS-Assisted Millimeter Wave OFDM Systems: A Low-Rank Tensor Decomposition-Based Approach. IEEE Wirel. Commun. Lett. 2022, 11, 1258–1262. Available online: https://arxiv.org/pdf/2203.16164.pdf (accessed on 30 March 2022). [CrossRef]
Luo, K.; Zhou, X.; Wang, B.; Huang, J.; Liu, H. Sparse Bayes Tensor and DOA Tracking Inspired Channel Estimation for V2X Millimeter Wave Massive MIMO System. Sensors 2021, 21, 4021. [Google Scholar] [CrossRef]
Håstad, J. Tensor rank is NP-complete. J. Algorithms 1990, 11, 644–654. [Google Scholar] [CrossRef]
Swernofsky, J. Tensor Rank Is Hard to Approximate; Electronic Colloquium on Computational Complexity, Report No. 86; Schloss Dagstuhl—Leibniz Center for Informatics: Wadern, Germany, 2018. [Google Scholar]
Bazerque, J.A.; Mateos, G.; Giannakis, G.B. Rank Regularization and Bayesian Inference for Tensor Completion and Extrapolation. IEEE Trans. Signal Process. 2013, 61, 5689–5703. [Google Scholar] [CrossRef]
Karim, R.G.; Guo, G.; Yan, D.; Navasca, C. Accurate Tensor Decomposition with Simultaneous Rank Approximation for Surveillance Videos. In Proceedings of the Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 3–6 November 2020; pp. 842–846. [Google Scholar]
Kolda, T.; Bader, B.W. Tensor Decompositions and Applications. SIAM Rev. 2009, 51, 455–500. [Google Scholar] [CrossRef]
Sidiropoulos, N.D.; De Lathauwer, L.; Fu, X.; Huang, K.; Papalexakis, E.E.; Faloutsos, C. Tensor Decomposition for Signal Processing and Machine Learning. IEEE Trans. Signal Process. 2017, 65, 3551–3582. [Google Scholar] [CrossRef]
Dai, L.; Wang, Z.; Yang, Z. Spectrally Efficient Time-Frequency Training OFDM for Mobile Large-Scale MIMO Systems. IEEE J. Sel. Areas Commun. 2013, 31, 251–263. [Google Scholar] [CrossRef]
Guo, Z.; Wang, X.; Heng, W. Millimeter-Wave Channel Estimation Based on 2-D Beamspace MUSIC Method. IEEE Trans. Wirel. Commun. 2017, 16, 5384–5394. [Google Scholar] [CrossRef]
Qi-Nian, J. On the iteratively regularized Gauss-Newton method for solving nonlinear ill-posed problems. Math. Comput. 2000, 69, 1603–1624. [Google Scholar] [CrossRef] [Green Version]
Cichocki, A.; Zdunek, R. Regularized Alternating Least Squares Algorithms for Non-negative Matrix/Tensor Factorization. In Proceedings of the 4th International Symposium on Neural Networks, Nanjing, China, 3–7 June 2007. [Google Scholar]
Wang, J.-H.; Hopke, P.K.; Hancewicz, T.M.; Zhang, S.L. Application of modified alternating least squares regression to spectroscopic image analysis. Anal. Chim. Acta 2003, 476, 93–109. [Google Scholar] [CrossRef]
Wang, X.; Navasca, C.; Kindermann, S. On accelerating the regularized alternating least-squares algorithm for tensors. ETNA-Electron. Trans. Numer. Anal. 2018, 48, 1–14. [Google Scholar] [CrossRef]
Li, N.; Kindermann, S.; Navasca, C. Some convergence results on the Regularized Alternating Least-Squares method for tensor decomposition. Linear Algebra Its Appl. 2013, 438, 796–812. [Google Scholar] [CrossRef]
Deuflhard, P.; Engl, H.W.; Scherzer, O. A convergence analysis of iterative methods for the solution of non-linear ill-posed problems under affinely invariant conditions. Inverse Probl. 1998, 14, 1081–1106. [Google Scholar] [CrossRef]
Kaltenbacher, B. Some Newton-type methods for the regularization of nonlinear ill-posed problems. Inverse Probl. 1997, 13, 729–753. [Google Scholar] [CrossRef]
Chen, T.C.; Han, D.J.; Au, F.T.; Tham, L.G. Acceleration of Levenberg-Marquardt Training of Neural Net-works with Variable Decay Rate. In Proceedings of the International Joint Conference on Neural Networks, Portland, OR, USA, 20–24 July 2003; pp. 1873–1878. [Google Scholar]
Madsen, K.; Nielsen, H.B.; Tingleff, O. Methods for Non-Linear Least Squares Problems, 2nd ed.; Technical University of Denmark: Lyngby, Denmark, 2004; pp. 24–26. [Google Scholar]

Figure 1. Architecture of the proposed Hybrid-MIMO system.

Figure 2. Rank estimation performance of the l1 Regularization LS method as a function of

λ

with real rank

R

= 4.

Figure 2. Rank estimation performance of the l1 Regularization LS method as a function of

λ

with real rank

R

= 4.

Figure 3. Comparison of the rank estimation performance of the proposed method as a function of

γ

and the l1 regularization LS versus SNR with

λ

=

10^{- 4}

and real rank

R

= 4.

Figure 3. Comparison of the rank estimation performance of the proposed method as a function of

γ

and the l1 regularization LS versus SNR with

λ

=

10^{- 4}

and real rank

R

= 4.

Figure 4. Robustness of the proposed method regrading different initial rank values with

R

= 4, SNR = 20 dB,

λ

=

10^{- 4}

and

γ

= 0.3.

Figure 4. Robustness of the proposed method regrading different initial rank values with

R

= 4, SNR = 20 dB,

λ

=

10^{- 4}

and

γ

= 0.3.

Figure 5. Robustness of the proposed method in success rate regrading different values of initial rank with

R

= 4, SNR = 20 dB,

λ

=

10^{- 4}

and

γ

= 0.3.

Figure 5. Robustness of the proposed method in success rate regrading different values of initial rank with

R

= 4, SNR = 20 dB,

λ

=

10^{- 4}

and

γ

= 0.3.

Figure 6. Success rate regrading different values of

γ

with

R

= 4, SNR = 20 dB,

λ

=

10^{- 4}

.

Figure 6. Success rate regrading different values of

γ

with

R

= 4, SNR = 20 dB,

λ

=

10^{- 4}

.

Figure 7. Relative error vs. Iteration using the proposed method for different values of

γ

with

R

= 4, SNR = 20 dB,

λ

=

10^{- 4}

. (a)

γ = 0

, (b)

γ = 0.3

, (c)

γ = 0.5

, (d)

γ = 0.7

.

Figure 7. Relative error vs. Iteration using the proposed method for different values of

γ

with

R

= 4, SNR = 20 dB,

λ

=

10^{- 4}

. (a)

γ = 0

, (b)

γ = 0.3

, (c)

γ = 0.5

, (d)

γ = 0.7

.

Figure 8. NMSEs of channel estimation schemes versus the system SNR and values of

γ

with

R

= 4,

λ

=

10^{- 4}

.

Figure 8. NMSEs of channel estimation schemes versus the system SNR and values of

γ

with

R

= 4,

λ

=

10^{- 4}

.

Figure 9. Rank estimation performance of the proposed method as a function of weighting parameter

γ

with real rank

R

= 4, 5, and 6, SNR = 20 dB,

λ

=

10^{- 4}

.

Figure 9. Rank estimation performance of the proposed method as a function of weighting parameter

γ

with real rank

R

= 4, 5, and 6, SNR = 20 dB,

λ

=

10^{- 4}

.

Figure 10. Number of iterations for each trail of the proposed method as a function of weighting parameter

γ

with real rank

R

= 4, 5, and 6, SNR = 20 dB,

λ

=

10^{- 4}

.

Figure 10. Number of iterations for each trail of the proposed method as a function of weighting parameter

γ

with real rank

R

= 4, 5, and 6, SNR = 20 dB,

λ

=

10^{- 4}

.

Table 1. Robustness of the proposed method in success rate with different real rank.

	8	16	24
Real Rank	8	16	24
$R =$ 4	97.4%	97.96%	98.36%
$R =$ 5	98.4%	98.52%	98.6%
$R =$ 6	99.36%	99.47%	99.48%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

He, F.; Harms, A.; Yang, L.Y. Tensor Rank Regularization with Bias Compensation for Millimeter Wave Channel Estimation. Signals 2022, 3, 664-681. https://doi.org/10.3390/signals3040040

AMA Style

He F, Harms A, Yang LY. Tensor Rank Regularization with Bias Compensation for Millimeter Wave Channel Estimation. Signals. 2022; 3(4):664-681. https://doi.org/10.3390/signals3040040

Chicago/Turabian Style

He, Fei, Andrew Harms, and Lamar Yaoqing Yang. 2022. "Tensor Rank Regularization with Bias Compensation for Millimeter Wave Channel Estimation" Signals 3, no. 4: 664-681. https://doi.org/10.3390/signals3040040

Article Menu

Tensor Rank Regularization with Bias Compensation for Millimeter Wave Channel Estimation

Abstract

1. Introduction

2. Tensor Preliminaries

2.1. Tensor Basics

2.2. CP Tensor Decomposition

3. Signal Model

4. The Proposed Algorithm

4.1. Joint CP Tensor Decomposition

4.2. Proposed CP Tensor Decomposition with Weighted Bias

5. Computational Complexity Analysis

6. Numerical Experiments

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI