Article

On the Iterative Methods for the Solution of Three Types of Nonlinear Matrix Equations

1 Faculty of Economics and Business Administration, Sofia University “St. Kl. Ohridski”, 1000 Sofia, Bulgaria
2 College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao 266590, China
* Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2023, 11(21), 4436; https://doi.org/10.3390/math11214436
Submission received: 12 September 2023 / Revised: 8 October 2023 / Accepted: 23 October 2023 / Published: 26 October 2023
(This article belongs to the Special Issue Numerical Analysis and Matrix Computations: Theory and Applications)

Abstract: In this paper, we investigate iterative methods for the solution of different types of nonlinear matrix equations. More specifically, we consider iterative methods for the minimal nonnegative solution of a set of Riccati equations, a nonnegative solution of a quadratic matrix equation, and the maximal positive definite solution of the equation $X + A^* X^{-1} A = Q$. We study recent iterative methods for computing the solutions of these specific types of equations and propose more effective modifications of these iterative methods. In addition, we comment on and compare the existing methods and show the effectiveness of our methods by illustrative examples.

1. Introduction

Nonlinear matrix equations arise in many fields of scientific and engineering computing. Research on the existence and properties of solutions of matrix equations, as well as on the corresponding numerical methods, has important theoretical significance and practical value. In this paper, we focus on iterative methods for the solution of different types of nonlinear matrix equations. More specifically, we consider iterative methods for computing the minimal nonnegative solution of a set of Riccati equations, a nonnegative solution of a quadratic matrix equation, and the maximal positive definite solution of the equation $X + A^* X^{-1} A = Q$. Iterative methods that avoid the calculation of an inverse matrix at each iteration step have gained wide popularity [1,2]; this is an efficient approach that speeds up convergence, although its reliability is not guaranteed. Users of such methods should be aware that some of them may lose accuracy during the calculations and may fail to reach the result. Moreover, we study recent iterative methods for computing the solutions of the above specific types of equations and propose more effective modifications of these iterative methods.
The investigated equations are encountered in various applied problems, for example, in stability analysis [3,4]. Many papers have been published in the field of matrix iterative schemes and their applications; we cite some of them related to our investigation [5,6,7,8].
We will exploit a class of nonnegative matrices for the first two equations. Some notations are used throughout this paper. A matrix is nonnegative if all its entries are greater than or equal to zero. The set of real $r \times n$ matrices is denoted by $\mathbb{R}^{r \times n}$. The notation $I$ or $I_r$ is used for the $r \times r$ identity matrix. We need an elementwise order relation: the inequality $A \ge B$ ($A > B$) for $A = (a_{ij})$, $B = (b_{ij})$ means that $a_{ij} \ge b_{ij}$ ($a_{ij} > b_{ij}$) for all indexes $i$ and $j$. A matrix $A = (a_{ij}) \in \mathbb{R}^{p \times p}$ is said to be a Z-matrix if it has non-positive off-diagonal elements. A Z-matrix $Q$ has the representation $Q = \gamma I - P$, with $P$ being a nonnegative matrix. A Z-matrix $Q$ is an M-matrix if $\gamma \ge \rho(P)$, where $\rho(P)$ is the spectral radius of $P$; it is called a non-singular M-matrix if $\gamma > \rho(P)$, and a singular M-matrix otherwise. For the third equation, we will exploit a class of Hermitian matrices. If the matrix $Q$ is positive definite, we write $Q \succ 0$, and $Q \succeq 0$ if it is positive semidefinite. Accordingly, $P \succeq Q$ means that the matrix $P - Q$ is positive semidefinite.
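As a small illustration (a minimal MATLAB sketch with an arbitrarily chosen Q, not taken from the paper), the non-singular M-matrix property can be checked numerically through the spectral radius of the nonnegative part:
% Minimal MATLAB sketch: test whether a Z-matrix Q is a non-singular M-matrix
% via the representation Q = gamma*I - P with P >= 0.
Q = [ 4 -1  0;
     -2  5 -1;
      0 -3  6];                  % an arbitrarily chosen Z-matrix
n = size(Q, 1);
gamma = max(diag(Q));            % any gamma >= max_i q_ii makes P nonnegative
P = gamma*eye(n) - Q;            % nonnegative by the Z-matrix property
rhoP = max(abs(eig(P)));         % spectral radius of P
isNonsingularM = gamma > rhoP    % true: Q is a non-singular M-matrix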

2. Numerical Methods for the Solution of a Set of Riccati Equations

In this section, we investigate different iterative methods to compute the minimal nonnegative solution of a set of matrix Riccati equations, where the matrix coefficients of each equation are associated with an M-matrix. Our investigation follows the ideas of Bai and coauthors in [9], Ma and Lu [10], Guan and Lu [11], Guan [12], and Ivanov and Yang [13]. In fact, we propose a new modification of the alternate linear implicit method introduced in [14] and modified in [15]. We propose a different iterative method to compute the minimal nonnegative solution and derive convergence properties of the new iteration. We apply some properties of M-matrices in the proof and show that the new iteration method is faster than Newton’s method investigated in [16] by Liu, Zhang, and Luo.
Consider a set of nonsymmetric coupled Riccati equations (SNCRE) associated with M-matrices:
$$\mathcal{M}_i(X_1,\ldots,X_q) := X_i C_i X_i - X_i D_i - A_i X_i + B_i + \sum_{j \ne i} e_{ij} X_j = 0, \quad i = 1,\ldots,q, \qquad (1)$$
which is introduced in [14]. The coefficients of the matrix $X_i$ are $A_i = (a^i_{kp}) \in \mathbb{R}^{m \times m}$, $B_i \in \mathbb{R}^{m \times n}$, $C_i \in \mathbb{R}^{n \times m}$, and $D_i = (d^i_{kp}) \in \mathbb{R}^{n \times n}$. Let $(X_1,\ldots,X_q)$ be a solution of the set of Equations (1) with $X_i \in \mathbb{R}^{m \times n}$, $i = 1,\ldots,q$. The entries of $E = (e_{ij})$ are nonnegative constants.
The tuple of matrices $(\tilde X_1,\ldots,\tilde X_q)$ is the minimal nonnegative solution to (1) if $\tilde X_i \le X_i$, $i = 1,\ldots,q$ (in the elementwise order), for any nonnegative solution $(X_1,\ldots,X_q)$ to (1).
Zhang and Tan [14] have investigated the inexact Newton method and the alternate linear implicit (ALI) method to compute the minimal nonnegative solution of the SNCRE (1) and have proved the convergence properties of these iterations. We define the ALI iterative method with initial matrices $X_i^{(0)} = 0 \in \mathbb{R}^{n \times n}$ ($m = n$). The method uses positive constants $\gamma_i$, $i = 1,\ldots,q$, which are computed via Equation (31) of [14]:
$$\gamma_i = \max\{\max_j a^i_{jj},\ \max_j d^i_{jj}\}. \qquad (2)$$
For $k = 0, 1, 2, \ldots$:
$$Y_i^{(k)} (\gamma_i I_n + D_i - C_i X_i^{(k)}) = (\gamma_i I_n - A_i) X_i^{(k)} + B_i + \sum_{j \ne i} e_{ij} X_j^{(k)},$$
$$(\gamma_i I_n + A_i - Y_i^{(k)} C_i) X_i^{(k+1)} = Y_i^{(k)} (\gamma_i I_n - D_i) + B_i + \sum_{j \ne i} e_{ij} Y_j^{(k)}. \qquad (3)$$
Iteration (3) requires the computation of two matrix inverses at each iteration step. In order to avoid computing inverse matrices at each step, we have proposed a modification in [15]:
$$X_i^{(0)} = 0,\ i = 1,\ldots,q,\qquad k = 0, 1, 2, \ldots:$$
$$Y_i^{(k)} (\gamma_i I_n + D_i) = (\gamma_i I_n - A_i + X_i^{(k)} C_i) X_i^{(k)} + B_i + \sum_{j \ne i} e_{ij} X_j^{(k)},$$
$$(\gamma_i I_n + A_i) X_i^{(k+1)} = Y_i^{(k)} (\gamma_i I_n - D_i + C_i Y_i^{(k)}) + B_i + \sum_{j \ne i} e_{ij} Y_j^{(k)}. \qquad (4)$$
Iteration (4) and its convergence properties are derived in [15]. Here, the inverses of $(\gamma_i I_n + D_i)$ and $(\gamma_i I_n + A_i)$ are computed once, at the beginning of the iterative process. This significantly reduces the computational cost throughout the iteration, which is confirmed by the numerical experiments reported in [15].
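In MATLAB terms, this pattern can be sketched as follows (an illustrative fragment of ours; Ai, Di, and gi stand for $A_i$, $D_i$, and $\gamma_i$):
% Computed once, before the iteration loop:
Li = inv(gi*eye(n) + Di);
Ri = inv(gi*eye(n) + Ai);
% Inside the loop, each half-step of (4) is a multiplication by a stored inverse:
%   Yi = (right-hand side of the first equation of (4))  * Li;
%   Xi = Ri * (right-hand side of the second equation of (4));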

2.1. Newton Method and Its Modifications

Liu, Zhang, and Luo [16] investigated the Newton method for computing the minimal positive solution to the set of Riccati Equations (1):
$$X_i^{(0)} = 0,\ i = 1,\ldots,q,\qquad k = 0, 1, 2, \ldots:$$
$$(A_i - X_i^{(k)} C_i) X_i^{(k+1)} + X_i^{(k+1)} (D_i - C_i X_i^{(k)}) = B_i + \sum_{j \ne i} e_{ij} X_j^{(k)} - X_i^{(k)} C_i X_i^{(k)}. \qquad (5)$$
Together with (5), the following modifications are studied by the same authors:
$$X_i^{(0)} = 0,\ i = 1,\ldots,q,\qquad k = 0, 1, 2, \ldots:$$
$$(A_i - X_i^{(k)} C_i) X_i^{(k+1)} + X_i^{(k+1)} (D_i - C_i X_i^{(k)}) = B_i + \sum_{j < i} e_{ij} X_j^{(k+1)} + \sum_{j > i} e_{ij} X_j^{(k)} - X_i^{(k)} C_i X_i^{(k)}, \qquad (6)$$
and
$$X_i^{(0)} = 0,\ i = 1,\ldots,q,\qquad k = 0, 1, 2, \ldots:$$
$$(A_i - X_i^{(k)} C_i) X_i^{(k+1)} + X_i^{(k+1)} (D_i - C_i X_i^{(k)}) = B_i + \sum_{j < i} e_{ij} \big(\omega X_j^{(k+1)} + (1-\omega) X_j^{(k)}\big) + \sum_{j > i} e_{ij} X_j^{(k)} - X_i^{(k)} C_i X_i^{(k)}. \qquad (7)$$
The convergence proof for (5) is derived by Liu, Zhang, and Luo in [16], whereas iterations (6) and (7) are studied by them only empirically. The idea of using the already computed approximations $X_j^{(k+1)}$, $j < i$, when computing $X_i^{(k+1)}$, as in (6), is an effective one.

2.2. Our New Iteration Scheme and Convergence Proof

Here, we propose the following iteration strategy to compute the minimal nonnegative solution to (1):
$$X_i^{(0)} = 0,\ i = 1,\ldots,q;\quad \gamma_i \text{ as in } (2),\ i = 1,\ldots,q;\quad 0 \le \omega;\quad k = 0, 1, 2, \ldots:$$
$$Y_i^{(k)} (\gamma_i I + D_i) = (\gamma_i I - A_i + X_i^{(k)} C_i) X_i^{(k)} + B_i + \sum_{j < i} e_{ij} \big[\omega Y_j^{(k)} + (1-\omega) X_j^{(k)}\big] + \sum_{j > i} e_{ij} X_j^{(k)},$$
$$(\gamma_i I + A_i) X_i^{(k+1)} = Y_i^{(k)} (\gamma_i I - D_i + C_i Y_i^{(k)}) + B_i + \sum_{j < i} e_{ij} \big[\omega X_j^{(k+1)} + (1-\omega) Y_j^{(k)}\big] + \sum_{j > i} e_{ij} Y_j^{(k)}. \qquad (8)$$
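A minimal MATLAB sketch of one realization of iteration (8) is given below. The sketch and all names in it (the cell arrays A, B, C, D, the coupling matrix E, the parameters omega and kmax) are our own illustrative choices, not code from the cited papers:
% Illustrative MATLAB sketch of iteration (8).
% A, B, C, D are 1-by-q cell arrays of n-by-n coefficients; E = (e_ij);
% omega >= 0 and the iteration limit kmax are chosen by the user.
q = numel(A);  n = size(A{1}, 1);  I = eye(n);
X = cell(1, q);  Y = cell(1, q);  Linv = cell(1, q);  Rinv = cell(1, q);
g = zeros(1, q);
for i = 1:q
    g(i) = max(max(diag(A{i})), max(diag(D{i})));   % gamma_i as in (2)
    Linv{i} = inv(g(i)*I + D{i});                   % stored inverses,
    Rinv{i} = inv(g(i)*I + A{i});                   % computed only once
    X{i} = zeros(n);  Y{i} = zeros(n);
end
for k = 1:kmax
    Xold = X;                                       % X^(k)
    for i = 1:q                                     % first half-step: Y^(k)
        S = zeros(n);
        for j = 1:i-1, S = S + E(i,j)*(omega*Y{j} + (1-omega)*Xold{j}); end
        for j = i+1:q, S = S + E(i,j)*Xold{j}; end
        Y{i} = ((g(i)*I - A{i} + Xold{i}*C{i})*Xold{i} + B{i} + S) * Linv{i};
    end
    for i = 1:q                                     % second half-step: X^(k+1)
        S = zeros(n);
        for j = 1:i-1, S = S + E(i,j)*(omega*X{j} + (1-omega)*Y{j}); end
        for j = i+1:q, S = S + E(i,j)*Y{j}; end
        X{i} = Rinv{i} * (Y{i}*(g(i)*I - D{i} + C{i}*Y{i}) + B{i} + S);
    end
    % a stopping test on the relative residuals RES_i (see Section 2.3) goes here
end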
We derive several matrix identities for the matrices obtained by iteration (8) in the following lemma.
Lemma 1.
The matrix sequences $\{X_i^{(k)}, Y_i^{(k)}\}_{k=0}^{\infty}$ are constructed by iteration (8) with initial values $X_i^{(0)} = 0$, $i = 1,\ldots,q$. The following matrix identities are satisfied for $k = 0, 1, \ldots$:
(i) $(Y_i^{(k)} - X_i^{(k)})(\gamma_i I + D_i) = (X_i^{(k)} - Y_i^{(k-1)})(\gamma_i I - D_i) + X_i^{(k)} C_i (X_i^{(k)} - Y_i^{(k-1)}) + (X_i^{(k)} - Y_i^{(k-1)}) C_i Y_i^{(k-1)} + \sum_{j<i} e_{ij} \big[\omega (X_j^{(k+1)} - X_j^{(k)}) + (1-\omega)(Y_j^{(k)} - X_j^{(k)})\big] + \sum_{j>i} e_{ij} (X_j^{(k)} - Y_j^{(k-1)})$,
(ii) $(\gamma_i I + A_i)(X_i^{(k+1)} - Y_i^{(k)}) = (\gamma_i I - A_i)(Y_i^{(k)} - X_i^{(k)}) + Y_i^{(k)} C_i (Y_i^{(k)} - X_i^{(k)}) + (Y_i^{(k)} - X_i^{(k)}) C_i X_i^{(k)} + \sum_{j<i} e_{ij} \big[\omega (X_j^{(k+1)} - Y_j^{(k)}) + (1-\omega)(Y_j^{(k)} - X_j^{(k)})\big] + \sum_{j>i} e_{ij} (Y_j^{(k)} - X_j^{(k)})$,
where $I$ is the $n \times n$ identity matrix.
Moreover, if $(\tilde X_1,\ldots,\tilde X_q)$ is an exact nonnegative solution of $\mathcal{M}_i(X_1,\ldots,X_q) = 0$, the subsequent identities can be verified:
(iii) $(\tilde X_i - Y_i^{(k)})(\gamma_i I + D_i) = (\gamma_i I - A_i)(\tilde X_i - X_i^{(k)}) + X_i^{(k)} C_i (\tilde X_i - X_i^{(k)}) + (\tilde X_i - X_i^{(k)}) C_i \tilde X_i + \sum_{j<i} e_{ij} \big[\omega (\tilde X_j - Y_j^{(k)}) + (1-\omega)(\tilde X_j - X_j^{(k)})\big] + \sum_{j>i} e_{ij} (\tilde X_j - X_j^{(k)})$,
(iv) $(\gamma_i I + A_i)(\tilde X_i - X_i^{(k+1)}) = (\tilde X_i - Y_i^{(k)})(\gamma_i I - D_i) + (\tilde X_i - Y_i^{(k)}) C_i \tilde X_i + Y_i^{(k)} C_i (\tilde X_i - Y_i^{(k)}) + \sum_{j<i} e_{ij} \big[\omega (\tilde X_j - X_j^{(k+1)}) + (1-\omega)(\tilde X_j - Y_j^{(k)})\big] + \sum_{j>i} e_{ij} (\tilde X_j - Y_j^{(k)})$.
Proof. 
The proof is completed by direct calculations and matrix manipulations. We rewrite Equation (8) for $X_i^{(k)}$ and consider the difference $Y_i^{(k)}(\gamma_i I + D_i) - X_i^{(k)}(\gamma_i I + D_i)$. After some matrix calculations, we obtain the matrix identity (i). Subtracting the matrix equations in (8), we derive (ii).    □
We prove the convergence of the matrix sequence generated by (8).
Theorem 1.
Suppose the matrix coefficients $A_i$, $D_i$ of (1) are Z-matrices and $B_i$, $C_i$ ($i = 1,\ldots,q$) are nonnegative. Then there exist positive scalars $\gamma_i$ such that $(\gamma_i I + A_i)$ and $(\gamma_i I + D_i)$ are nonsingular M-matrices.
If there exists a nonnegative solution to the set of matrix Equations (1), then the matrix sequences $\{X_i^{(k)}, Y_i^{(k)}\}_{k=0}^{\infty}$, $i = 1,\ldots,q$, generated by (8) satisfy the following properties:
(i) $\hat X_i \ge X_i^{(k+1)} \ge Y_i^{(k)} \ge X_i^{(k)}$ for $i = 1,\ldots,q$, $k = 0, 1, \ldots$, for any exact nonnegative solution $(\hat X_1,\ldots,\hat X_q)$ of (1). Moreover, the matrix sequences converge to the minimal nonnegative solution of (1).
(ii) Moreover, if $A_i - \hat X_i C_i$ and $D_i - C_i \hat X_i$, $i = 1,\ldots,q$, are nonsingular M-matrices, then $A_i - \tilde X_i C_i$ and $D_i - C_i \tilde X_i$, $i = 1,\ldots,q$, are nonsingular M-matrices, i.e., the matrices $-A_i + \tilde X_i C_i$ and $-D_i + C_i \tilde X_i$, $i = 1,\ldots,q$, are c-stable.
Proof. 
Under the above assumptions, we have $(\gamma_i I + A_i)^{-1} \ge 0$ and $(\gamma_i I + D_i)^{-1} \ge 0$, $i = 1,\ldots,q$. Apply the recurrence Equations (8) with $X_1^{(0)} = \cdots = X_q^{(0)} = 0$ and $\gamma_i$ computed by (2).
For $Y_1^{(0)}$, we have $Y_1^{(0)}(\gamma_1 I + D_1) = B_1 \ge 0$ and $Y_1^{(0)} = B_1 (\gamma_1 I + D_1)^{-1} \ge 0$. For $Y_2^{(0)}$, we have $Y_2^{(0)}(\gamma_2 I + D_2) = B_2 + e_{21} \omega Y_1^{(0)} \ge 0$. Thus, $Y_2^{(0)} \ge 0$. Therefore, $Y_i^{(0)} \ge 0$ and $Y_i^{(0)} \ge X_i^{(0)} = 0$, $i = 1,\ldots,q$.
Construct the matrix sequences $\{X_i^{(k)}, Y_i^{(k)}\}_{k=0}^{\infty}$, $i = 1,\ldots,q$, by (8) and exploit the facts $\gamma_i I - D_i \ge 0$ and $\gamma_i I - A_i \ge 0$, $i = 1,\ldots,q$.
Assume that the inequalities $X_i^{(p)} \ge Y_i^{(p-1)} \ge X_i^{(p-1)} \ge 0$ hold for some integer $p$.
Next, we prove that $X_i^{(p+1)} \ge Y_i^{(p)} \ge X_i^{(p)} \ge 0$, $i = 1,\ldots,q$.
Taking into account Lemma 1(i), we obtain:
$$Y_i^{(p)} - X_i^{(p)} = F_i^{(p)} (\gamma_i I + D_i)^{-1} \ge 0,$$
because
$$F_i^{(p)} := (X_i^{(p)} - Y_i^{(p-1)})(\gamma_i I - D_i) + X_i^{(p)} C_i (X_i^{(p)} - Y_i^{(p-1)}) + (X_i^{(p)} - Y_i^{(p-1)}) C_i Y_i^{(p-1)} + \sum_{j<i} e_{ij} \big[\omega (X_j^{(p+1)} - X_j^{(p)}) + (1-\omega)(Y_j^{(p)} - X_j^{(p)})\big] + \sum_{j>i} e_{ij} (X_j^{(p)} - Y_j^{(p-1)}) \ge 0.$$
Note that:
$$\omega (X_j^{(p+1)} - X_j^{(p)}) + (1-\omega)(Y_j^{(p)} - X_j^{(p)}) = (Y_j^{(p)} - X_j^{(p)}) + \omega (X_j^{(p+1)} - Y_j^{(p)}) \ge 0.$$
Thus, $\omega (X_j^{(p+1)} - X_j^{(p)}) + (1-\omega)(Y_j^{(p)} - X_j^{(p)}) \ge 0$ for positive $\omega$ and all $j$.
Therefore, $Y_i^{(p)} - X_i^{(p)} \ge 0$, $i = 1,\ldots,q$.
Taking account of Lemma 1(ii), we have:
$$X_i^{(p+1)} - Y_i^{(p)} = (\gamma_i I + A_i)^{-1} G_i^{(p)},$$
where
$$G_i^{(p)} = (\gamma_i I - A_i)(Y_i^{(p)} - X_i^{(p)}) + Y_i^{(p)} C_i (Y_i^{(p)} - X_i^{(p)}) + (Y_i^{(p)} - X_i^{(p)}) C_i X_i^{(p)} + \sum_{j<i} e_{ij} \big[\omega (X_j^{(p+1)} - Y_j^{(p)}) + (1-\omega)(Y_j^{(p)} - X_j^{(p)})\big] + \sum_{j>i} e_{ij} (Y_j^{(p)} - X_j^{(p)}) \ge 0.$$
Thus, $X_i^{(p+1)} - Y_i^{(p)} \ge 0$, $i = 1,\ldots,q$.
We conclude that the matrix sequences $\{X_i^{(k)}, Y_i^{(k)}\}_{k=0}^{\infty}$ are monotone increasing. We now have to prove that they are bounded above. Consider any exact nonnegative solution $(\hat X_1,\ldots,\hat X_q)$ of (1). We shall prove that this solution is an upper bound of the matrix sequences.
For $k = 0$, we have $\hat X_i \ge X_i^{(0)} = 0$. We compute $Y_i^{(0)}$, $i = 1,\ldots,q$, and by Lemma 1(iii):
$$\hat X_i - Y_i^{(0)} = (Q_i^{(0)} + S_i^{(0)}) (\gamma_i I + D_i)^{-1},$$
where
$$Q_i^{(0)} = (\gamma_i I - A_i)(\hat X_i - X_i^{(0)}) + X_i^{(0)} C_i (\hat X_i - X_i^{(0)}) + (\hat X_i - X_i^{(0)}) C_i \hat X_i,$$
and
$$S_i^{(0)} = \sum_{j<i} e_{ij} \big[\omega (\hat X_j - Y_j^{(0)}) + (1-\omega)(\hat X_j - X_j^{(0)})\big] + \sum_{j>i} e_{ij} (\hat X_j - X_j^{(0)}).$$
Note that $Q_i^{(0)} \ge 0$, $i = 1,\ldots,q$.
Moreover, for $i = 1$ we have $\hat X_1 - Y_1^{(0)} \ge 0$, because:
$$S_1^{(0)} = \sum_{j>1} e_{1j} (\hat X_j - X_j^{(0)}) \ge 0.$$
For $i = 2$, we obtain:
$$S_2^{(0)} = e_{21} \big[\omega (\hat X_1 - Y_1^{(0)}) + (1-\omega)(\hat X_1 - X_1^{(0)})\big] + \sum_{j>2} e_{2j} (\hat X_j - X_j^{(0)}) \ge 0.$$
Thus, $\hat X_2 - Y_2^{(0)} \ge 0$. We conclude that $\hat X_j - Y_j^{(0)} \ge 0$, $j = 1,\ldots,q$.
Thus:
$$\hat X_j \ge Y_j^{(0)} \ge X_j^{(0)} \ge 0, \quad j = 1,\ldots,q.$$
We will prove:
$$\hat X_j \ge X_j^{(1)}, \quad j = 1,\ldots,q.$$
From Lemma 1(iv), for $k = 0$, we obtain:
$$\hat X_i - X_i^{(1)} = (\gamma_i I + A_i)^{-1} (GY_i^{(0)} + L_i^{(0)}),$$
where
$$GY_i^{(0)} = (\hat X_i - Y_i^{(0)})(\gamma_i I - D_i) + (\hat X_i - Y_i^{(0)}) C_i \hat X_i + Y_i^{(0)} C_i (\hat X_i - Y_i^{(0)}),$$
which is a nonnegative matrix. For the matrix $L_i^{(0)}$, write:
$$L_i^{(0)} = \sum_{j<i} e_{ij} \big[\omega (\hat X_j - X_j^{(1)}) + (1-\omega)(\hat X_j - Y_j^{(0)})\big] + \sum_{j>i} e_{ij} (\hat X_j - Y_j^{(0)}).$$
For i = 1 , we obtain:
$$L_1^{(0)} = \sum_{j>1} e_{1j} (\hat X_j - Y_j^{(0)}) \ge 0,$$
and thus $\hat X_1 - X_1^{(1)} \ge 0$. For $i = 2$, we write:
$$L_2^{(0)} = e_{21} \big[\omega (\hat X_1 - X_1^{(1)}) + (1-\omega)(\hat X_1 - Y_1^{(0)})\big] + \sum_{j>2} e_{2j} (\hat X_j - Y_j^{(0)}) \ge 0,$$
which leads to $\hat X_2 - X_2^{(1)} \ge 0$.
Consequently, we infer $\hat X_j - X_j^{(1)} \ge 0$, $j = 1,\ldots,q$.
Assume:
$$\hat X_j \ge X_j^{(k)} \ge Y_j^{(k-1)} \ge 0, \quad j = 1,\ldots,q.$$
With similar reasoning, we derive the inequalities:
$$\hat X_j \ge X_j^{(k+1)} \ge Y_j^{(k)} \ge 0, \quad j = 1,\ldots,q.$$
Both matrix sequences are monotone increasing in the elementwise order and bounded above. Hence, they converge to the same limits $(\tilde P_1,\ldots,\tilde P_q)$. Letting $k \to \infty$ in Equations (8), one concludes that $(\tilde P_1,\ldots,\tilde P_q)$ is a nonnegative solution of (1).
Suppose there is another nonnegative solution $(\tilde S_1,\ldots,\tilde S_q)$ with $\tilde S_j \le \tilde P_j$. Since any nonnegative solution is an upper bound of the constructed sequences, we obtain $\tilde S_j \ge \tilde P_j$, which is a contradiction. Therefore, the solution $(\tilde P_1,\ldots,\tilde P_q)$ is the minimal one.
Furthermore, we shall prove point (ii) of the theorem. The matrices $A_i - \hat X_i C_i$ and $D_i - C_i \hat X_i$, $i = 1,\ldots,q$, are nonsingular M-matrices for a nonnegative solution $(\hat X_1,\ldots,\hat X_q)$ of (1), which is an upper bound of the constructed sequences. According to the properties of M-matrices, we conclude that $A_i - \tilde X_i C_i$ and $D_i - C_i \tilde X_i$ are nonsingular M-matrices for $i = 1,\ldots,q$ and, moreover, $-A_i + \tilde X_i C_i$ and $-D_i + C_i \tilde X_i$, $i = 1,\ldots,q$, are c-stable for the minimal nonnegative solution $(\tilde X_1,\ldots,\tilde X_q)$ of (1).    □
Remark 1.
The existence of a nonnegative solution of the set of matrix Equations (1) is discussed in [16]. Two assumptions are necessary in [14], which involve the existence of nonnegative matrices $Z_1,\ldots,Z_q$ such that $\mathcal{M}_i(Z_1,\ldots,Z_q) \le 0$. However, we drop this condition in our investigation. We derive a direct convergence proof for iteration (8) based on Lemma 1.
Remark 2.
We use the parameter $\omega$ with values bigger than 2 in (8) in order to speed up the rate of convergence of (8) compared with the case $\omega = 1$. Denote $W_{j,\omega} = \omega Y_j^{(k)} + (1-\omega) X_j^{(k)}$ and $V_{j,\omega} = \omega X_j^{(k+1)} + (1-\omega) Y_j^{(k)}$. For $\omega > 2$, we have $W_{j,\omega} - W_{j,\omega=1} = \omega Y_j^{(k)} + (1-\omega) X_j^{(k)} - Y_j^{(k)} = (\omega - 1) Y_j^{(k)} + (1-\omega) X_j^{(k)} = (\omega - 1)(Y_j^{(k)} - X_j^{(k)}) \ge 0$. That means $W_{j,\omega>2} \ge W_{j,\omega=1} \ge 0$. Analogously, the inequality $V_{j,\omega>2} \ge V_{j,\omega=1} \ge 0$ holds for all values of $j$. We expect that iteration (8) with $\omega \ge 2$ requires fewer iteration steps than the case $\omega = 1$. We shall track this in the numerical experiments.
The above remarks allow choosing $\omega > 1$ and confirm that this choice preserves the monotonicity of the matrix sequences $\{X_i^{(k)}, Y_i^{(k)}\}_{k=0}^{\infty}$, $i = 1,\ldots,q$.

2.3. Numerical Experiments

We provide numerical experiments for computing the minimal nonnegative solution to (1). We compare the results of iterations (5)–(7) with the results of the proposed new iteration (8). All experiments are performed in MATLAB (version R2018b) on a personal computer. The iterations stop when the current iterative step satisfies $\mathrm{RES}_i \le 10 \times 10^{-12}$, where $\mathrm{RES}_i$ is defined as in [14]:
$$\mathrm{RES}_i := \frac{\|\mathcal{M}_i(X_1^{(k)},\ldots,X_q^{(k)})\|}{\|\mathcal{M}_i(X_1^{(0)},\ldots,X_q^{(0)})\|}, \quad i = 1,\ldots,q.$$
In the experiments, we choose the parameters $\gamma_i$ as defined in (2). We take $X_1^{(0)} = \cdots = X_q^{(0)} = 0$ for all examples and all iterative methods. Thus, $\mathcal{M}_i(X_1^{(0)},\ldots,X_q^{(0)}) = B_i$.
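In MATLAB terms, reusing the cell-array notation of the sketch after iteration (8), the residual $\mathrm{RES}_i$ can be evaluated as follows (an illustrative fragment of ours; the Frobenius norm is our choice, any consistent norm may be used):
% Relative residual of the i-th equation; note that M_i(X^(0)) = B_i.
Ri = X{i}*C{i}*X{i} - X{i}*D{i} - A{i}*X{i} + B{i};
for j = [1:i-1, i+1:q], Ri = Ri + E(i,j)*X{j}; end
RES(i) = norm(Ri, 'fro') / norm(B{i}, 'fro');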
Example 1.
A set of $n \times n$ matrix coefficients for different values of $n$ is tested. The matrices $A_i$, $D_i$, $i = 1, 2, 3$, are introduced following the MATLAB terminology (the off-diagonal entries are non-positive, so the coefficients are Z-matrices):
A1 = zeros(n,n); A2 = zeros(n,n); A3 = zeros(n,n);
for i = 1:n, A1(i,i) = 4; A2(i,i) = 3; A3(i,i) = 2; end
for i = 1:n-1, A1(i,i+1) = -0.5; A1(i+1,i) = -0.03; end
for i = 1:n-2, A1(i,i+2) = -0.25; A1(i+2,i) = -0.9; end
A1(1,n) = -0.05; A1(n,1) = -0.4;
A2 = A1; A2(1,n) = -0.8; A2(n,1) = -0.06;
A3 = A1; A3(1,n) = -0.7; A3(n,1) = -0.09;
Further, $B_1 = B_2 = B_3 = 0.75 I_n$ and $C_1 = C_2 = C_3 = 0.92 I_n$, where $I_n$ is the identity matrix of order $n$, and
$$E = (e_{ij}) = \begin{pmatrix} 0.0661 & 0.4512 & 0.8887 \\ 0.4965 & 0.3156 & 0.8780 \\ 0.6542 & 0.8914 & 0.1947 \end{pmatrix}.$$
The results from the experiments are presented in Table 1. One hundred runs are executed for $n = 12$ and $n = 24$, and ten runs are executed for $n = 48$. In the latter case, iterations (5) and (6) are very slow on the computer used, whereas iteration (8) is the fastest.
Example 2.
A set of $n \times n$ matrix coefficients for different values of $n$ is tested.
The matrices $A_i$, $D_i$, $i = 1,\ldots,4$, are introduced following the MATLAB terminology:
A1 = full(gallery('tridiag', n, 0, 1, -1));
A2 = full(gallery('tridiag', n, 0, 2, -1));
A3 = full(gallery('tridiag', n, 0, 3, -1));
A4 = full(gallery('tridiag', n, 0, 4, -1));
D1 = full(gallery('tridiag', n, 0, 2, -1));
D2 = full(gallery('tridiag', n, 0, 4, -1));
D3 = full(gallery('tridiag', n, 0, 6, -1));
D4 = full(gallery('tridiag', n, 0, 8, -1));
B1 = 0.5*eye(n,n); B2 = B1; B3 = B1; B4 = B1;
C1 = 0.2*eye(n,n); C2 = C1; C3 = C1; C4 = C1;
E = rand(4);
The results from the experiments are presented in Table 2.
The experiments with the above examples show the effectiveness of the proposed iteration Formula (8). Moreover, a higher value of $\omega$ speeds up the convergence.

3. Numerical Methods for the Maximal Solutions of Specific Nonlinear Matrix Equations

Consider the iterative solution of the following nonlinear matrix equations:
$$X + A^* X^{-1} A = Q,$$
$$M Y^2 + N Y + P = 0,$$
investigated in [1,2,17]. Numerical methods for the specific solutions of the above matrix equations (the maximal positive definite and the minimal nonnegative one) are investigated and some families of iterative formulas are proposed in [1,2,17]. Here, we comment on the proposed iteration schemes and provide improvements that accelerate the convergence. In this section, we focus on the problem of how to accelerate the numerical solution of the above nonlinear matrix equations. The main trick in the iterative methods proposed in these publications is to avoid the computation of an inverse matrix at each iteration step.
In general, the matrix $A$ may be a real or complex square matrix. The notation $A^*$ denotes the complex conjugate transpose of $A$.

3.1. Iterative Solution of $X + A^* X^{-1} A = Q$

We first list several known algorithms for computing the maximal solution of $X + A^* X^{-1} A = Q$ and compare their computational behavior.
Algorithm 1 follows iterative Formula (2.2) and the corresponding algorithm from [1].
Algorithm 1 For the matrix equation $X + A^* X^{-1} A = Q$
1:
Introduce the matrix coefficient $A$, $Q = I$, and a small positive number $tol$.
Take $X_0 = Y_0 = I$ (the identity matrix).
2:
$Y_{k+1} = -I + Y_k (3I + X_k - 2 X_k Y_k)$,
$X_{k+1} = I - A^* Y_{k+1} A$,
3:
Stop if $\|X_{k+1} + A^* X_{k+1}^{-1} A - I\| \le tol$. Otherwise, set $k := k + 1$ and go to 2.
end
Algorithm 2 follows iterative Formula (3.3) and the corresponding algorithm from [2].
Algorithm 2 For the matrix equation $X + A^* X^{-1} A = Q$
1:
Introduce the matrix coefficient $A$, $Q = I$, and a small positive number $tol$. Choose $p = 1$, $m = 1$, $q_1 = -1$ in Equation (1.9) of [2].
Take $X_0 = Y_0 = I$ (the identity matrix).
2:
$E_k = X_k Y_k$,
$Y_{k+1} = -\frac{2}{5} I + \frac{12}{5} Y_k + \frac{1}{5}(E_k + E_k^*) - \frac{7}{5} Y_k E_k$,
$X_{k+1} = I - A^* Y_{k+1} A$,
$Res_k = \|X_{k+1} + A^* X_{k+1}^{-1} A - I\|$.
3:
If $Res_k \le tol$, then stop. Otherwise, set $k := k + 1$ and go to 2.
end
In addition, we apply iterative Formula (3) from [18] to compute the same solution; we refer to it here as iteration (9):
$$X_{k+1} = I - A^* X_k^{-1} A, \quad X_0 = \alpha I, \quad 0.5 \le \alpha \le 1, \quad k = 0, 1, \ldots. \qquad (9)$$
Here, we apply Algorithms 1 and 2 and iteration (9) to compute the maximal positive definite solution to $X + A^* X^{-1} A = I$. We use $tol = 10^{-16}$ in the examples. The computations are performed on a computer with an Intel(R) Core(TM) i7-1065G7 CPU @ 1.30 GHz via MATLAB R2018b.
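A direct MATLAB transcription of iteration (9) might read as follows (our own sketch; the variable names and the iteration cap kmax are assumptions):
% Iteration (9): X_{k+1} = I - A'*inv(X_k)*A, starting from X_0 = alpha*I.
alpha = 0.5;  tol = 1e-16;  kmax = 10000;
n = size(A, 1);  I = eye(n);
X = alpha*I;  k = 0;
res = norm(X + A'*(X\A) - I);    % residual of X + A^* X^{-1} A = I
while res > tol && k < kmax
    X = I - A'*(X\A);            % X\A solves X*Z = A, avoiding an explicit inverse
    res = norm(X + A'*(X\A) - I);
    k = k + 1;
end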
Example 3.
Consider Example 3.1 introduced in [1]. The matrix is:
$$A = \frac{1}{40} \begin{pmatrix} 2 & 1 & 3 & 4 \\ 7 & 6 & 5 & 9 \\ 4 & 8 & 10 & 6 \\ 3 & 5 & 2 & 8 \end{pmatrix}.$$
We have executed 100 runs with all algorithms. Algorithm 1 makes 26 iteration steps for 0.0276 s. Algorithm 2 makes 21 iteration steps for 0.0238 s. The computer realization of iteration (9) performs 21 iteration steps for 0.0186 s. The performance results of the three algorithms are comparable and show their applicability.
Example 4.
The example is considered in [2] as Example 4.1.
$$A = \begin{pmatrix} 0.37 & 0.13 & 0.12 \\ 0.30 & 0.34 & 0.12 \\ 0.11 & 0.17 & 0.29 \end{pmatrix}.$$
We have executed 100 runs with all algorithms. Algorithm 1 needs 81 iteration steps for 0.0754 s. Algorithm 2 needs 111 iteration steps for 0.1005 s. Iteration (9) performs 124 iteration steps for 0.0942 s. All three algorithms work effectively for this example, and the computational time is almost the same.
Example 5.
The example is introduced by Guo and Lancaster in [19] with:
$$A = \begin{pmatrix} 0.2 & 0.2 & 0.1 \\ 0.2 & 0.15 & 0.15 \\ 0.1 & 0.15 & 0.25 \end{pmatrix}.$$
We have executed 100 runs with all algorithms using two different values of $tol$. First, we take $tol = 10^{-4}$. Algorithm 1 needs 48 iteration steps to compute the solution, for 0.05 s. Algorithm 2 needs 59 iteration steps for 0.0586 s. Iteration (9) needs only three iteration steps, for 0.0064 s, with $\alpha = 0.5$. Further on, we take $tol = 10^{-8}$. Algorithm 1 needs 4714 iteration steps to compute the solution, for 4.1306 s (for 100 runs). Algorithm 2 needs 5893 iteration steps for 5.7824 s (for 100 runs). However, iteration (9) performs only five iteration steps, for 0.0101 s (for 100 runs), with $\alpha = 0.5$. Thus, iteration (9) is superior to Algorithms 1 and 2 when the maximal solution is computed in this example.
Example 6.
The example is first considered in [20] and is next investigated in [18]. The matrix $A$ is defined as:
$$A = \frac{\tilde A}{2 \|\tilde A\|}, \qquad \tilde A = \begin{pmatrix} 0.1 & 0.15 & 0.2598076 \\ 0.15 & 0.2125 & 0.0649519 \\ 0.2598076 & 0.0649519 & 0.1375 \end{pmatrix}.$$
Algorithms 1 and 2 do not converge for this example. Iteration (9) with $\alpha = 0.5$ converges to the maximal solution after 11 iteration steps for $tol = 10^{-7}$. The maximal solution $\tilde X$ is:
$$\tilde X = \begin{pmatrix} 0.500000082310064 & 0.000000016964994 & 0.000000002309095 \\ 0.000000016964994 & 0.729639588876686 & 0.132582448109853 \\ 0.000000002309095 & 0.132582448109853 & 0.576546597071862 \end{pmatrix}.$$
The results of the experiments in this section show that the iterative method (9) introduced in [18] is effective and comparable to, and in some cases better than, the iterative methods introduced in [1,2]. Iterative method (9) uses an initial approximation depending on the value of $\alpha$; guidance on how to choose $\alpha$ can be found in [18]. Algorithms 1 and 2 avoid the computation of the inverse matrix, but this is not always reliable, as can be seen from the examples discussed in this section. Thus, we have to be careful where an inverse-free algorithm is applied.

3.2. Numerical Method for the Solution of $M Y^2 + N Y + P = 0$

In this section, we study the square matrix equation $M Y^2 + N Y + P = 0$, where $M, N, P$ are real matrix coefficients. Different iterative methods are analyzed in [17], whose authors have investigated a family of iterative methods for finding the minimal nonnegative solution of $M Y^2 + N Y + P = 0$. Their conclusion shows that Algorithms 1 and 6 defined in [17] are able to find the corresponding solution with the given accuracy. We present these two algorithms and propose modifications that make them more effective in computational terms.
We describe Algorithm 1 proposed in [17] as Algorithm  3 here.
Algorithm 3 Algorithm 1 of [17]
1:
Input $n \times n$ matrices $M$, $N$, $P$.
2:
We take $Y_0$ and $\alpha > 0$.
3:
Compute $V_M = \alpha M$ and $W_M = (1 - \alpha) M$.
Note that $M = V_M + W_M$.
4:
Compute $Y_{r+1}$ from
$$(V_M Y_r + N + R) Y_{r+1} = (R - W_M Y_r) Y_r - P.$$
5:
If $\|M Y_r^2 + N Y_r + P\| < tol$, then stop.
Now, we introduce our modifications of the above algorithms. The aim of the modifications is to use a diagonal matrix $W_M = \xi I_n$. Then, the matrix multiplication $W_M Y_r$ can be realized as $\xi Y_r$ in MATLAB. Taking $W_M$ as a diagonal matrix, we preserve the properties of Theorem 2.4 proved by Erfanifar and Hajarian [17]. Thus, the matrix $V_M Y_{r+1} + N$ is an M-matrix, and the matrix sequence $\{Y_r\}$ is monotone increasing and converges to the minimal nonnegative solution. Moreover, applying a diagonal form for the matrix $W_M$, we avoid a matrix–matrix multiplication and replace it with a multiplication of a matrix by a scalar.
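In MATLAB terms, the saving can be expressed as follows (an illustrative fragment of ours; Yr denotes the current iterate and alpha the parameter of Algorithm 4):
% With W_M = -alpha*I_n, the product W_M*Yr collapses to a scalar multiple:
WY = -alpha*Yr;    % O(n^2) work instead of the O(n^3) product WM*Yr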
We compare the results of Algorithms 3 and 4 in Example 7.
Algorithm 4 Our modification of Algorithm 3
1:
Input $n \times n$ matrices $M$, $N$, $P$.
2:
Take $Y_0 = 0$ and $\alpha > 0$, $R = \alpha I_n$.
3:
Compute $V_M = M + R$, $W_M = -R$, and $NN = N + R$.
Note that $M = V_M + W_M$.
4:
Compute $Y_{r+1}$ using the equation
$$(V_M Y_r + N + R) Y_{r+1} = (R - W_M Y_r) Y_r - P.$$
             4.1: Compute im = inv(VM*Y0 + NN). (Remark: Y0 = Y_r.)
             4.2: Compute tQ = (R + alpha*Y0)*Y0 - P.
             4.3: Compute Y0 = im*tQ. (Remark: Y0 = Y_{r+1} here.)
             4.4: If norm((M*Y0 + N)*Y0 + P) <= tol, then stop. Otherwise, r = r + 1 and go to Step 4.1.
5:
The computed solution is Y0.
Example 7
(Example 4.1, [17]). For the $s \times s$ matrix coefficients $M = (m_{ij})$, $P = (p_{ij})$, $N = (n_{ij})$, we have:
$$m_{ii} = 1.5,\ i = 1,\ldots,s; \quad m_{i,i+1} = 8,\ m_{i+1,i} = 5,\ i = 1,\ldots,s-1;$$
$$p_{ii} = 0.5,\ i = 1,\ldots,s; \quad p_{i,i+1} = 0.8,\ p_{i+1,i} = 1.5,\ i = 1,\ldots,s-1;$$
$$n_{ii} = 45,\ i = 1,\ldots,s; \quad n_{i,i+1} = 6,\ n_{i+1,i} = 4,\ i = 1,\ldots,s-1; \quad n_{11} = n_{ss} = 18.$$
Introducing a row vector of size $s$ of units, i.e., $e = (1,\ldots,1)$, we compute $emat = 0.1\, e^T e$ and set $M = M - emat$.
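A direct MATLAB transcription of this setup might read as follows (our own sketch; the helper names and the size s = 10 are assumptions):
% Hypothetical MATLAB construction of the Example 7 coefficients.
s = 10;                                   % any size s
M = 1.5*eye(s) + diag(8*ones(s-1,1), 1) + diag(5*ones(s-1,1), -1);
P = 0.5*eye(s) + diag(0.8*ones(s-1,1), 1) + diag(1.5*ones(s-1,1), -1);
N = 45*eye(s) + diag(6*ones(s-1,1), 1) + diag(4*ones(s-1,1), -1);
N(1,1) = 18;  N(s,s) = 18;
e = ones(1, s);                           % row vector of units
emat = 0.1*(e'*e);                        % s-by-s matrix with all entries 0.1
M = M - emat;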
Based on the matrices $M, N, P$, we compute a nonnegative solution of the quadratic matrix equation $M Y^2 + N Y + P = 0$ with Algorithms 3 and 4, using the stopping criterion with $tol = 10^{-14}$, and compare the numbers of iteration steps (It) and the CPU time for 1000 runs for each value of $s$. The results are listed in Table 3.
Further on, we describe Algorithm 6 introduced in [17], given here as Algorithm 5. Applying the same approach as above, we obtain a modification of Algorithm 5, given here as Algorithm 6.
Algorithm 5 Algorithm 6 of [17]
1:
Input $n \times n$ matrices $M$, $N$, $P$.
2:
We take $Y_0$ and $\alpha > 0$, $\beta > 0$.
3:
Compute $V_M = \alpha M$ and $W_M = (1 - \alpha) M$,
$V_N = \beta N$ and $W_N = (1 - \beta) N$.
Note that $M = V_M + W_M$ and $N = V_N + W_N$.
4:
Compute $Z_r$, $Y_{r+1}$ from
$$(V_M Y_r + V_N + R) Z_r = (R - W_M Y_r - W_N) Y_r - P,$$
$$(W_M Z_r + V_N + S) Y_{r+1} = (S - V_M Z_r - W_N) Z_r - P.$$
5:
If $\|M Y_r^2 + N Y_r + P\| < tol$, then stop. Otherwise, r = r + 1 and go to Step 4.
We have performed experiments with Algorithms 5 and 6 for Example 7. The tolerance is $tol = 10^{-14}$, and 1000 runs are executed for each value of $s$. The results can be found in Table 4.
Algorithm 6 Our modification of Algorithm 5
1:
Input $n \times n$ matrices $M$, $N$, $P$.
2:
We take $Y_0$ and $\alpha > 0$, $\beta > 0$, $R = \alpha I_n$, $S = \beta I_n$.
3:
Compute $V_M = M + R$, $W_M = -R$, $V_N = \beta N$, $W_N = (1 - \beta) N$, and $NN = V_N + R$, $NM = V_N + S$.
Note that $M = V_M + W_M$ and $N = V_N + W_N$.
4:
Compute $Z_r$, $Y_{r+1}$ from the matrix equations:
$$(V_M Y_r + V_N + R) Z_r = (R - W_M Y_r - W_N) Y_r - P,$$
$$(W_M Z_r + V_N + S) Y_{r+1} = (S - V_M Z_r - W_N) Z_r - P.$$
            4.1: Compute im = inv(VM*Y0 + NN). (Remark: Y0 = Y_r.)
            4.2: Compute tQ = (R + alpha*Y0 - WN)*Y0 - P.
            4.3: Compute Z0 = im*tQ. (Remark: Z0 = Z_r here.)
            4.4: Compute im = inv(NM - alpha*Z0).
            4.5: Compute tQ = (S - VM*Z0 - WN)*Z0 - P.
            4.6: Compute Y0 = im*tQ. (Remark: Y0 = Y_{r+1} here.)
            4.7: If norm((M*Y0 + N)*Y0 + P) <= tol, then stop. Otherwise, r = r + 1 and go to Step 4.1.
5:
The computed solution is Y0.
Comparing Table 3 and Table 4, we conclude that Algorithm 5 is faster than Algorithm 3, and Algorithm 6 is faster than Algorithm 4; Algorithm 6 is the fastest of all. The approach of splitting the iteration of Algorithm 3 into two half-steps, as in Algorithm 5, is more effective than the original one.

4. Conclusions

In this paper, we have studied numerical methods for three computational tasks: (a) to compute the minimal nonnegative solution of a set of Riccati equations, (b) to compute the maximal positive definite solution of the equation $X + A^* X^{-1} A = Q$, and (c) to compute the minimal nonnegative solution of the quadratic matrix equation $M Y^2 + N Y + P = 0$. We have considered the existing iterative methods and have proposed improvements that accelerate the convergence process. We have performed several numerical experiments for each task to show the effectiveness of the proposed modifications.
Moreover, regarding the iterative methods for task (b), we note that the inverse-free approach, where the computation of an inverse matrix is avoided, saves computational cost; however, its applicability is limited, as confirmed by the experiments in Section 3.1. In recent years, this approach has been widely used in the analysis of iterative solutions of matrix equations. We will investigate the effectiveness of this approach more deeply in our future work.

Author Contributions

Conceptualization, I.G.I. and H.Y.; methodology, I.G.I. and H.Y.; software, I.G.I. and H.Y.; validation, I.G.I. and H.Y.; formal analysis, I.G.I. and H.Y.; investigation, I.G.I. and H.Y.; supervision, I.G.I. and H.Y.; writing—original draft and improved manuscript, I.G.I. and H.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Acknowledgments

Thanks to the reviewers for their useful comments, remarks, and constructive recommendations, which have increased the value of this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Erfanifar, R.; Sayevand, K.; Esmaeili, H. A novel iterative method for the solution of a nonlinear matrix equation. Appl. Numer. Math. 2020, 153, 503–518. [Google Scholar] [CrossRef]
  2. Erfanifar, R.; Sayevand, K.; Hajarian, M. An efficient inversion-free method for solving the nonlinear matrix equation $X^p + \sum_{j=1}^{m} A_j^* X^{q_j} A_j = Q$. J. Frankl. Inst. 2022, 359, 3071–3089. [Google Scholar]
  3. Yang, Q.; Wang, X.; Cheng, X.; Du, B.; Zhao, Y. Positive periodic solution for neutral-type integral differential equation arising in epidemic model. Mathematics 2023, 11, 2701. [Google Scholar] [CrossRef]
  4. Fan, L.; Zhu, Q.; Zheng, W.X. Stability analysis of switched stochastic nonlinear systems with state-dependent delay. IEEE Trans. Autom. Control 2023, 1–8. [Google Scholar] [CrossRef]
  5. Erfanifar, R.; Sayevand, K.; Hajarian, M. Convergence analysis of Newton method without inversion for solving discrete algebraic Riccati equations. J. Frankl. Inst. 2022, 359, 7540–7561. [Google Scholar] [CrossRef]
  6. Erfanifar, R.; Sayevand, K.; Hajarian, M. Solving system of nonlinear matrix equations over Hermitian positive definite matrices. Linear Multilinear Algebra 2023, 71, 597–630. [Google Scholar] [CrossRef]
  7. Hasanov, V.I.; Ali, A.A. On convergence of three iterative methods for solving the matrix equation $X + A^* X^{-1} A + B^* X^{-1} B = Q$. Comput. Appl. Math. 2017, 36, 79–87. [Google Scholar] [CrossRef]
  8. El-Sayed, S.M.; Ivanov, I.G.; Petkov, M.G. A new modification of the Rojo method for solving symmetric circulant five-diagonal systems of linear equations. Comput. Math. Appl. 1998, 35, 35–44. [Google Scholar] [CrossRef]
  9. Bai, Z.-Z.; Guo, X.-X.; Xu, S.-F. Alternately linearized implicit iteration methods for the minimal nonnegative solutions of the nonsymmetric algebraic Riccati equations. Numer. Linear Algebra Appl. 2006, 13, 655–674. [Google Scholar]
  10. Ma, C.; Lu, H. Numerical Study on Nonsymmetric Algebraic Riccati Equations. Mediterr. J. Math. 2016, 13, 4961–4973. [Google Scholar] [CrossRef]
  11. Guan, J.; Lu, L. New alternately linearized implicit iteration for M-matrix algebraic Riccati equations. J. Math. Study 2017, 50, 54–64. [Google Scholar]
  12. Guan, J. Modified alternately linearized implicit iteration method for M-matrix algebraic Riccati equations. Appl. Math. Comput. 2019, 347, 442–448. [Google Scholar]
  13. Ivanov, I.; Yang, H. An effective approach to solve a nonsymmetric algebraic Riccati equation. Innov. Model. Anal. J. Res. 2021, 6, 7–14. [Google Scholar]
  14. Zhang, J.; Tan, F. Numerical methods for the minimal non-negative solution of the non-symmetric coupled algebraic Riccati equation. Asian J. Control 2021, 23, 374–386. [Google Scholar] [CrossRef]
  15. Ivanov, I. Iterative computing the minimal solution of the coupled nonlinear matrix equations in terms of nonnegative matrices. Ann. Acad. Rom. Sci. Ser. Math. Appl. 2020, 12, 226–237. [Google Scholar] [CrossRef]
  16. Liu, J.; Zhang, J.; Luo, F. Newton’s method for the positive solution of the coupled algebraic Riccati equation applied to automatic control. Comput. Appl. Math. 2020, 39, 113. [Google Scholar] [CrossRef]
  17. Erfanifar, R.; Hajarian, M. Weight splitting iteration methods to solve quadratic nonlinear matrix equation $M Y^2 + N Y + P = 0$. J. Frankl. Inst. 2023, 360, 1904–1928. [Google Scholar]
  18. Ivanov, I.G.; Hasanov, V.I.; Uhlig, F. Improved methods and starting values to solve the matrix equations $X \pm A^* X^{-1} A = I$ iteratively. Math. Comput. 2005, 74, 263–278. [Google Scholar]
  19. Guo, C.-H.; Lancaster, P. Iterative Solution of Two Matrix Equations. Math. Comput. 1999, 68, 1589–1603. [Google Scholar] [CrossRef]
  20. Zhan, X. Computing the extremal positive definite solutions of a matrix equation. SIAM J. Sci. Comput. 1996, 17, 1167–1174. [Google Scholar] [CrossRef]
Table 1. Example 1 with (5)–(8). One hundred runs for n = 12 and n = 24; ten runs for n = 48.

          (5)               (6)               (7), ω = 1.2      (8), ω = 2.5
n         It    CPU         It    CPU         It    CPU         It    CPU
12        34    4.3 s       19    2.57 s      18    2.44 s      25    0.10 s
24        38    128.7 s     21    81.9 s      19    70.7 s      28    0.33 s
48        22    323.0 s     22    281 s       20    220.8 s     33    0.23 s
Table 2. Example 2 for 10 runs with (5)–(8).

          (5)               (6)               (7), ω = 1.2      (8), ω = 2.5
n         It    CPU         It    CPU         It    CPU         It    CPU
12        31    0.63 s      17    0.32 s      14    0.3 s       17    0.03 s
24        32    10.7 s      16    4.62 s      16    4.6 s       19    0.03 s
48        28    406.2 s     25    386.6 s     19    291.9 s     29    0.24 s
96        slow convergence for (5)–(7)                          35    0.98 s
Table 3. Example 7 for 1000 runs with Algorithms 3 and 4.

              Algorithm 3             Algorithm 4
s (α)         It    CPU time (s)      It    CPU time (s)
10 (0.6)      14    0.20              13    0.19
20 (0.6)      14    0.48              13    0.42
30 (0.6)      14    0.80              13    0.74
40 (0.6)      14    1.45              13    1.41
50 (0.6)      14    4.35              13    3.87
60 (0.6)      14    5.56              13    5.25
70 (0.6)      15    8.60              14    8.22
80 (0.7)      no convergence          14    7.52
80 (0.9)      15    8.37              15    8.12
tol = 10^{-13}
90 (0.6)      14    10.51             13    9.43
100 (0.6)     14    14.1              13    13.71
Table 4. Example 7 for 1000 runs with Algorithms 5 and 6.

        Algorithm 5             Algorithm 6             Algorithm 6
        α = 0.8, β = 0.9        α = β = 0.94            α = 0.8, β = 0.95
n       It    CPU time (s)      It    CPU time (s)      It    CPU time (s)
10      7     0.15              6     0.14              6     0.12
20      7     0.34              6     0.31              6     0.28
30      7     0.59              6     0.51              6     0.48
40      7     1.05              6     0.87              6     0.85
50      7     3.32              6     1.92              6     1.82
60      7     3.11              6     2.38              6     2.46
70      7     4.25              6     3.56              6     3.62
tol = 10^{-13}
80      7     5.32              6     4.70              6     4.44
90      7     5.62              6     5.60              6     5.87
100     7     10.26             6     8.30              6     8.24
