Article

A Modified Inverse Iteration Method for Computing the Symmetric Tridiagonal Eigenvectors

Wei Chu, Yao Zhao and Hua Yuan
1 School of Naval Architecture and Ocean Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
2 Hubei Key Laboratory of Naval Architecture and Ocean Engineering Hydrodynamics (HUST), Wuhan 430074, China
* Author to whom correspondence should be addressed.
Mathematics 2022, 10(19), 3636; https://doi.org/10.3390/math10193636
Submission received: 26 August 2022 / Revised: 18 September 2022 / Accepted: 29 September 2022 / Published: 5 October 2022
(This article belongs to the Special Issue Computational Methods and Applications for Numerical Analysis)

Abstract: This paper presents a novel method for computing symmetric tridiagonal eigenvectors, which is a modification of the widely used Inverse Iteration method. We construct the corresponding algorithm from a new one-step iteration method, a new reorthogonalization method based on the general Q iteration, and a significant modification for calculating severely clustered eigenvectors. The numerical results show that this method is competitive with other existing methods, especially when computing a subset of the eigenvectors or severely clustered ones.

1. Introduction

Computing the symmetric tridiagonal (ST) eigenvector is an important task in many research fields, such as computational quantum physics [1], mathematics [2,3], dynamics [4], computational quantum chemistry [5], etc. The ST eigenvector problem also arises when solving any symmetric eigenproblem, because it is common practice to reduce a general symmetric eigenproblem to an ST one.
The Divide and Conquer (DC) algorithm [6] has a considerable advantage when calculating all the eigenpairs of an ST matrix. It is quite remarkable that the DC method, which is efficient for parallel computation, can also be faster than other implementations on a serial computer. However, this method supports neither computing a subset of the eigenpairs nor computing eigenvectors only. In practice, it is rare to compute all the eigenvectors of a large ST matrix. The famous QR method [7] has the same limitation, costs more time, and is hard to parallelize. This paper focuses on improving the computation of a subset of the eigenvectors and gives a new method that produces eigenvectors of good accuracy and orthogonality.
Once an accurate eigenvalue approximation is known, the Inverse Iteration method [8] always computes an accurate eigenvector at an acceptable time cost. However, it does not guarantee orthogonality when eigenvalues are close. A commonly used remedy is to reorthogonalize each approximate eigenvector, by the modified Gram–Schmidt method, against the previously computed eigenvectors in the cluster. This remedy adds up to $2n^3$ operations if all the eigenvalues cluster, while the time cost for the eigenvectors themselves is only $O(n^2)$.
Dhillon proposed the Multiple Relatively Robust Representations (MRRR) algorithm [9] to avoid reorthogonalization. This is an ambitious attempt, as the MRRR algorithm computes all the accurate and numerically orthogonal eigenvectors at a time cost of $O(n^2)$. Nevertheless, the MRRR algorithm can fail when calculating a large group of severely clustered eigenvalues, such as those of the glued Wilkinson matrices [10]. Dhillon fixed the problem and modified the MRRR method subtly and cleverly [11], without increasing its time complexity. However, this modified MRRR method, which applies a perturbation to the root representation of the ST matrix, costs even more time than the Inverse Iteration method with the modified Gram–Schmidt process. Even when computing random matrices, the MRRR algorithm has no advantage over the Inverse Iteration method. In addition, when computing a subset of the eigenvectors, the MRRR algorithm needs highly accurate eigenvalues to guarantee natural orthogonality and thus calls the time-consuming Bisection method to obtain them. As a consequence, except for those cases with many clusters of eigenvalues, the Inverse Iteration method is more efficient. More related details are presented in Section 6.
Mastronardi and Van Dooren [12] proposed an ingenious method to determine the accurate eigenvector of a symmetric tridiagonal matrix once an approximation of the eigenvalue is known. In addition, they applied this method to calculate the weights of Gaussian quadrature rules [3].
Our strategy is to improve the Inverse Iteration method with three main modifications:
  • We replace the iteration process with a new one that costs only one step to guarantee convergence, similar to the MRRR method;
  • The envelope vector theory [13] is utilized to compute accurate and naturally orthogonal eigenvectors when the eigenvalues cluster severely. Combined with the new iteration process, the time cost is even less than that of calculating isolated eigenvectors. In other words, the severely clustered eigenvalues accelerate the convergence;
  • We give a new orthogonalization method for generally clustered groups of severely clustered eigenvalues. For k clustered eigenvalues in such a case, the new orthogonalization method decreases the time cost from $O(nk^2)$ to $O(nk)$.
The numerical results confirm our promise of accuracy and orthogonality. In addition, our new method supports computing a subset of the eigenvectors and embarrassingly parallel execution, significantly improving the computational efficiency.
This paper focuses on the symmetric tridiagonal eigenvector problem. According to Weyl's theorem, the real symmetric eigenvalue problem $Ax = \lambda x$ is well posed in an absolute sense, because an eigenvalue can change by no more than the spectral norm of the change in the matrix A [14]. However, for an unsymmetric matrix $\hat{A}$, some of its eigenvalues may be extremely sensitive to uncertainty in the matrix entries. Consequently, the assessment of error becomes a major concern. Some specific conclusions were introduced in [14]. Readers can also find more unsymmetric examples in [15,16].
The organization of the rest of this paper is as follows: Section 2 gives the modified iteration of the new method and an algorithm to compute an isolated eigenvector. Section 3 studies the computation of clustered eigenvectors. Section 4 introduces the general Q iteration and the new orthogonalization method. Section 5 concerns overflow and underflow. Several corresponding pseudocodes are provided in these sections. Section 6 shows some examples and numerical results. Finally, we discuss and assess the Modified Inverse Iteration method in Section 7.

2. Compute Isolated Eigenvectors

2.1. Theoretical Background

Consider an $n \times n$ real unreduced ST matrix A (all the ST matrices discussed in this paper are real and unreduced), which has eigenvalues $\lambda_1 \leq \cdots \leq \lambda_n$ in increasing order and corresponding eigenvectors $v_1, \ldots, v_n$. Once an accurate eigenvalue approximation $u \approx \lambda_j$ is known, we have
$$(A - u I_{n\times n})\,\tilde{v}_j = T\,\tilde{v}_j = 0, \tag{1}$$
where $\tilde{v}_j$ is the eigenvector approximation and $I_{n\times n}$ denotes the $n \times n$ identity matrix.
When u is the exact eigenvalue, T has rank $n-1$ and (1) can be solved by ignoring any one of its n rows. However, since $u \neq \lambda_j$, T is not singular and thus (1) has no nonzero solution. If one still solves (1) by ignoring one of its n rows, say the kth row, the actually solved equation is
$$T z_k = e_k, \tag{2}$$
where $e_k$ is the kth column of $I_{n\times n}$, and $z_k$ denotes the solution obtained when ignoring the kth row. It is obvious that $z_k$ is the kth column of $T^{-1}$. From [10], we have
$$z_k = \frac{r_j v_j}{\lambda_j - u} + \sum_{i \neq j} \frac{r_i v_i}{\lambda_i - u}, \tag{3}$$
where $r_i\ (i \in [1, n])$ is the kth component of $v_i$, which can also be denoted by $v_i(k)$.
The main idea of the Inverse Iteration is to solve (2), substitute the result into the right side, and repeat. As $u \approx \lambda_j$, $z_k$ will finally approach $v_j$. If $\lambda_j$ is an isolated eigenvalue, (3) shows that how well $z_k$ approximates $v_j$ depends on the absolute value of $v_j(k)$. For example, if $|v_j(k)|$ is close to zero, $z_k$ contains nearly no component of $v_j$; as a consequence, the iterations hardly converge. Therefore, the traditional Inverse Iteration method uses a vector with all components equal to 1 as the original right side of (1). Within about two or three steps, the traditional Inverse Iteration method computes an accurate eigenvector approximation $\tilde{v}_j$.
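For readers who want to experiment, the following is a minimal MATLAB sketch of the traditional Inverse Iteration just described. The function name and the use of a sparse backslash solve are our illustrative choices, not the paper's implementation; MATLAB may warn that the shifted matrix is close to singular, which is expected here.

```matlab
function v = inverse_iteration_sketch(a, b, u, nsteps)
% a: diagonal (n x 1), b: sub-diagonal ((n-1) x 1), u: eigenvalue estimate.
n = numel(a);
T = spdiags([[b; 0], a - u, [0; b]], -1:1, n, n);  % T = A - u*I
v = ones(n, 1) / sqrt(n);        % all-ones starting direction
for it = 1:nsteps                % typically two or three steps suffice
    v = T \ v;                   % one inverse iteration step
    v = v / norm(v);             % renormalize to avoid overflow
end
end
```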

2.2. One-Step Iteration

To accelerate the iteration process, our task is to find the biggest $|v_j(k)|\ (k \in [1, n])$ and to guarantee convergence in one step. From [9], we have
$$\frac{1}{\gamma_k} = e_k^T (A - uI)^{-1} e_k = \frac{v_j(k)^2}{\lambda_j - u} + \sum_{i \neq j} \frac{v_i(k)^2}{\lambda_i - u}, \tag{4}$$
where $1/\gamma_k$ is the kth diagonal component of $(A - uI)^{-1}$, i.e., the kth component of $z_k$, and its absolute value reflects $|v_j(k)|$ (recall $u \approx \lambda_j$). The MRRR method finds the smallest $|\gamma_k|$ by the twisted triangular factorization, while we give a new method in this section.
We denote the ith sequential principal minor of an ST matrix A by $A_{1:i}$. The submatrix of A in rows i through j is denoted by $A_{i:j}$ and its determinant by $\det(A_{i:j})$. We denote the characteristic polynomial $\det(A - uI)$ by $C_{1:n}$, $C_{1:n}(u)$, or $C_{1:n}^{A}(u)$ if necessary. $a_i$ and $b_i$ denote the ith component on the diagonal and sub-diagonal of A, respectively. According to [17], we have
$$z_k = \frac{1}{C_{1:n}}\begin{pmatrix}
C_{k+1:n}\prod_{t=1}^{k-1}(-b_t) \\
C_{1:1}\,C_{k+1:n}\prod_{t=2}^{k-1}(-b_t) \\
\vdots \\
C_{1:k-2}\,C_{k+1:n}\,(-b_{k-1}) \\
C_{1:k-1}\,C_{k+1:n} \\
C_{1:k-1}\,C_{k+2:n}\,(-b_k) \\
\vdots \\
C_{1:k-1}\,C_{n:n}\prod_{t=k}^{n-2}(-b_t) \\
C_{1:k-1}\prod_{t=k}^{n-1}(-b_t)
\end{pmatrix} \tag{5}$$
and
$$C_{1:n} = \det(A - uI) = -b_{k-1}^2\,C_{1:k-2}\,C_{k+1:n} + (a_k - u)\,C_{1:k-1}\,C_{k+1:n} - b_k^2\,C_{1:k-1}\,C_{k+2:n} = C_{1:k-1}\,C_{k+1:n}\left(C_{1:k}/C_{1:k-1} - b_k^2\,C_{k+2:n}/C_{k+1:n}\right). \tag{6}$$
Remark 1.
(5) is also introduced in [9], but in an incorrect form that misses the negative sign before each $b_i$. Dhillon worried about overflow and underflow issues when calculating $z_k$ by (5) and thus did not discuss it further. This paper gives a more practical form of (5), reduces its computational cost and solves the overflow and underflow problem (in Section 5).
By (5) and (6), we have
$$\gamma_k = q_k - b_k^2 / p_{n-k}, \tag{7}$$
where $q_i = C_{1:i}/C_{1:i-1}$ and $p_i = C_{n-i+1:n}/C_{n-i+2:n}$. As the sequential principal minors of an ST matrix form a Sturm sequence, we have [18]
$$q_0 = 1,\quad q_1 = a_1 - u,\quad q_i = a_i - u - b_{i-1}^2/q_{i-1}; \qquad p_0 = 1,\quad p_1 = a_n - u,\quad p_i = a_{n+1-i} - u - b_{n+1-i}^2/p_{i-1}. \tag{8}$$
By (5) and (8), $z_k$ can be expressed as
$$z_k = x_1\alpha + x_2\beta = \alpha \begin{pmatrix}
1 \\ q_1/(-b_1) \\ q_1 q_2/\big((-b_1)(-b_2)\big) \\ \vdots \\ \prod_{i=1}^{k-1} q_i/(-b_i) \\ 0 \\ \vdots \\ 0
\end{pmatrix} + \beta \begin{pmatrix}
0 \\ \vdots \\ 0 \\ \prod_{i=1}^{n-k-1} p_i/(-b_{n-i}) \\ \vdots \\ p_1 p_2/\big((-b_{n-1})(-b_{n-2})\big) \\ p_1/(-b_{n-1}) \\ 1
\end{pmatrix}, \tag{9}$$
where $x_1$ and $x_2$ are both $n \times 1$ vectors, the $(k+1)$th to nth components of $x_1$ are zeros, and the 1st to kth components of $x_2$ are zeros. $\alpha$ and $\beta$ are two coefficients to be determined.
It can be seen that (9) satisfies (2) except for the kth and $(k+1)$th rows. As we only care about the direction of $z_k$, only the $(k+1)$th row needs to be considered when determining $\alpha$ and $\beta$. Then, we have
$$\alpha\, b_k \prod_{i=1}^{k-1}\frac{q_i}{-b_i} + \beta\left[(a_{k+1} - u)\prod_{i=1}^{n-k-1}\frac{p_i}{-b_{n-i}} + b_{k+1}\prod_{i=1}^{n-k-2}\frac{p_i}{-b_{n-i}}\right] = 0.$$
Therefore, our scheme is to calculate the $q_i$'s and $p_i$'s by (8) first, and then find the smallest $|\gamma_k|$ by (7). Note that (7) does not cost extra division operations if we save the $b_i^2/p_{n-i}$'s when calculating the $p_i$'s by (8). Finally, we choose the k corresponding to the smallest $|\gamma_k|$ and obtain $z_k$ by (9). Our modified iteration method for calculating one isolated eigenvector is shown in Algorithm 1.
If $b_i^2$ and $1/b_i$ are calculated and stored in advance, Algorithm 1 costs $8n$ to $8.5n$ operations per eigenvector (note that the cost of calculating $a_i - u$ is shared in step 3 of Algorithm 1), while the version in [9] costs $11n$.
Note that (8) computes p and q with no time cost savings per se. The two main contributors are: first, (7) reduces the cost of searching for $\min|\gamma_k|$; second, (9) divides the eigenvector computation into two parts, and even under the most adverse condition of $k = n/2$, (9) can still reduce the multiplication operations by half compared to (5).
Algorithm 1: Compute one isolated eigenvector.
[pseudocode figure]
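Algorithm 1 itself is given as a figure in the original article. As a complement, the following MATLAB sketch implements the scheme just described — the Sturm sequences of (8), the search for the smallest $|\gamma_k|$ via (7), and the assembly of $z_k$ from (9). It is our illustrative rendering, not the authors' code, and it omits the overflow and underflow safeguards of Section 5.

```matlab
function [v, k] = onestep_eigvec_sketch(a, b, u)
% a: diagonal (n x 1), b: sub-diagonal ((n-1) x 1), u: accurate eigenvalue estimate.
n = numel(a);
q = zeros(n,1);  p = zeros(n,1);
q(1) = a(1) - u;
for i = 2:n
    q(i) = a(i) - u - b(i-1)^2 / q(i-1);          % Sturm sequence (8)
end
p(1) = a(n) - u;
for i = 2:n
    p(i) = a(n+1-i) - u - b(n+1-i)^2 / p(i-1);
end
gamma = q;                                         % gamma_n = q_n
gamma(1:n-1) = q(1:n-1) - (b.^2) ./ p(n-1:-1:1);   % gamma_k = q_k - b_k^2/p_{n-k}, (7)
[~, k] = min(abs(gamma));
x1 = zeros(n,1);  x2 = zeros(n,1);                 % the two halves of (9)
x1(1) = 1;
for i = 2:k
    x1(i) = x1(i-1) * q(i-1) / (-b(i-1));
end
x2(n) = 1;
for i = n-1:-1:k+1
    x2(i) = x2(i+1) * p(n-i) / (-b(i));
end
if k == n                                          % only the first half is needed
    v = x1;
else                                               % choose alpha, beta from the (k+1)th row
    rhs = (a(k+1) - u) * x2(k+1);
    if k + 1 < n
        rhs = rhs + b(k+1) * x2(k+2);
    end
    alpha = -rhs / (b(k) * x1(k));
    v = alpha * x1 + x2;
end
v = v / norm(v);
end
```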

2.3. Accuracy Analysis of Algorithm 1

Let R denote the residual norm, i.e., $R_k = \|T z_k\| / \|z_k\|$; then we have
$$R_k = \frac{\|T z_k\|}{\|z_k\|} = \frac{|\gamma_k|}{\|z_k\|} = \frac{|\gamma_k|}{\sqrt{\gamma_k^2\, e_k^T (A-uI)^{-1}(A-uI)^{-1} e_k}} = \left(\sum_i \frac{v_i^2(k)}{(\lambda_i - u)^2}\right)^{-1/2} = \frac{|\lambda_j - u|}{|v_j(k)|}\left(1 + \sum_{i \neq j}\frac{(\lambda_j - u)^2}{(\lambda_i - u)^2}\,\frac{v_i^2(k)}{v_j^2(k)}\right)^{-1/2} \leq \frac{|\lambda_j - u|}{|v_j(k)|}. \tag{10}$$
As Algorithm 1 ensures that $|v_j(k)|$ is the biggest among all the $|v_j(i)|\ (i \in [1, n])$, it is guaranteed that $|v_j(k)| \geq 1/\sqrt{n}$. Then, according to (10), we have $R_k \leq \sqrt{n}\,\epsilon$, where $\epsilon$ is the machine precision.

3. Computing Severely Clustered Eigenvectors

Now consider the case when the eigenvalues cluster severely, for example, p eigenvalues that are equal in finite precision arithmetic. We will define "severe clustering" later in this section.
First, we introduce the two following lemmas from [13] to state our theorems.
Lemma 1
(The Envelope Vector). Define $S = \mathrm{span}\{v_1, v_2, \ldots, v_p\}$; the envelope vector of S is E, given by
$$E_i = \max\{|V_i| : V \in S,\ \|V\| = 1\}.$$
For p clustered eigenvalues, the envelope vector will undulate with p high hills separated by $p-1$ low valleys.
Lemma 2.
For an ST matrix A that has p clustered eigenvalues $\lambda_1 \leq \cdots \leq \lambda_p$, divide A into p submatrices: $A_{1:\eta_1}$, $A_{\eta_2^l:\eta_2^r}$, …, $A_{\eta_{p-1}^l:\eta_{p-1}^r}$ and $A_{\eta_p:n}$. Note that these submatrices can have overlaps. Then, for each submatrix, there exists at least one $A_{sub}$, among all the possible divisions, that satisfies:
1. $A_{sub}$ has an isolated sub-eigenvalue $\kappa \in [\lambda_1, \lambda_p]$;
2. For the 2nd to $(p-1)$th submatrices, the corresponding sub-eigenvector $s_i\ (i \in [2, p-1])$ (with respect to κ) has small components at both its ends. For $A_{1:\eta_1}$, $s_1(\eta_1) \approx 0$, and for $A_{\eta_p:n}$, $s_p(1) \approx 0$.
Supplement zero components to obtain $\tilde{v}_s = [s; 0]$, $[0; s; 0]$, or $[0; s]$, which has size $n \times 1$. Then, the p $\tilde{v}_s$'s are approximations to $v_t\ (t \in [1, p])$. These eigenvector approximations are numerically orthogonal and satisfy $\|T \tilde{v}_s\| < \sqrt{n/p}\,(\lambda_p - \lambda_1)/p$.
See the proofs and more details in [13].
Let us take a typical example of clustered eigenvalues to illustrate. Let $\alpha_0$ be a $200 \times 1$ vector with $\alpha_0(i) = i\ (i \in [1, 200])$, and construct $\alpha \leftarrow [\mathrm{flip}(\alpha_0); 0; \alpha_0]$. Then, repeat $\alpha \leftarrow [\alpha; \alpha_0]$ eight times in total. Finally, we obtain a $2001 \times 1$ vector α. Consider an ST matrix Φ whose diagonal equals α and whose sub-diagonal components all equal 1. Φ is similar to the glued Wilkinson matrices in [11], and its biggest eight eigenvalues ($\lambda_1, \ldots, \lambda_8$) cluster severely. Let $u_1, \ldots, u_8$ denote the approximations of the biggest eight eigenvalues of Φ; Matlab gives $u_8 - u_1 = 0$, i.e., $\lambda_1, \ldots, \lambda_8$ cluster severely.
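For concreteness, the matrix Φ described above can be built in MATLAB as follows (a sketch with our own variable names; the call to eig on the dense matrix is only to illustrate the clustering the paper reports).

```matlab
alpha0 = (1:200)';                      % alpha_0(i) = i
alpha  = [flip(alpha0); 0; alpha0];     % 401 x 1
for r = 1:8                             % append alpha_0 eight times in total
    alpha = [alpha; alpha0];            %#ok<AGROW>
end                                     % alpha is now 2001 x 1
n   = numel(alpha);
Phi = full(spdiags([ones(n,1), alpha, ones(n,1)], -1:1, n, n));
lam = sort(eig(Phi));                   % dense solver, for illustration only
disp(lam(end) - lam(end-7))             % the largest eight eigenvalues agree in double precision
```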
Let $u = u_1$ and calculate $|\gamma_k|\ (k \in [1, 2001])$ of Φ. According to Lemma 1, the low-valley entries of the envelope vector correspond to small components of $v_i\ (i \in [1, p])$. Note that this means all p eigenvectors have small components at such an entry; thus, the corresponding $|\gamma_k|$ must be a big value according to (4). The case of the high hills is similar. In other words, the $|\gamma_k|$ curve undulates with p low valleys separated by $p-1$ high hills. Note that these extreme points may not coincide exactly with those of the envelope vector. We show $|\gamma_k|\ (k \in [1, 2001])$ of Φ in Figure 1, where a logarithmic scale on the y-axis is used to emphasize the small entries. The results confirm our point.
We give a method to find the applicable submatrices of Lemma 2 by Theorem 1.
Theorem 1.
If a submatrix satisfies Lemma 2, then the corresponding entries contain one and only one low valley of the $|\gamma_k|$ curve.
Proof. 
Take the first submatrix A 1 : η 1 (which is assumed to satisfy Lemma 2) as an example because the proofs of the others are similar.
Let X denote the eigenvector approximation from Lemma 2; we have $X = \sum_{t=1}^{p} x_t v_t = [s; 0]$. Thus, the corresponding entries of $A_{1:\eta_1}$ must contain at least one low valley; otherwise, all the $x_t$'s would be small values and violate the equation $\sum_{t=1}^{p} x_t^2 = 1$.
If the corresponding entries of $A_{1:\eta_1}$ contain more than one low valley, say two, they also contain one high hill of the $|\gamma_k|$ curve. This means X has a small component at the entry corresponding to the hill. In addition, X contains at least two major ingredients of $v_i$'s that have big components at the two valleys, respectively, or X contains one major ingredient of a $v_i$ that has big components at both entries. According to [10], if an eigenvector has one part with both ends small, the corresponding eigenvalue must have a close neighbor. Therefore, if the corresponding entries of $A_{1:\eta_1}$ contain more than one low valley, $A_{1:\eta_1}$ has clustered sub-eigenvalues in $[\lambda_1, \lambda_p]$, which contradicts the isolated sub-eigenvalue required by Lemma 2.
With the above conclusions, the proof is completed.  □
To illustrate Theorem 1 more intuitively, and as a complementary argument to the above proof, we performed the following numerical test. We calculated the distances between $\lambda_{2001}$ of Φ and the last two sub-eigenvalues of $\Phi_{1:\eta}\ (\eta \in [2, 2000])$; by the interlacing property from [17], the sub-eigenvalues close to $\lambda_{2001}$ must be the last ones. The result is shown in Figure 2, where a logarithmic scale on the y-axis is used to emphasize the small entries. In Figure 2, $\Phi_{1:\eta}$ starts to have one close eigenvalue when $\eta > 400$, which is the first low valley of the $|\gamma_k|$ curve, and two close eigenvalues when $\eta > 600$, which is the second valley. We also present the results for the last eight sub-eigenvalues of $\Phi_{1:\eta}\ (\eta \in [8, 2000])$ in Figure 3. It can be seen that, whenever $\Phi_{1:\eta}$ "crosses" a low valley of $|\gamma_k|$, the number of clustered sub-eigenvalues increases by one. Figure 2 and Figure 3 confirm Theorem 1 well. See more detailed numerical examples and accuracy results in Section 6.
According to [13], we have that (recall E is the envelope vector from Lemma 1)
$$b_j\,|s_1(j)|\,E(j+1) \leq V,$$
where V is independent of j. This means that a big E ( j + 1 ) corresponds to a small | s 1 ( j ) | .
Therefore, our computation strategy for clustered eigenvalues is as follows:
1. Every submatrix contains one low valley of the $|\gamma_k|$ curve.
2. The ends of each submatrix are the entries closest to the adjacent valleys.
3. According to Lemma 2, $(\lambda_p - \lambda_1) < p\sqrt{p}\,\|A\|\,\epsilon$ ensures $\|T\tilde{v}_s\| < \sqrt{n}\,\epsilon$; thus, it can be used as the "clustering" threshold.
We show the method for computing the eigenvectors of severely clustered eigenvalues in the pseudocode of Algorithm 2.
Algorithm 2: Compute severely clustered eigenvalues.
[pseudocode figure]
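The following MATLAB sketch shows one plausible reading of the splitting strategy above (ours, not the authors' Algorithm 2): locate the p deepest valleys of the $|\gamma_k|$ curve and let the submatrix for each valley extend from just after the previous valley to just before the next one, so that every submatrix contains exactly one valley and adjacent submatrices may overlap. Each returned index range would then be handed to the one-step routine of Section 2.2 and the resulting sub-eigenvector padded with zeros, as in Lemma 2.

```matlab
function ranges = split_by_valleys_sketch(gamma, p)
% gamma: |gamma_k| values of the full matrix; p: number of clustered eigenvalues.
% Assumes the curve has at least p local minima and p >= 2.
g = abs(gamma(:));
n = numel(g);
ismin = [false; g(2:n-1) < g(1:n-2) & g(2:n-1) < g(3:n); false];  % local minima
idx   = find(ismin);
[~, ord] = sort(g(idx));                 % keep the p deepest valleys
valleys  = sort(idx(ord(1:p)));
lo = [1; valleys(1:p-1) + 1];            % just after the previous valley
hi = [valleys(2:p) - 1; n];              % just before the next valley
ranges = [lo, hi];                       % row j holds the submatrix around valley j
end
```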
Assume that the p valleys are distributed uniformly. The cost of computing the eigenvectors of p severely clustered λ's by Algorithm 2 is about twice the cost of one isolated eigenvector by Algorithm 1, while the Inverse Iteration method needs p times that cost plus a reorthogonalization. This means that Algorithm 2 saves time compared to the Inverse Iteration method even when disregarding the latter's expensive orthogonalization cost.
For the matrix Φ, we calculated the R's (recall $R = \|Tz\|/\|z\|$, the residual norm) and the dot products of its last eight eigenvector approximations obtained by Algorithm 2. We show the mean and maximal results in Table 1 and compare them to the results of the Inverse Iteration method and the MRRR method. The results were collected on an Intel Core i5-4590 3.3-GHz CPU and a 16-GB RAM machine. All codes were written in Matlab2017a and executed in IEEE double precision. The machine precision is $\epsilon \approx 2.2 \times 10^{-16}$. It can be seen that all eight eigenvector approximations are accurate and numerically orthogonal. See more examples and numerical results in Section 6.

4. Reorthogonalization

4.1. General Q Iteration

For severely clustered eigenvalues, Algorithm 2 saves considerable time and avoids reorthogonalization. However, if the group of p clustered eigenvalues has a close eigenvalue neighbor, or another group of clustered eigenvalues at a distance within $(p\sqrt{p}\,\epsilon,\ 10^{-3})\,\|A\|$ (note that $p\sqrt{p}\,\|A\|\,\epsilon$ is the threshold of severe clustering), Algorithm 2 cannot ensure the orthogonality between them. Therefore, a reorthogonalization is needed. This is quite frustrating, not only because of the high cost of orthogonalization but also because using the modified Gram–Schmidt method for orthogonalization destroys the orthogonality of the eigenvectors obtained by Algorithm 2; in other words, the method proposed in the previous section would become meaningless. For example, suppose two groups of severely clustered eigenvalues have approximations $u_1$ and $u_2$, respectively, with $|u_1 - u_2| < 10^{-3}\,\|A\|$. Each group's eigenvectors are orthogonal, but Algorithm 2 cannot ensure the orthogonality of two eigenvectors from different groups. If one uses the modified Gram–Schmidt method to reorthogonalize them, it makes no difference whether the original vectors are orthogonal within groups. Therefore, we give a new reorthogonalization method in this section.
In [9], Dhillon introduced the twisted Q factorization. For an $n \times n$ ST matrix $T = A - \lambda_1 I$ ($\lambda_1$ is one eigenvalue of A) and a certain number $k\ (k \in [1, n])$, apply Givens rotations to its columns to eliminate the 1st to $(k-1)$th components on its super-diagonal and the kth to $(n-1)$th components on its sub-diagonal. Finally, a singleton in the kth column is left. The process is shown in Figure 4 (from [9]), where $n = 5$ and $k = 3$.
Let W denote the final form of the twisted Q factorization, and we have
$$TQ = W; \qquad T = W Q^T; \qquad Q = G_1 G_2 \cdots G_{n-1},$$
where $G_1, \ldots, G_{n-1}$ are Givens rotation matrices. Obviously, $W_{k,k} = R_k$. Therefore, at least one k satisfies $\zeta = W_{k,k} \leq \sqrt{n}\,\epsilon$ according to Section 2.3.
Now, we introduce our so-called general Q iteration. For such a k satisfying $\zeta \leq \sqrt{n}\,\epsilon$, we apply the corresponding Givens rotations to the rows of W. Using the example from Figure 4 ($n = 5$, $k = 3$) and applying $G_1^T, G_2^T, G_3^T, G_4^T$ successively, the singleton ζ spreads into the third row and column as
$$Q^T T Q = G_4^T G_3^T G_2^T G_1^T W = \begin{pmatrix}
\times & \times & 0 & & \\
\times & \times & -s_2\zeta & \times & \\
0 & -s_2\zeta & c_3 c_2\zeta & c_4 s_3 c_2\zeta & s_4 s_3 c_2\zeta \\
& \times & c_4 s_3 c_2\zeta & \times & \times \\
& & s_4 s_3 c_2\zeta & \times & \times
\end{pmatrix}, \tag{11}$$
where blank entries are zeros,
and $c_i$ and $s_i$ constitute $G_i$, i.e.,
$$G_i := \begin{pmatrix} c_i & s_i \\ -s_i & c_i \end{pmatrix}.$$
Note that the last rotation in (11) does not change the components at (3,4) and (3,5); we obtain their values according to symmetry.
Finally, we have
$$A_1 = Q^T T Q + \lambda_1 I$$
and complete one step of the general Q iteration. Obviously, $A_1$ has the same eigenpairs as A. As all the $|c_i|$'s and $|s_i|$'s are less than 1, all the remaining components of the kth row and column of $A_1$ are less than ζ. Therefore, deflation can proceed as
$$A_1 \approx \begin{pmatrix}
\times & \times & & & \\
\times & \times & & \times & \\
& & \lambda_1 & & \\
& \times & & \times & \times \\
& & & \times & \times
\end{pmatrix} = B,$$
where blank entries are zeros.
Thus, B has eigenvalues numerically equal to $\lambda_2, \ldots, \lambda_5$, and the corresponding eigenvectors can be calculated similarly to the QR method. For example, if $s_2 = [x_1, x_2, x_3, x_4]^T$ is the eigenvector of B with respect to $\lambda_2$, then $v_2 = Q s_2$. These $v_i$'s are certainly orthogonal. Note that B can be transformed into an ST matrix by chasing and eliminating its bulge (for example, the (2,4) and (4,2) components of B) with Givens rotations. Therefore, it costs at most 1.5 times the operations of the QR (or QL) iteration, except in the special case $k = n$ or 1.
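To make the back-transformation $v = Qs$ concrete, here is a generic MATLAB helper that applies a stored sequence of plane rotations of the form defined above to a vector. It is only a sketch under the assumption that $G_i$ acts on adjacent components $(i, i+1)$; in the general Q iteration the rotations sweep in from both ends toward the singleton column k, but the arithmetic per rotation (six operations, as counted in the next subsection) is the same.

```matlab
function v = apply_Q_sketch(c, s, x)
% c, s: rotation parameters of G_1 ... G_{n-1}; x: vector to transform.
n = numel(x);
v = x(:);
for i = n-1:-1:1                    % Q = G_1*G_2*...*G_{n-1}, so G_{n-1} acts first
    gi = [c(i), s(i); -s(i), c(i)]; % G_i as defined in the text
    v(i:i+1) = gi * v(i:i+1);
end
end
```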
Therefore, the general Q iteration fulfills a deflation of a certain λ by a QR-like transformation. For a normal ST matrix and one accurate approximation to λ, $k = 1$ or n is enough; thus, the cost of chasing the bulge can be saved. However, in some special cases, neither $|\gamma_1|$ nor $|\gamma_n|$ is small, which means it costs numerous QR-like iterations to converge. This is similar to the solution of (2) by inverse iterations, considering the strong relationship between the Inverse Iteration method and the QR (or QL) method [7]. Recall that we gave the one-step inverse iteration in Section 2; the general Q iteration can be regarded as a one-step QR-like iteration. In our numerical experience, the case in which several QR iterations (using an accurate eigenvalue approximation as the shift) cannot reach convergence is not rare. For example, for a random $2000 \times 2000$ ST matrix, most $\lambda_i$'s allow one-step convergence by the QR iteration, but some $\lambda_i$'s may cost more than 50 steps. Moreover, this situation arises in almost every random matrix.
Mastronardi and Van Dooren discovered this instability when computing an ST eigenvector and solved the problem by a modified implicit QR decomposition method [12]. Their method can ensure an accurate calculation. However, this paper uses a modified inverse iteration method to calculate the eigenvector; the implicit QR decomposition in our paper is used for deflation and to guarantee orthogonality when the eigenvalues cluster generally.
The corresponding pseudocode for computing generally clustered eigenvectors is given in Algorithm 3. General clustering denotes that the span of the p clustered eigenvalues is not big enough to guarantee the orthogonality of the corresponding eigenvectors (calculated by the Inverse Iteration method or Algorithms 1 and 2), i.e., $\lambda_p - \lambda_1 \leq 10^{-3}\,\|A\|$.
Algorithm 3: Computing generally clustered eigenvectors.
[pseudocode figure]

4.2. Cost of Reorthogonalization

This subsection concerns the cost of reorthogonalization in Algorithm 3. For k clustered eigenvalues, the last obtained v (line 7 in Algorithm 3) is an $(n+1-k) \times 1$ vector. It has to be premultiplied by $n-k$ Givens rotation matrices to become an $(n+2-k) \times 1$ vector. This process is repeated until the length reaches n. For every Givens rotation, the cost is six operations. Therefore, the total cost is
$$6\big((n-k)\times 1 + (n+1-k)\times 2 + (n+2-k)\times 3 + \cdots + (n-2)\times(k-1)\big) = 6n\,(1+2+\cdots+(k-1)) - 6\big(k\times 1 + (k-1)\times 2 + \cdots + 2\times(k-1)\big) = 3nk^2 - k^3 - 3k^2 + 4k - 3nk. \tag{12}$$
At first sight, (12) is hardly satisfactory, as the modified Gram–Schmidt method costs only $4n\times(1+2+3+\cdots+k) \approx 2nk^2$ operations. Only when k is close to n does our method match the efficiency of the modified Gram–Schmidt method. Moreover, the cases where we need to use the general Q iteration (the QR-like iterations that cannot converge in one step) have not been considered. However, the cost slumps for cases with many severely clustered eigenvalues within groups.
For example, if m eigenvalues are severely clustered among the k eigenvalues, the cost is
$$3n(k-m)^2 - (k-m)^3 - 3(k-m)^2 + 4(k-m) - 3n(k-m) + 6(n-m)m, \tag{13}$$
which decreases from $O(nk^2)$ to $O(nk)$ if m is close to k. In addition, the cost for the modified Gram–Schmidt method in this case is $4n\,(m + (m+1) + \cdots + k) \approx 2n(m+k)(k-m)$.
If the k eigenvalues can be divided into two severely clustering groups, the cost is
$$6(n-m)m, \tag{14}$$
which decreases from $O(nk^2)$ to $O(nm)$. In addition, the cost for the modified Gram–Schmidt method in this case is $2n(m+k)(k-m)$.
Therefore, Algorithm 3 calls either the deflation method with the general Q iteration or the modified Gram–Schmidt method, according to an advance prediction based on (12)–(14). However, both methods are time-consuming in cases where k is very close to n and the eigenvalues have few severely clustered groups; in this case, the best method is the MRRR method. See more examples and numerical details in Section 6.
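The advance prediction mentioned above amounts to evaluating the operation counts (12)–(14) and the corresponding modified Gram–Schmidt counts. The following MATLAB helper sketches this bookkeeping for the two cases treated above (no severe clustering, and one severe group of size m); it is our illustration, not the decision rule actually coded in Algorithm 3.

```matlab
function [cost_gq, cost_mgs] = reorth_cost_sketch(n, k, m)
% n: matrix size; k: number of clustered eigenvalues; m: size of a severely
% clustered group among them (m = 0 means none).
if m == 0
    cost_gq  = 3*n*k^2 - k^3 - 3*k^2 + 4*k - 3*n*k;   % Equation (12)
    cost_mgs = 4*n * sum(1:k);                        % approx. 2*n*k^2
else
    km = k - m;                                       % Equation (13)
    cost_gq  = 3*n*km^2 - km^3 - 3*km^2 + 4*km - 3*n*km + 6*(n - m)*m;
    cost_mgs = 4*n * sum(m:k);                        % approx. 2*n*(m+k)*(k-m)
end
end
```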

4.3. Modification of QR-Like Iteration

The general Q iteration can be seen as starting a QL iteration from the left of the matrix, stopping it at column k, and then doing a QR iteration from the right of the matrix till there is a singleton in the kth column. We give a subtle modification to the QR or QL iteration with the implicit shift to save some operations. Take the QR iteration as an example, and the traditional process is shown in Algorithm 4.
One step of the QR iteration applied to a $4 \times 4$ ST matrix is shown as follows:
$$\begin{pmatrix} c_1 & s_1 & & \\ -s_1 & c_1 & & \\ & & 1 & \\ & & & 1 \end{pmatrix}
\begin{pmatrix} a_1 & b_1 & & \\ b_1 & a_2 & b_2 & \\ & b_2 & a_3 & b_3 \\ & & b_3 & a_4 \end{pmatrix}
\to
\begin{pmatrix} \times & \times & s_1 b_2 & \\ -s_1\delta & \pi_2 + c_1\delta & c_1 b_2 & \\ & b_2 & a_3 & b_3 \\ & & b_3 & a_4 \end{pmatrix}
\to
\begin{pmatrix} \bar{a}_1 & s_1\pi_2 & s_1 b_2 & \\ s_1\pi_2 & c_1\pi_2 + \delta & c_1 b_2 & \\ s_1 b_2 & c_1 b_2 & a_3 & b_3 \\ & & b_3 & a_4 \end{pmatrix} \tag{15}$$
where δ is the implicit shift and the second arrow applies the same rotation to the columns.
Algorithm 4: QR iteration with the implicit shift.
[pseudocode figure]
In (15), $\pi_{i+1}$ is updated by $\pi_{i+1} = c_i(a_{i+1} - \delta) - s_i b_i$, which corresponds to line 10 in Algorithm 4. This equation can be rewritten as
$$\pi_{i+1}/c_i = (a_{i+1} - \delta) - s_i b_i / c_i = (a_{i+1} - \delta) - b_i c_{i-1} b_i / \pi_i = (a_{i+1} - \delta) - b_i^2/(\pi_i/c_{i-1}).$$
Without loss of generality, assume that $c_0 = 1$; then $\pi_1/c_0 = a_1 - \delta = q_1$ (recall that $q_i$ is the Sturm sequence from (8)). Finally, we have
$$\pi_{i+1}/c_i = q_{i+1} \quad (i \in [0, n-1]). \tag{16}$$
Note that all the $q_i$'s have been calculated in advance when searching for the smallest $|\gamma_k|$ in our methods; thus, we can use (16) to update the π's instead. We show the modified QR iteration algorithm in Algorithm 5.
Algorithm 5 costs $6n$ multiplications, $2n$ divisions, and $(n-1)$ square roots, while Algorithm 4 costs $9n$ multiplications, $2n$ divisions, and $(n-1)$ square roots. Thus, our modification saves $3n$ multiplications.
Algorithm 5: Modified QR iteration with the implicit shift.
[pseudocode figure]
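Identity (16) can be checked numerically. The short MATLAB script below does so under the assumption, taken from the derivation above, that the rotation at step i is generated from the pair $(\pi_i,\ b_i c_{i-1})$; it is a verification sketch, not Algorithm 5 itself.

```matlab
n = 50;
a = randn(n,1);  b = randn(n-1,1);  delta = a(1) + 0.1;      % an arbitrary shift
q = zeros(n,1);  q(1) = a(1) - delta;
for i = 2:n
    q(i) = a(i) - delta - b(i-1)^2 / q(i-1);                 % Sturm sequence (8)
end
c = ones(n,1);  piv = zeros(n,1);  piv(1) = a(1) - delta;    % c(1) stores c_0 = 1
for i = 1:n-1
    r  = hypot(piv(i), b(i)*c(i));                           % rotation from (pi_i, b_i*c_{i-1})
    ci = piv(i)/r;   si = b(i)*c(i)/r;
    piv(i+1) = ci*(a(i+1) - delta) - si*b(i);                % traditional update (Algorithm 4, line 10)
    c(i+1)   = ci;
end
max(abs(piv(2:n)./c(2:n) - q(2:n)) ./ abs(q(2:n)))           % roundoff-level relative difference
```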

5. Avoiding Overflow and Underflow

Our new method obtains an eigenvector essentially by the cumulative products of q’s, as shown in lines 9 and 12 of Algorithm 1. As is well known, the products can grow or decay rapidly; hence, the recurrences to compute them are susceptible to severe overflow and underflow problems. This section gives a relatively cheap algorithm to avoid overflow and underflow.
Let f denote the overflow threshold, for example, $f = 2^{1023}$ in IEEE double precision arithmetic. Whenever an intermediate product during the recurrences exceeds f, multiply it by $f^{-1}$ to normalize it and continue the iteration. Similarly, whenever one falls below $f^{-1}$, multiply it by f. At the same time, we save the corresponding entry and mark 1 for overflow and $-1$ for underflow.
Assume that y positions, which divide the eigenvector approximation $\tilde{v}$ into $y+1$ parts, are marked when the iteration is completed. Then, we have a $y \times 1$ vector Y with components 1 and $-1$. For any given position, the mark 1 means that the components of $\tilde{v}$ from it to the end are shrunk by a factor of f compared with v, while the mark $-1$ means amplification by f. The mark before the first component of $\tilde{v}$ is zero; thus, we set $Y \leftarrow [0; Y]$.
Calculate the cumulative sums of Y from the first component onward and save the result at each entry. In this way, each component of Y corresponds to one part of $\tilde{v}$, and its value represents the degree to which the corresponding part has been enlarged or reduced: a positive value m means that this part has been reduced by a factor of $f^m$, while a negative value means it has been enlarged; when the value is zero, the corresponding part is neither enlarged nor reduced.
Revisiting $\tilde{v}$, none of its components has overflowed; they only need to be restored to their true values. The biggest part after restoration corresponds to the biggest component of Y (recall that each component of Y corresponds to one part of $\tilde{v}$), because it has been reduced the most times. Since $\tilde{v}$ is ultimately normalized, we take the biggest part as the benchmark. Thus, the second biggest component of Y corresponds to the second biggest part of $\tilde{v}$ after restoration, which should be divided by f. The remaining parts, if they exist, would need to be divided by $f^2$ or more, so we directly set their components to zero.
We give the corresponding pseudocode in Algorithm 6, which details lines 9 and 12 of Algorithm 1.
Algorithm 6: Compute q without overflow and underflow.
[pseudocode figure]
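The rescaling-and-marking idea described above can be sketched in MATLAB as follows (a simplified illustration of ours, not Algorithm 6 itself; the subsequent restoration using the cumulative sums of the marks follows the description in the text).

```matlab
function [x, marks] = scaled_cumprod_sketch(factors, f)
% factors: the terms whose running products are needed (e.g., q_i/(-b_i));
% f: overflow threshold, f = 2^1023 in IEEE double precision.
if nargin < 2, f = 2^1023; end
n = numel(factors);
x = zeros(n,1);  marks = zeros(n,1);
running = 1;
for i = 1:n
    running = running * factors(i);
    if abs(running) > f
        running = running / f;   marks(i) = 1;    % shrunk by f from here on
    elseif running ~= 0 && abs(running) < 1/f
        running = running * f;   marks(i) = -1;   % amplified by f from here on
    end
    x(i) = running;
end
end
```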
Finally, we give the complete modified Inverse Iteration method in Algorithm 7.
Algorithm 7: Modified Inverse Iteration method.
[pseudocode figure]

6. Numerical Results

In this section, we present a numerical comparison between the modified Inverse Iteration method and four other widely used algorithms for computing eigenvectors:
1.
the Inverse Iteration method, by calling subroutine “dstein” from LAPACK in Matlab;
2.
the MRRR method, by calling subroutine “dstegr” from LAPACK in Matlab;
3.
the QR method, by calling subroutine “dsteqr” from LAPACK in Matlab;
4.
the DC method, by calling subroutine “dstedc” from LAPACK in Matlab.
Since the MRRR, QR, and DC methods compute the eigenpairs instead of only the eigenvectors, we compare the total cost for eigenpairs in this section. To obtain eigenvalues for Algorithm 7 and the Inverse Iteration method, we use the PWK version of the QR method (by calling subroutine 'dsterf' from LAPACK in Matlab) when calculating more than 5% of the eigenpairs; otherwise, we use the Bisection method (by calling subroutine 'dstebz' from LAPACK in Matlab). Note that the QR and DC methods are only available when computing all the eigenpairs and thus are not compared in the cases where only part of the eigenpairs is computed.
We use the following five types of n × n matrices for tests:
1.
Matrix $\Phi_1$, which is constructed similarly to Φ in Section 3 with $\alpha_0$ = (1:200). We change the number of repetitions of $\alpha \leftarrow [\alpha; \alpha_0]$ to adjust the size of Matrix $\Phi_1$. Note that this matrix has many groups of clustered eigenvalues (both severe and general clustering occur) and has overflow issues if computed directly.
2.
Matrix $\Phi_2$, which is constructed similarly to $\Phi_1$ with $\alpha_0$ = (1:80). This matrix also has many groups of clustered eigenvalues (both severe and general clustering occur) but has no overflow issue if computed directly.
3.
Matrix $W_1$, the famous Wilkinson matrix, which has the ith diagonal component equal to $|(n+1)/2 - i|$ (n is odd) and all off-diagonal components equal to 1. All its eigenvalues cluster severely in pairs.
4.
Matrix $W_2$, another form of the Wilkinson matrix, which has the ith ($i \in [1, (n+1)/2]$) diagonal component equal to $|(n+1)/2 - i|$ (n is odd), the ith ($i \in [(n+1)/2+1, n]$) diagonal component equal to $-|(n+1)/2 - i|$, and all off-diagonal components equal to 1. Its eigenvalues do not cluster if the size is less than 2000. A construction sketch for $W_1$ and $W_2$ is given after this list.
5.
Random Matrix, with both diagonal and off-diagonal elements being uniformly distributed random numbers in [−1, 1]. Note that all the Random Matrix results in this section are the means over 20 test runs.
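For reference, items 3 and 4 can be realized in MATLAB as below (a sketch with our own variable names; n must be odd, and the sign flip in the second half of the $W_2$ diagonal reflects the classical two-sided Wilkinson matrix).

```matlab
n  = 2001;                                  % any odd size
m  = (n + 1) / 2;
d1 = abs(m - (1:n))';                       % W1 diagonal: |(n+1)/2 - i|
d2 = d1;  d2(m+1:end) = -d2(m+1:end);       % W2: second half negated
e  = ones(n, 1);
W1 = full(spdiags([e, d1, e], -1:1, n, n)); % off-diagonals equal to 1
W2 = full(spdiags([e, d2, e], -1:1, n, n));
```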
The results were collected on an Intel Core i5-4590 3.3-GHz CPU and 16-GB RAM machine. All codes were written in Matlab2017a and executed in IEEE double precision. The machine precision is ϵ 2.2 × 10 16 .

6.1. Accuracy Test

Figure 5, Figure 6, Figure 7, Figure 8 and Figure 9 present the results of the residual norms, i.e., $R = \|T\tilde{v}\|/\|\tilde{v}\|$, where the Average Errors denote the means of the R's of all the calculated eigenvectors and the Maximal Errors denote the maximum. The results of the dot products of the calculated eigenvectors are also presented to show orthogonality. Different sizes are used in our test, from $400 \times 400$ to $2000 \times 2000$. We denote by F the 2-norm of the tested matrix; for example, $F = \|\Phi_1\|$ in Figure 5. The results confirm that Algorithm 7 computes accurate and numerically orthogonal eigenvectors.

6.2. Efficiency Test of Part Eigenpairs

Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14 show the time cost for computing 10%, 30%, 50%, and 70% of the eigenpairs of the above five types of matrices at each size. Note that the cost of the Inverse Iteration method surges in Figure 12 because, as the size of Matrix $W_1$ rises, the eigenvalues start to cluster and need an expensive reorthogonalization by the modified Gram–Schmidt method. The MRRR method costs the most for every matrix because it needs more accurate eigenvalues and calls the Bisection method, while the Inverse Iteration method and Algorithm 7 call the PWK version of the QR method to obtain all eigenvalues. Finally, the results show that the modified Inverse Iteration method always costs the least time and has a surpassing efficiency when eigenvalues cluster severely, which confirms our points in Section 3.

6.3. Efficiency Test of Minor Eigenpairs

When it comes to a minor set of eigenpairs, it is inadvisable to calculate all the eigenvalues by the PWK version of the QR method for the Inverse Iteration method and Algorithm 7. We use the Bisection method instead, similar to the MRRR algorithm. Thus, the result is more convincing in this case because all the methods obtain the eigenvalues at an identical cost.
We calculated 0.2%, 0.4%, 0.6%, 0.8%, and 1% of the eigenpairs of the above five types of matrices and used two sizes: $2001 \times 2001$ and $10001 \times 10001$. The results are presented in Figure 15 and Figure 16. It can be seen that the cost of the MRRR method is close to that of the Inverse Iteration method when computing clustered eigenpairs but higher in other cases. Once again, the modified Inverse Iteration method prevails in all cases.

6.4. Efficiency Test of All Eigenpairs

As discussed in previous sections, Algorithm 7 is not suitable for computing all the eigenvectors because the DC method has a significant advantage in this case. Nevertheless, we also performed the corresponding test and show the results in Figure 17. It can be seen in Figure 17b,c that the modified Inverse Iteration method has a time cost close to that of the DC method. The efficiency increase comes from the computation process for severely clustered eigenvectors, which is recurrent in Matrix $\Phi_2$ and $W_1$. The acceleration is not as distinct in Figure 17a (where many eigenvectors also cluster severely) because it takes extra operations to avoid overflow and underflow in Matrix $\Phi_1$, which do not arise in Matrix $\Phi_2$. However, the DC method is still recommended when computing all the eigenpairs.

6.5. Comparing with Mastronardi’s Method

Mastronardi [3,12] developed a procedure for computing an eigenvector of a symmetric tridiagonal matrix once its associated eigenvalue is known and gave the corresponding Matlab codes in [12].
We tested the Matlab routine, collected the residual norm errors (denoted by R), dot product errors, and time cost on the test matrices, and compared them with our new method. The results are shown in Table 2. Note that Mastronardi’s method is for one ST eigenvector; thus, we calculated the maximal eigenpairs of the test matrices. All the matrices in Table 2 have a size of 2001. The residual norm data have been scaled by the product of the machine precision and the 2-norm of the tested matrix.
Table 2 shows that Mastronardi's method can provide a better result for Matrix $W_2$ when considering orthogonality. However, Algorithm 7 has a significant advantage in time cost. In addition, Mastronardi's method seems unstable when computing the eigenvector (corresponding to the maximal eigenvalue) of Matrix $\Phi_1$ and $W_1$: the Matlab routine provided in [12] failed to converge. The instability also arises in computing some eigenvectors of the random matrices. As a consequence, we do not present the corresponding results for Matrix $\Phi_1$ and $W_1$ in Table 2.
The test for calculating all eigenvectors also became stuck because of this instability. However, it is easy to conclude that the time cost of Mastronardi's method would be much higher than that of Algorithm 7, given the significant per-eigenvector differences shown in Table 2. In addition, Mastronardi's method is unsuitable for computing all the eigenvectors, as its deflation process costs $O(2n^3)$ operations [12] and cannot benefit from the sub-diagonal "zeros" as the traditional QR method does.

7. Discussion

Algorithm 7 is essentially a modified version of the MRRR method, and hence of the Inverse Iteration method, as the MRRR method implements inverse iteration in bidiagonal form. The key improvements are:
1. The one-step iteration method with Algorithm 6 to avoid overflow and underflow. Although the MRRR method uses another version of one-step iteration, the accompanying square and square-root operations slow down the routine.
2. Computing severely clustered eigenvectors by the envelope vector theory. Severely clustered eigenvalues, which make the cost of the MRRR and Inverse Iteration methods surge, bring, on the contrary, a significant acceleration for our new method. The scheme of the MRRR method for clustered eigenvalues is ingenious, with a time complexity of $O(n^2)$, but costs too many operations when searching for the so-called "Relatively Robust Representations". In terms of results, it is even the slowest when severely clustered eigenvalues arise.
3. The novel reorthogonalization method. Dhillon also tried the envelope vectors when the MRRR method was stuck by the glued Wilkinson matrices [11] but gave up because of the general clustering of severely clustered groups. This paper solves the problem by the general Q iteration. Note that we also accelerate the QR-like iteration itself by Algorithm 5.
The results in Section 6 show that the modified Inverse Iteration method is suitable for computing part of the eigenpairs, especially the severely clustered ones. When computing a minor set, our new method is significantly faster. As the computations for every eigenpair are independent, our new method is flexible enough to calculate them in any given order. However, when eigenvalues cluster generally without severely clustered groups, one should use the MRRR method. In addition, the DC method is absolutely the champion for computing all the eigenpairs for almost every type of matrix. Nevertheless, considering that it is rare to calculate all the eigenpairs of a large matrix in practice, this paper provides a novel, practical, flexible, and fast method.
Algorithm 7 can be divided into roughly three steps: finding the smallest $|\gamma_k|$; computing the isolated or clustered eigenvectors; and reorthogonalizing by premultiplying Givens rotation matrices. The cost of the other parts of the calculation is not comparable to that of these three steps. Note that all these main steps can be implemented in parallel. Therefore, Algorithm 7 is suitable for parallel computation. We will focus on the parallel version of the modified Inverse Iteration method in our future research work.

Author Contributions

Formal analysis, W.C., Y.Z. and H.Y.; investigation, W.C. and Y.Z.; writing—original draft, W.C.; writing—review and editing, H.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research is funded by the Talent Team Project of Zhangjiang City in 2021 and the R & D and industrialization project of the offshore aquaculture cage nets system of Guangdong Province of China (Grant No. 2021E05034). Huazhong University of Science and Technology funds the APC.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the editors and reviewers for their constructive comments, which will improve the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ST (matrix): Symmetric Tridiagonal (matrix)
DC (algorithm): Divide and Conquer (algorithm)
MRRR (algorithm): Multiple Relatively Robust Representations (algorithm)

References

  1. Xu, W.R.; Bebiano, N.; Chen, G.L. On the construction of real non-self adjoint tridiagonal matrices with prescribed three spectra. Electron. Trans. Numer. Anal. 2019, 51, 363–386.
  2. Van Dooren, P.; Laudadio, T.; Mastronardi, N. Computing the Eigenvectors of Nonsymmetric Tridiagonal Matrices. Comput. Math. Math. Phys. 2021, 61, 733–749.
  3. Laudadio, T.; Mastronardi, N.; Van Dooren, P. Computing Gaussian quadrature rules with high relative accuracy. Numer. Algorithms 2022.
  4. Nesterova, O.P.; Uzdin, A.M.; Fedorova, M.Y. Method for calculating strongly damped systems with non-proportional damping. Mag. Civ. Eng. 2018, 81, 64–72.
  5. Bahar, M.K. Charge-Current Output in Plasma-Immersed Hydrogen Atom with Noncentral Interaction. Ann. Phys. 2021, 533.
  6. Gu, M.; Eisenstat, S.C. A divide-and-conquer algorithm for the symmetric tridiagonal eigenproblem. SIAM J. Matrix Anal. Appl. 1995, 16, 172–191.
  7. Parlett, B.N. The Symmetric Eigenvalue Problem; SIAM: Philadelphia, PA, USA, 1997.
  8. Peters, G.; Wilkinson, J.H. The calculation of specified eigenvectors by inverse iteration. In Handbook for Automatic Computation; Springer: Berlin/Heidelberg, Germany, 1971; pp. 418–439.
  9. Dhillon, I.S. A New O(n^2) Algorithm for the Symmetric Tridiagonal Eigenvalue/Eigenvector Problem. Ph.D. Thesis, University of California, Berkeley, CA, USA, 1997.
  10. Wilkinson, J.H. The Algebraic Eigenvalue Problem. In Handbook for Automatic Computation, Volume II, Linear Algebra; Oxford University Press: Oxford, UK, 1969.
  11. Dhillon, I.S.; Parlett, B.N.; Vömel, C. Glued matrices and the MRRR algorithm. SIAM J. Sci. Comput. 2005, 27, 496–510.
  12. Mastronardi, N.; Taeter, H.; Van Dooren, P. On computing eigenvectors of symmetric tridiagonal matrices. Springer INdAM Ser. 2019, 30, 181–195.
  13. Parlett, B.N. Invariant subspaces for tightly clustered eigenvalues of tridiagonals. BIT Numer. Math. 1996, 36, 542–562.
  14. Parlett, B.; Dopico, F.M.; Ferreira, C. The inverse eigenvector problem for real tridiagonal matrices. SIAM J. Matrix Anal. Appl. 2016, 37, 577–597.
  15. Kovačec, A. Schrödinger's tridiagonal matrix. Spec. Matrices 2021, 9, 149–165.
  16. da Fonseca, C.M.; Kılıç, E. A new type of Sylvester–Kac matrix and its spectrum. Linear Multilinear Algebra 2021, 69, 1072–1082.
  17. Chu, W.; Zhao, Y.; Yuan, H. A Novel Divisional Bisection Method for the Symmetric Tridiagonal Eigenvalue Problem. Mathematics 2022, 10, 2782.
  18. Barth, W.; Martin, R.; Wilkinson, J. Calculation of the eigenvalues of a symmetric tridiagonal matrix by the method of bisection. Numer. Math. 1967, 9, 386–393.
Figure 1. The $|\gamma_k|$ curve of Φ.
Figure 2. The distances of the last two sub-eigenvalues.
Figure 3. The distances of the last eight sub-eigenvalues.
Figure 4. The twisted Q factorization.
Figure 5. The accuracy results of Matrix $\Phi_1$: (a) the average residual norm; (b) the maximal residual norm; (c) the average dot product; (d) the maximal dot product.
Figure 6. The accuracy results of Matrix $\Phi_2$: (a) the average residual norm; (b) the maximal residual norm; (c) the average dot product; (d) the maximal dot product.
Figure 7. The accuracy results of Matrix $W_1$: (a) the average residual norm; (b) the maximal residual norm; (c) the average dot product; (d) the maximal dot product.
Figure 8. The accuracy results of Matrix $W_2$: (a) the average residual norm; (b) the maximal residual norm; (c) the average dot product; (d) the maximal dot product.
Figure 9. The accuracy results of Random Matrices: (a) the average residual norm; (b) the maximal residual norm; (c) the average dot product; (d) the maximal dot product.
Figure 10. The time cost for Matrix $\Phi_1$ when calculating part of the eigenpairs: (a) 10%; (b) 30%; (c) 50%; (d) 70%.
Figure 11. The time cost for Matrix $\Phi_2$ when calculating part of the eigenpairs: (a) 10%; (b) 30%; (c) 50%; (d) 70%.
Figure 12. The time cost for Matrix $W_1$ when calculating part of the eigenpairs: (a) 10%; (b) 30%; (c) 50%; (d) 70%.
Figure 13. The time cost for Matrix $W_2$ when calculating part of the eigenpairs: (a) 10%; (b) 30%; (c) 50%; (d) 70%.
Figure 14. The time cost for Random Matrix when calculating part of the eigenpairs: (a) 10%; (b) 30%; (c) 50%; (d) 70%.
Figure 15. The time cost for minor eigenpairs at size $2001 \times 2001$: (a) Matrix $\Phi_1$; (b) Matrix $\Phi_2$; (c) Matrix $W_1$; (d) Matrix $W_2$; (e) Random Matrix.
Figure 16. The time cost for minor eigenpairs at size $10001 \times 10001$: (a) Matrix $\Phi_1$; (b) Matrix $\Phi_2$; (c) Matrix $W_1$; (d) Matrix $W_2$; (e) Random Matrix.
Figure 17. The time cost for all eigenpairs: (a) Matrix $\Phi_1$; (b) Matrix $\Phi_2$; (c) Matrix $W_1$; (d) Matrix $W_2$; (e) Random Matrix.
Table 1. Accuracy and orthogonality.

| Method | Mean R (×ε‖Φ‖) | Max R (×ε‖Φ‖) | Mean Dot Product (×ε) | Max Dot Product (×ε) | Time Cost (×10⁻² s) |
|---|---|---|---|---|---|
| Algorithm 2 | 1.5 | 1.5 | 0 | 0 | 0.1 |
| Inverse Iteration | 1.2 | 1.2 | 0 | 0.05 | 2.9 |
| MRRR | 1.2 | 1.2 | 0 | 0 | 4.2 |
Table 2. Comparing with Mastronardi's method when calculating one eigenvector.

| Matrix | Method | R (×ε‖A‖) | Max Dot Product (×ε) | Time Cost (s) |
|---|---|---|---|---|
| $\Phi_1$ | Mastronardi's | - | - | - |
| $\Phi_1$ | Algorithm M | 3.42 | 1.2 | 5.5×10⁻⁴ |
| $\Phi_2$ | Mastronardi's | 2.86 | 1.5 | 0.24 |
| $\Phi_2$ | Algorithm M | 3.01 | 0.7 | 4.0×10⁻⁴ |
| $W_1$ | Mastronardi's | - | - | - |
| $W_1$ | Algorithm M | 0.27 | 1.4 | 4.6×10⁻³ |
| $W_2$ | Mastronardi's | 18.7 | 0 | 0.29 |
| $W_2$ | Algorithm M | 0.27 | 1.2 | 4.8×10⁻⁴ |
| Random | Mastronardi's | 24.6 | 2.1 | 0.30 |
| Random | Algorithm M | 12.2 | 0 | 5.5×10⁻⁴ |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
