Article

The Canonical Forms of Permutation Matrices

1 School of Mathematical Science, University of Science and Technology of China, Hefei 230026, China
2 School of Information and Mathematics, Anhui International Studies University, Hefei 231201, China
3 College of Elementary Education, Capital Normal University, Beijing 100048, China
4 Department of Mathematics, Shanghai University, Shanghai 200444, China
* Author to whom correspondence should be addressed.
Symmetry 2023, 15(2), 332; https://doi.org/10.3390/sym15020332
Submission received: 5 January 2023 / Revised: 19 January 2023 / Accepted: 22 January 2023 / Published: 25 January 2023
(This article belongs to the Section Mathematics)

Abstract

We address the classification of permutation matrices in terms of the permutation similarity relation, which plays an important role in investigating the reducible solutions of some symmetric matrix equations. We solve three problems. First, what is the canonical form of a permutation similarity class? Second, how can the canonical form of an arbitrary permutation matrix be obtained? Third, for any permutation matrix $A$, how can one find a permutation matrix $T$ such that $T^{-1}AT$ is in canonical form? Besides, the decomposition theorem of permutation matrices and the factorization theorem of both permutation matrices and monomial matrices are demonstrated.

1. Introduction

The incidence matrix of a projective plane of order $n$ is a 0-1 matrix of order $n^2 + n + 1$. Two projective planes are isomorphic if the incidence matrix of one can be transformed into the incidence matrix of the other by permutations of rows and/or columns. After sorting the rows and columns, the incidence matrix of a projective plane can be reduced to a (not unique) standard form. In the reduced form, the incidence matrix can be split into blocks, most of which are permutation matrices (see [1]). If we keep the position of every block of the reduced form and perform permutations of the rows and columns, every permutation matrix block is transformed into another matrix that is permutationally similar to the original one.
The members of the symmetric group $S_n$ on $n$ letters are called permutations. They are tightly connected with permutation matrices of order $n$. Permutation matrices are powerful tools in the representation theory of groups, discrete mathematics, applied mathematics, and some engineering technologies (see [2,3,4,5]). They play an important role in the study of the reducible solutions of matrix equations (see [6]). Elementary row (or column) transformations, which are equivalent to multiplication by permutation matrices or diagonal matrices, are inevitable in solving matrix equations, so the techniques of matrix transformations (especially row or column permutations) are applicable there.
This paper is devoted to the permutational similarity relation and to the classification of permutation matrices. In particular, we focus on the standard structure of a general permutation matrix, on the canonical form of a permutation similarity class, and on how to generate the canonical form. Furthermore, a theorem is presented about the decomposition of a permutation matrix into a diagonal matrix and some generalized cycle matrices of type II. A factorization theorem shows that an arbitrary non-identity permutation matrix is the product of some generalized cycle matrices of type I. These results are presented in Section 3, which is the main part of this paper.
The number of permutational similarity classes of permutation matrices of order $n$ is discussed in Section 4. A similar factorization for monomial matrices is discussed at the end of the paper.

2. Preliminary

Let $n$ be a positive integer and $P$ a square matrix of order $n$. If $P$ is a binary matrix (i.e., its elements are either 0 or 1; also referred to as a 0-1 matrix or a (0, 1) matrix) and there is a unique "1" in every row and every column, then $P$ is called a permutation matrix. If we substitute the "1"s in a permutation matrix by other non-zero elements, we obtain a monomial matrix, also referred to as a generalized permutation matrix.
As a matter of fact, there is a reason for the name "permutation matrix". If a matrix $T$ of size $n \times r$ is multiplied by a permutation matrix $P$ of order $n$ from the left, we obtain a permutation of the rows of $T$. If $U$ is a matrix of size $t \times n$ and $P$ acts on $U$ from the right, we obtain a permutation of the columns of $U$. The inverse $P^{-1}$ of a permutation matrix $P$ coincides with the transpose $P^T$, and $P^{-1}$ is itself a permutation matrix.
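As a quick illustration, here is a minimal NumPy sketch (the matrices are arbitrary illustrative values, not taken from the paper):

```python
import numpy as np

# A permutation matrix of order 3: column i is e_{sigma(i)} for the cycle 1 -> 2 -> 3 -> 1.
P = np.array([[0, 0, 1],
              [1, 0, 0],
              [0, 1, 0]])

T = np.arange(12).reshape(3, 4)   # an n x r matrix (n = 3, r = 4), arbitrary entries
U = np.arange(6).reshape(2, 3)    # a  t x n matrix (t = 2, n = 3), arbitrary entries

print(P @ T)                                  # rows of T permuted
print(U @ P)                                  # columns of U permuted
print(np.allclose(np.linalg.inv(P), P.T))     # P^{-1} coincides with P^T
```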
Let $k$ be a positive integer greater than 1 and $C$ an invertible (0, 1) matrix of order $k$. If $C^k = I_k$ ($I_k$ is the identity matrix of order $k$) and $C^i \ne I_k$ for any $i$ ($1 \le i < k$), then $C$ will be referred to as a cycle matrix of order $k$. A cycle matrix of order $k$ of the form
$$\begin{pmatrix} 0 & & & 1 \\ 1 & \ddots & & \\ & \ddots & 0 & \\ & & 1 & 0 \end{pmatrix}$$
is a standard cycle matrix. The identity matrix of order 1 is regarded as a cycle matrix of order 1.
If $C_1$ is a permutation matrix of order $n$ with exactly $k$ zero diagonal elements (here $2 \le k \le n$), $C_1^k = I_n$, and $C_1^i \ne I_n$ for any $i$ ($1 \le i < k$), then $C_1$ is termed a generalized cycle matrix of type I with cycle order $k$.
If $C_2$ is a (0, 1) matrix of order $n$ with $\operatorname{rank} C_2 = k$ and exactly $k$ non-zero entries ($2 \le k \le n$), $C_2^k$ is a diagonal matrix of rank $k$, and $C_2^i$ is non-diagonal for $1 \le i < k$, then $C_2$ will be called a generalized cycle matrix of type II with cycle order $k$. Obviously, a generalized cycle matrix of type II plus a suitable diagonal (0, 1) matrix gives a generalized cycle matrix of type I with the same cycle order.
Let $A$ and $B$ be two monomial matrices of order $n$. If there is a permutation matrix $T$ such that $B = T^{-1}AT$, then $A$ and $B$ are permutationally similar. The permutation similarity relation is an equivalence relation; hence the set of permutation matrices (or monomial matrices) of order $n$ splits naturally into equivalence classes.

3. Main Results

In this section, we give three main theorems, concerning the canonical form, the decomposition, and the factorization of a permutation matrix, respectively.
Theorem 1 solves the following three problems (which arise naturally from the definitions):
(a)
What is the canonical form of a permutation similarity class?
(b)
How to generate the canonical form of a given permutation matrix?
(c)
If $B$ is the canonical form of the permutation matrix $A$, how can one find a permutation matrix $T$ such that $B = T^{-1}AT$?
Now we give some theorems that solve these problems.
Theorem 1. (Similarity Theorem)
For any permutation matrix $A$ of order $n$, there is a permutation matrix $T$ such that $T^{-1}AT = \mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}$, where
$$N_{k_i} = \begin{pmatrix} 0 & & & 1 \\ 1 & \ddots & & \\ & \ddots & 0 & \\ & & 1 & 0 \end{pmatrix}$$
is a cycle matrix of order $k_i$ in standard form ($i = 1, 2, \dots, r$), $2 \le k_1 \le k_2 \le \dots \le k_r$, $0 \le r \le \lfloor n/2 \rfloor$, $0 \le t \le n$, and $\sum_{i=1}^{r}k_i + t = n$. Here $T$, $t$, $r$ and the $k_i$ are determined by $A$.
If $A$ is an identity matrix, then $t = n$ and $r = 0$. When $A$ is a cycle matrix, $t = 0$, $r = 1$, $k_1 = n$. In this theorem, the quasi-diagonal (block-diagonal) matrix $\mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}$ will be called the canonical form of a permutation matrix under the permutational similarity relation.
The main idea of the proof is similar to that concerning the decomposition of a root subspace into cyclic subspaces.
In a root subspace $V_\lambda$ associated with a linear transformation $\mathcal{B}$ and an eigenvalue $\lambda$ of its matrix $B$, if $v$ is a root vector of height $n$, then the subspace spanned by $\{(\mathcal{B}-\lambda I)^{n-1}v, (\mathcal{B}-\lambda I)^{n-2}v, \dots, (\mathcal{B}-\lambda I)v, v\}$ is a cyclic subspace, and $V_\lambda$ is the direct sum of some cyclic subspaces.
Proof. 
For any permutation matrix $A$ of order $n$, let $\mathcal{A}$ be the linear transformation defined on the vector space $\mathbb{R}^n$ with basis
$$\mathcal{B} = \{e_1, e_2, \dots, e_n\},$$
where
$$e_i = (\underbrace{0, \dots, 0}_{i-1}, 1, \underbrace{0, \dots, 0}_{n-i})^{\mathrm{T}}, \quad (i = 1, 2, \dots, n).$$
Here the regular letter "T" in the superscript means transposition. Suppose $A$ is the matrix of the transformation $\mathcal{A}$ in the basis $\mathcal{B}$; then, for any vector $\alpha \in \mathbb{R}^n$ with coordinate column $x$ (in the basis $\mathcal{B}$), the coordinate column of $\mathcal{A}\alpha$ is $Ax$, i.e., $\mathcal{A}\alpha = \mathcal{B}(Ax)$. Here coordinates are written as column vectors.
It is clear that the coordinate column of $e_i$ in the basis $\mathcal{B}$ is $(0, \dots, 0, 1, 0, \dots, 0)^{\mathrm{T}}$, with the 1 in the $i$-th position. Since $A$ is a permutation matrix, $\mathcal{A}e_i$ corresponds to the $i$-th column of $A$.
We decompose $\mathbb{R}^n$ into subspaces. In each subspace $V_i$ there is a basis $\{e_i, \mathcal{A}e_i, \mathcal{A}^2e_i, \dots, \mathcal{A}^{k_i-1}e_i\}$, where $\mathcal{A}^{k_i}e_i = e_i$ and the positive integer $k_i$ is minimal with this property, i.e., $k_i$ is the dimension of the cyclic subspace. In this basis, the matrix of the transformation $\mathcal{A}$ restricted to $V_i$ can be written as
$$\begin{pmatrix} 0 & & & 1 \\ 1 & \ddots & & \\ & \ddots & 0 & \\ & & 1 & 0 \end{pmatrix}_{k_i \times k_i}.$$
Let us now find all these cyclic subspaces. In order to describe the procedure precisely and concisely, we will use some auxiliary variables.
Step 1: Let $S = \{1, 2, \dots, n\}$, $\mathcal{C} = \{e_i \mid i \in S\}$, $a_{11} = \min S$, $F_1 = [a_{11}]$, $G_1 = [e_{a_{11}}]$. (Here $F_1$ and $G_1$ are sequences, i.e., sets equipped with precedence.)
For the first cyclic subspace, of course $\mathcal{A}e_{a_{11}} \in \mathcal{C}$. If $\mathcal{A}e_{a_{11}} \ne e_{a_{11}}$, set $e_{a_{12}} = \mathcal{A}e_{a_{11}}$ and put $a_{12}$ and $e_{a_{12}}$ at the end of the sequences $F_1$ and $G_1$, respectively. If $\mathcal{A}e_{a_{1,j}} \ne e_{a_{11}}$, set $e_{a_{1,j+1}} = \mathcal{A}e_{a_{1,j}}$ (i.e., $\mathcal{A}^je_{a_{11}} = e_{a_{1,j+1}}$) and add $a_{1,j+1}$ and $e_{a_{1,j+1}}$ at the end of the sequences $F_1$ and $G_1$, respectively ($j = 1, 2, \dots$). Since $\mathcal{A}e_i \in \mathcal{C}$ ($i \in S$), there is an integer $h_1$ such that $\mathcal{A}e_{a_{1,h_1}} = e_{a_{11}}$ (otherwise the sequence $e_{a_{11}}, \mathcal{A}e_{a_{11}}, \mathcal{A}^2e_{a_{11}}, \mathcal{A}^3e_{a_{11}}, \dots$ would be infinite). Suppose that $h_1$ is the minimal integer satisfying this condition ($1 \le h_1 \le n$). It is clear that $\mathcal{A}^{h_1}e_{a_{11}} = e_{a_{11}}$ and $\mathcal{A}^{h_1}e_{a_{1j}} = e_{a_{1j}}$ ($1 \le j \le h_1$). It is possible that $h_1 = 1$ or $h_1 = n$. At last $|F_1| = |G_1| = h_1$. Finally, remove the elements of $G_1$ from $\mathcal{C}$, and the elements of $F_1$ from $S$.
The first cyclic subspace is thus spanned by the basis $G_1 = [e_{a_{11}}, e_{a_{12}}, \dots, e_{a_{1h_1}}]$. Usually, a basis of a linear space is denoted by braces, not brackets; however, braces denote sets, which disregard precedence. In order to avoid ambiguity, here we use brackets, which stand for sequences, where precedence is relevant. The dimension of this subspace is $|F_1| = h_1$. The matrix of the transformation $\mathcal{A}$, restricted to this cyclic subspace, is
$$N_{h_1} = \begin{pmatrix} 0 & & & 1 \\ 1 & \ddots & & \\ & \ddots & 0 & \\ & & 1 & 0 \end{pmatrix}_{h_1 \times h_1}.$$
Let us now search for the next cyclic subspace, if it exists.
Step 2: If $S \ne \varnothing$, let $a_{21} = \min S$, $F_2 = [a_{21}]$, $G_2 = [e_{a_{21}}]$. It is clear that $\mathcal{A}e_{a_{21}} \in \mathcal{C}$. (Otherwise we would have $\mathcal{A}e_{a_{21}} \in G_1$. However, since all the elements of $G_1$ have been removed from $\mathcal{C}$, there would exist $k_0 \ne 0$ such that $\mathcal{A}^{k_0}e_{a_{11}} = \mathcal{A}e_{a_{21}}$, so $\mathcal{A}^{k_0-1}e_{a_{11}} = e_{a_{21}}$ as $\mathcal{A}$ is invertible, which means that $e_{a_{21}} = \mathcal{A}^{k_0-1}e_{a_{11}}$ is in the set $G_1$, a contradiction.) If $\mathcal{A}^{i-1}e_{a_{21}} \ne e_{a_{21}}$, set $\mathcal{A}^{i-1}e_{a_{21}} = e_{a_{2i}}$ ($i = 2, 3, \dots$) and add $a_{2i}$ and $e_{a_{2i}}$ at the end of the sequences $F_2$ and $G_2$, respectively. There will be an $h_2$ such that $\mathcal{A}^{h_2}e_{a_{21}} = e_{a_{21}}$ (let $h_2$ be the minimal integer satisfying this condition; it is possible that $h_2 = 1$ or $h_2 = n - h_1$). Obviously, $\mathcal{A}^{h_2}e_{a_{2i}} = e_{a_{2i}}$ ($1 \le i \le h_2$). Then remove the elements of $G_2$ from $\mathcal{C}$, and remove the elements of $F_2$ from $S$.
Now another cyclic subspace is spanned by the basis $G_2 = [e_{a_{21}}, e_{a_{22}}, \dots, e_{a_{2h_2}}]$. The dimension of this subspace is $|F_2| = h_2$. The matrix of the transformation $\mathcal{A}$ restricted to this cyclic subspace is $N_{h_2}$.
Step 3: If $S \ne \varnothing$, go to Step 2 and construct $F_3, F_4, \dots$ and $G_3, G_4, \dots$. This leads to other cyclic subspaces, their bases, and the matrices of the transformation $\mathcal{A}$ restricted to these cyclic subspaces. The procedure stops after a finite number of steps since $n$ is finite.
Assume that we end up with $F_1, F_2, \dots, F_u$ and $G_1, G_2, \dots, G_u$ such that $\bigcup_{i=1}^{u}F_i = \{1, 2, \dots, n\}$, $\bigcup_{i=1}^{u}G_i = \{e_1, e_2, \dots, e_n\}$, and $F_i \cap F_j = G_i \cap G_j = \varnothing$ ($1 \le i \ne j \le u$).
There is the possibility that $u = 1$ (when $A$ is a cycle matrix of order $n$) or $u = n$ (when $A$ is an identity matrix).
Step 4: Sort $F_1, F_2, \dots, F_u$ by cardinality, so that $|F_1| \le |F_2| \le \dots \le |F_u|$, and sort the $G_i$ correspondingly, i.e., $G_i = [e_x \mid x \in F_i]$ ($i = 1, 2, \dots, u$).
Suppose $|F_1| = |F_2| = \dots = |F_t| = 1$. If $t = n$, then $A$ is an identity matrix. It is possible that $t = 0$.
Let $r = u - t$. Denote the unique element in $G_i$ by $e_i$ ($i = 1, 2, \dots, t$). Let $k_j = |G_{t+j}|$ and denote the elements in $G_{t+j}$ by $e_{j,v}$ ($j = 1, 2, \dots, r$; $v = 1, 2, \dots, k_j$). Then the matrix of $\mathcal{A}$ restricted to the subspace spanned by the basis $\mathcal{D}_0 = \{e_1, e_2, \dots, e_t\}$ is $I_t$, since $\mathcal{A}e_i = e_i$ ($i = 1, 2, \dots, t$), or $\mathcal{A}\mathcal{D}_0 = \mathcal{D}_0I_t$; and the matrix of $\mathcal{A}$ restricted to the subspace spanned by the basis $\mathcal{D}_j = G_{t+j} = \{e_{j,1}, e_{j,2}, \dots, e_{j,k_j}\}$ ($j = 1, 2, \dots, r$) is
$$N_{k_j} = \begin{pmatrix} 0 & & & 1 \\ 1 & \ddots & & \\ & \ddots & 0 & \\ & & 1 & 0 \end{pmatrix},$$
which is a cycle matrix of order $k_j$, as $\mathcal{A}e_{j,v} = e_{j,v+1}$ ($v = 1, 2, \dots, k_j - 1$) and $\mathcal{A}e_{j,k_j} = e_{j,1}$, i.e., $\mathcal{A}\mathcal{D}_j = \mathcal{D}_jN_{k_j}$ ($j = 1, 2, \dots, r$). So the matrix of $\mathcal{A}$ with basis
$$\mathcal{D} = \{e_1, e_2, \dots, e_t;\ e_{1,1}, e_{1,2}, \dots, e_{1,k_1};\ \dots;\ e_{r,1}, e_{r,2}, \dots, e_{r,k_r}\}$$
is
$$B = \begin{pmatrix} I_t & & & \\ & N_{k_1} & & \\ & & \ddots & \\ & & & N_{k_r} \end{pmatrix} = \mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}.$$
Since $\mathcal{D}$ is a reordering of $\mathcal{B}$, there is a permutation matrix $T$ such that $\mathcal{D} = \mathcal{B}T$. Then $B = T^{-1}AT$, and Theorem 1 is proved. □
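The constructive procedure in Steps 1–4 is easy to mechanize. The following Python/NumPy sketch (our own illustration; the function name canonical_form is not from the paper) traces the cycles of a permutation matrix A, sorts them by length, and returns the canonical form B together with a transforming permutation matrix T such that $B = T^{-1}AT$:

```python
import numpy as np

def canonical_form(A):
    """Return (B, T) with B = T^{-1} A T in the canonical form of Theorem 1.

    A is an n x n permutation matrix as a 0-1 NumPy array; A e_j is the j-th
    column of A, so the cycles of A can be traced exactly as in Steps 1-4.
    """
    n = A.shape[0]
    image = [int(np.argmax(A[:, j])) for j in range(n)]   # A e_j = e_{image[j]}
    remaining, cycles = set(range(n)), []
    while remaining:                         # Steps 1-3: trace one cycle at a time
        start = min(remaining)               # a_{i,1} = min S
        cycle, j = [start], image[start]
        while j != start:                    # follow e, A e, A^2 e, ...
            cycle.append(j)
            j = image[j]
        cycles.append(cycle)
        remaining -= set(cycle)
    cycles.sort(key=len)                     # Step 4: sort by cardinality
    order = [j for c in cycles for j in c]   # the new basis D, as 0-based indices
    T = np.zeros((n, n), dtype=int)
    for new, old in enumerate(order):        # column `new` of T is e_{old}
        T[old, new] = 1
    B = T.T @ A @ T                          # T^{-1} = T^T for a permutation matrix
    return B, T
```

Applied to the matrix $P_2$ of the example below, this sketch reproduces $B_2 = \mathrm{diag}\{I_2, N_2, N_3\}$ and one admissible choice of $T_2$.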
Take the matrix
$$P_2 = \begin{pmatrix}
0 & 0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1
\end{pmatrix}$$
as an example, and assume it is the matrix of a transformation $\mathcal{P}_2$ in the basis $\{e_1, e_2, e_3, \dots, e_7\}$ of $\mathbb{R}^7$.
Let us now search for the first cyclic subspace.
Since $\mathcal{P}_2e_1 = e_6$ and $\mathcal{P}_2e_6 = e_1$, we have $F_1 = [1, 6]$, and the first cyclic subspace is spanned by the basis $\{e_1, e_6\}$. Its dimension is $|F_1| = 2$. The matrix of the transformation $\mathcal{P}_2$ restricted to this cyclic subspace is $N_2 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$.
Since $\mathcal{P}_2e_2 = e_3$, $\mathcal{P}_2e_3 = e_4$, $\mathcal{P}_2e_4 = e_2$, we have $F_2 = [2, 3, 4]$ and $|F_2| = 3$. The second cyclic subspace is spanned by the basis $\{e_2, e_3, e_4\}$, and its dimension is $|F_2| = 3$. The matrix of the transformation $\mathcal{P}_2$ restricted to this cyclic subspace is $N_3 = \begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}$.
Since $\mathcal{P}_2e_5 = e_5$, $F_3 = [5]$, $|F_3| = 1$. The third cyclic subspace is spanned by the basis $\{e_5\}$, and its dimension is $|F_3| = 1$. The matrix of $\mathcal{P}_2$ restricted to this cyclic subspace is $N_1 = (1)$. Finally, since $\mathcal{P}_2e_7 = e_7$, $F_4 = [7]$, $|F_4| = 1$. The fourth cyclic subspace is spanned by the basis $\{e_7\}$, its dimension is $|F_4| = 1$. The matrix of $\mathcal{P}_2$ restricted to this cyclic subspace is $N_1 = (1)$.
Overall, we have that $P_2$ is permutationally similar to the canonical form
$$B_2 = \mathrm{diag}\{I_2, N_2, N_3\} = \begin{pmatrix}
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0
\end{pmatrix},$$
or
$$\mathcal{P}_2(e_5;\ e_7;\ e_1, e_6;\ e_2, e_3, e_4) = (e_5;\ e_7;\ e_6, e_1;\ e_3, e_4, e_2) = (e_5;\ e_7;\ e_1, e_6;\ e_2, e_3, e_4)\,B_2.$$
Now we find $T_2$ such that $B_2 = T_2^{-1}P_2T_2$.
It follows from
$$(e_5, e_7, e_1, e_6, e_2, e_3, e_4) = (e_1, e_2, e_3, e_4, e_5, e_6, e_7)\begin{pmatrix}
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1 \\
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0
\end{pmatrix}$$
that we may denote
$$T_2 = \begin{pmatrix}
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1 \\
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0
\end{pmatrix},$$
which implies
$$T_2^{-1} = T_2^{T} = \begin{pmatrix}
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1 \\
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0
\end{pmatrix},$$
so $P_2 = T_2B_2T_2^{-1}$; equivalently, $B_2 = T_2^{-1}P_2T_2$, i.e.,
$$\begin{pmatrix}
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1 \\
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0
\end{pmatrix} = \begin{pmatrix}
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1 \\
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0
\end{pmatrix}\begin{pmatrix}
0 & 0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1
\end{pmatrix}\begin{pmatrix}
0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1 \\
1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0
\end{pmatrix}.$$
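This example can be checked mechanically; a minimal NumPy verification of $B_2 = T_2^{-1}P_2T_2$, assuming the matrices as displayed above:

```python
import numpy as np

P2 = np.array([[0, 0, 0, 0, 0, 1, 0],
               [0, 0, 0, 1, 0, 0, 0],
               [0, 1, 0, 0, 0, 0, 0],
               [0, 0, 1, 0, 0, 0, 0],
               [0, 0, 0, 0, 1, 0, 0],
               [1, 0, 0, 0, 0, 0, 0],
               [0, 0, 0, 0, 0, 0, 1]])

# Columns of T2 are e5, e7, e1, e6, e2, e3, e4 (1-indexed, as in the text).
T2 = np.zeros((7, 7), dtype=int)
for col, row in enumerate([5, 7, 1, 6, 2, 3, 4]):
    T2[row - 1, col] = 1

B2 = T2.T @ P2 @ T2        # T2^{-1} = T2^T
print(B2)                  # expected: diag{I_2, N_2, N_3}
```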
Theorem 2. (Decomposition Theorem)
For any permutation matrix $A$ of order $n$, if $A$ is not the identity, then there are some generalized cycle matrices $Q_1, Q_2, \dots, Q_r$ of type II and a diagonal matrix $D_t$ of rank $t$ such that $A = Q_1 + Q_2 + \dots + Q_r + D_t$, where the non-zero elements of $D_t$ are all ones, $\sum_{i=1}^{r}\operatorname{rank} Q_i + t = n$, and $1 \le r \le \lfloor n/2 \rfloor$; $r$, the $Q_i$ ($i = 1, 2, \dots, r$) and $D_t$ are determined by $A$.
If the cycle order of $Q_i$ is $k_i$ ($i = 1, 2, \dots, r$), then $2 \le \sum_{i=1}^{r}k_i \le n$. If $A$ is a cycle matrix, then $t = 0$, $r = 1$, $k_1 = n$.
The main idea of the proof may be summarized as follows.
Denote by $O_m$ a zero square matrix of order $m$. By Theorem 1, there is a permutation matrix $T$ such that $T^{-1}AT = \mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}$. Then consider the matrices
$$M_0 = \mathrm{diag}\{I_t, O_{k_1}, \dots, O_{k_r}\},$$
$$M_1 = \mathrm{diag}\{O_t, N_{k_1}, O_{k_2}, \dots, O_{k_r}\},$$
$$M_2 = \mathrm{diag}\{O_t, O_{k_1}, N_{k_2}, O_{k_3}, \dots, O_{k_r}\},$$
$$\vdots$$
$$M_r = \mathrm{diag}\{O_t, O_{k_1}, O_{k_2}, \dots, O_{k_{r-1}}, N_{k_r}\}.$$
Clearly,
$$\mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\} = M_0 + M_1 + M_2 + \dots + M_r,$$
and
$$A = T\,\mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}\,T^{-1} = TM_0T^{-1} + TM_1T^{-1} + TM_2T^{-1} + \dots + TM_rT^{-1}.$$
It is clear that $\operatorname{rank} M_i = k_i$ ($i = 1, 2, \dots, r$) and $\operatorname{rank} M_0 = t$. The matrix $M_i$ is a generalized cycle matrix of type II with cycle order $k_i$. Since $T$ is invertible, $\operatorname{rank} TM_iT^{-1} = \operatorname{rank} M_i = k_i$. Moreover, $T$ and $T^{-1}$ are permutation matrices, so $Q_i = TM_iT^{-1}$ is also a 0-1 matrix with the same rank. Since $N_{k_i}^{k_i} = I_{k_i}$,
$$M_i^{k_i} = \mathrm{diag}\{O_t, O_{k_1}, \dots, I_{k_i}, \dots, O_{k_r}\}$$
is a diagonal matrix of rank $k_i$, and $\left(TM_iT^{-1}\right)^{k_i} = TM_i^{k_i}T^{-1}$ is a diagonal matrix of rank $k_i$, too. If the exponent is less than $k_i$, the conclusion does not hold (but it would cost us some more words to prove this proposition). Then
$$Q_i = TM_iT^{-1}$$
is a generalized cycle matrix of type II with cycle order $k_i$. Analogously,
$$D_t = TM_0T^{-1} = T\,\mathrm{diag}\{I_t, O_{k_1}, \dots, O_{k_r}\}\,T^{-1}$$
is a diagonal matrix of rank $t$.
Following this idea, we may prove Theorem 2 in a different way. However, this requires obtaining $Q_i$ and $D_t$ directly, which may be challenging. We prefer to proceed with another proof, following the idea of the proof of Theorem 1; in this way, we construct $Q_i$ and $D_t$ more conveniently.
Proof. 
Starting from the $F_i$ generated above, one may construct a 0-1 matrix $D_t$ of order $n$ such that the $j$-th column of $D_t$ is the $j$-th column of $A$ for $j \in \bigcup_{i=1}^{t}F_i$, and the other columns of $D_t$ are zero vectors. Of course, $D_t$ is a diagonal matrix of rank $t$, as for such $j$ the $j$-th column of $A$ is $e_j$ (by definition, $Ae_j = e_j$).
Then construct a 0-1 matrix $Q_i$ ($i = 1, 2, \dots, r$) of order $n$ such that the $j$-th column of $Q_i$ is the $j$-th column of $A$ for $j \in F_{t+i}$, and the other columns of $Q_i$ are zero vectors. As
$$\left(\bigcup_{i=1}^{t}F_i\right)\cup\left(\bigcup_{i=1}^{r}F_{t+i}\right) = \bigcup_{i=1}^{u}F_i = \{1, 2, \dots, n\}$$
and
$$F_{i_1}\cap F_{i_2} = \varnothing \quad (1 \le i_1 \ne i_2 \le u),$$
every column of $A$ appears exactly once in one of the matrices ($D_t$ or some $Q_i$, denoted by $M$) in the expression $\sum_{i=1}^{r}Q_i + D_t$, in the same position as it appears in $A$. Besides, the columns in the same position in the matrices other than $M$ appearing in the sum $\sum_{i=1}^{r}Q_i + D_t$ are all zero vectors. Overall, we have
$$\sum_{i=1}^{r}Q_i + D_t = A.$$
Let us now prove that $Q_i$ is a generalized cycle matrix of type II with cycle order $k_i$.
Assume that the members of $F_{t+i}$ ($i = 1, 2, \dots, r$) are $a_{i,1}, a_{i,2}, \dots, a_{i,k_i}$, and that $F_{t+i} = F_s$ for some $s$ ($1 \le s \le u$). Then we have a relation between the members of $G_{t+i}$ and the members of a certain $G_s$, i.e., $e_{i,v} = e_{a_{i,v}} \in G_s$, $v = 1, 2, \dots, k_i$. By the definition of $F_s$, we know that $Ae_{a_{i,v}} = e_{a_{i,v+1}}$ ($v = 1, 2, \dots, k_i - 1$) and $Ae_{a_{i,k_i}} = e_{a_{i,1}}$, so $e_{a_{i,v+1}}$ is the $a_{i,v}$-th column of $A$.
As $Q_i$ consists of some zero vectors and $k_i$ columns of $A$, and the columns of $A$ are linearly independent, the rank of $Q_i$ is $k_i$. The $a_{i,v}$-th column of $Q_i$ is the $a_{i,v}$-th column of $A$, so $Q_ie_{a_{i,v}} = e_{a_{i,v+1}}$ ($v = 1, 2, \dots, k_i - 1$), $Q_ie_{a_{i,k_i}} = e_{a_{i,1}}$, and $Q_ie_l = 0$ for $e_l \in \mathcal{D}\setminus G_{t+i}$. Therefore $Q_i^ve_{a_{i,1}} = e_{a_{i,v+1}}$ ($v = 1, 2, \dots, k_i - 1$) and $Q_i^{k_i}e_{a_{i,1}} = e_{a_{i,1}}$ (so $Q_i^v$ is not diagonal, since $Q_i^ve_{a_{i,1}} = e_{a_{i,v+1}} \ne e_{a_{i,1}}$). Then
$$Q_i^{k_i}e_{a_{i,v}} = Q_i^{v-1}Q_i^{k_i-v+1}e_{a_{i,v}} = Q_i^{v-1}e_{a_{i,1}} = e_{a_{i,v}}, \quad (v = 1, 2, \dots, k_i).$$
Hence $Q_i^{k_i}$ is a diagonal matrix of rank $k_i$. Therefore $Q_i$ is a generalized cycle matrix of type II with cycle order $k_i$. □
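A small Python/NumPy sketch of this construction (our own illustration, reusing the cycle tracing from the sketch after Theorem 1): each $Q_i$ keeps the columns of $A$ indexed by one cycle of length at least 2, and $D_t$ keeps the columns indexed by fixed points.

```python
import numpy as np

def type2_decomposition(A):
    """Split a permutation matrix A as sum(Q_i) + D_t, as in Theorem 2.

    Each Q_i keeps the columns of A indexed by one cycle of length >= 2 and
    zeroes the others; D_t keeps the columns belonging to fixed points.
    """
    n = A.shape[0]
    image = [int(np.argmax(A[:, j])) for j in range(n)]
    remaining, cycles = set(range(n)), []
    while remaining:
        start = min(remaining)
        cycle, j = [start], image[start]
        while j != start:
            cycle.append(j)
            j = image[j]
        cycles.append(cycle)
        remaining -= set(cycle)
    D_t = np.zeros_like(A)
    Qs = []
    for c in cycles:
        M = np.zeros_like(A)
        M[:, c] = A[:, c]          # copy the columns indexed by this F_{t+i} (or F_i)
        if len(c) == 1:
            D_t += M               # fixed point: contributes to the diagonal part
        else:
            Qs.append(M)           # cycle of length >= 2: a type-II cycle matrix
    assert np.array_equal(sum(Qs) + D_t, A)
    return Qs, D_t
```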
Theorem 3. (Factorization Theorem)
For any permutation matrix $A$ of order $n$, if $A$ is not the identity, then there are some generalized cycle matrices $P_1, P_2, \dots, P_r$ of type I such that $A = P_1P_2\cdots P_r$, where $1 \le r \le \lfloor n/2 \rfloor$; $r$ and the $P_i$ ($i = 1, 2, \dots, r$) are determined by $A$. Moreover, $P_{i_1}$ and $P_{i_2}$ commute ($1 \le i_1 \ne i_2 \le r$).
If the cycle order of $P_i$ is $k_i$ ($i = 1, 2, \dots, r$), then $2 \le \sum_{i=1}^{r}k_i \le n$.
Since $T^{-1}AT = \mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}$, for convenience we denote
$$Y_1 = \mathrm{diag}\{I_t, N_{k_1}, I_{k_2}, \dots, I_{k_r}\},$$
$$Y_2 = \mathrm{diag}\{I_t, I_{k_1}, N_{k_2}, I_{k_3}, \dots, I_{k_r}\},$$
$$\vdots$$
$$Y_r = \mathrm{diag}\{I_t, I_{k_1}, I_{k_2}, \dots, I_{k_{r-1}}, N_{k_r}\}.$$
Obviously,
$$\mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\} = Y_1Y_2\cdots Y_r$$
and
$$A = T\,\mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}\,T^{-1} = TY_1Y_2\cdots Y_rT^{-1} = \left(TY_1T^{-1}\right)\left(TY_2T^{-1}\right)\cdots\left(TY_rT^{-1}\right).$$
Obviously, $N_{k_i}^{k_i} = I_{k_i}$ and $N_{k_i}^{k_i-j} \ne I_{k_i}$ for $0 < j < k_i$.
So $Y_i^{k_i} = I_n$ and $Y_i^{k_i-j} \ne I_n$ for $0 < j < k_i$.
It is clear that $Y_i$ is a generalized cycle matrix of type I with cycle order $k_i$ ($i = 1, 2, \dots, r$), and it is thus sufficient to prove that
$$P_i = TY_iT^{-1}$$
is a generalized cycle matrix of type I with cycle order $k_i$. Rather obviously, $P_i^{k_i} = TY_i^{k_i}T^{-1} = TI_nT^{-1} = I_n$; however, it is not easy to prove directly that there are exactly $k_i$ vanishing entries on the diagonal of $P_i$, and that $k_i$ is the minimal positive integer satisfying the condition $P_i^{k_i} = I_n$.
Proof. 
Let $D_t^{(b)} = I_n - D_t$, where $D_t$ is determined by Equation (5). Then
$$\operatorname{rank} D_t = t, \qquad \operatorname{rank} D_t^{(b)} = n - t.$$
Now build a 0-1 matrix $J_i^{(a)}$ ($i = 1, 2, \dots, r$) of order $n$ such that the $j$-th column of $J_i^{(a)}$ is the $j$-th column of $I_n$ for $j \in F_{t+i}$, and the other columns of $J_i^{(a)}$ are zero vectors. So
$$\sum_{i=1}^{r}J_i^{(a)} + D_t = I_n.$$
Let $J_i^{(b)} = I_n - J_i^{(a)}$; then
$$\operatorname{rank} J_i^{(a)} = k_i, \qquad \operatorname{rank} J_i^{(b)} = n - k_i.$$
We have
$$D_t^{(b)}D_t = D_tD_t^{(b)} = 0, \quad J_i^{(a)}J_i^{(b)} = J_i^{(b)}J_i^{(a)} = 0, \quad Q_iJ_i^{(b)} = J_i^{(b)}Q_i = 0,$$
$$Q_{i_1}J_{i_2}^{(a)} = J_{i_2}^{(a)}Q_{i_1} = 0, \quad J_{i_1}^{(a)}J_{i_2}^{(a)} = J_{i_2}^{(a)}J_{i_1}^{(a)} = 0, \quad Q_{i_1}Q_{i_2} = Q_{i_2}Q_{i_1} = 0,$$
and
$$Q_{i_1}J_{i_2}^{(b)} = J_{i_2}^{(b)}Q_{i_1} = Q_{i_1} \ne 0, \quad J_{i_1}^{(a)}J_{i_2}^{(b)} = J_{i_2}^{(b)}J_{i_1}^{(a)} = J_{i_1}^{(a)} \ne 0,$$
where $1 \le i_1 \ne i_2 \le r$ and $Q_i$ is defined in the proof of Theorem 2 (above Equation (6)).
It is not difficult to prove that
$$J_{i_1}^{(b)}J_{i_2}^{(b)} = J_{i_2}^{(b)}J_{i_1}^{(b)} = I_n - J_{i_2}^{(a)} - J_{i_1}^{(a)}.$$
If we denote $P_i = Q_i + J_i^{(b)}$, we have
$$\operatorname{rank} P_i = n, \qquad I_n + Q_i = P_i + J_i^{(a)}.$$
Clearly,
$$P_{i_1}P_{i_2} = \left(Q_{i_1} + I_n - J_{i_1}^{(a)}\right)\left(Q_{i_2} + I_n - J_{i_2}^{(a)}\right) = Q_{i_1} + Q_{i_2} + I_n - J_{i_1}^{(a)} - J_{i_2}^{(a)},$$
and
$$P_{i_2}P_{i_1} = \left(Q_{i_2} + I_n - J_{i_2}^{(a)}\right)\left(Q_{i_1} + I_n - J_{i_1}^{(a)}\right) = Q_{i_2} + Q_{i_1} + I_n - J_{i_2}^{(a)} - J_{i_1}^{(a)}.$$
So $P_{i_1}P_{i_2} = P_{i_2}P_{i_1}$, i.e., $P_{i_1}$ and $P_{i_2}$ commute.
Hence
$$\prod_{i=1}^{r}P_i = \prod_{i=1}^{r}\left(Q_i + I_n - J_i^{(a)}\right) = \sum_{i=1}^{r}Q_i + I_n - \sum_{i=1}^{r}J_i^{(a)} = \sum_{i=1}^{r}Q_i + D_t = A.$$
We can also prove the equality above in a different way.
Because $Q_{i_1}Q_{i_2} = 0$ and $Q_{i_1}D_t = D_tQ_{i_1} = 0$ ($1 \le i_1 \ne i_2 \le r$), we have
$$\left(I_n + D_t\right)\prod_{i=1}^{r}\left(I_n + Q_i\right) = I_n + \sum_{i=1}^{r}Q_i + D_t = I_n + A.$$
Since $D_tQ_i = Q_iD_t = 0$, we have $D_tJ_i^{(a)} = J_i^{(a)}D_t = 0$.
By construction, when $1 \le i \le r$ and $v \in F_{t+i} \subseteq \bigcup_{i=1}^{u}F_i$, the $v$-th column (and the $v$-th row) of $P_j$ ($1 \le j \le r$, $j \ne i$) is equal to the $v$-th column (resp. the $v$-th row) of $I_n$, so the $v$-th column (and the $v$-th row) of $\prod_{1 \le j \le r,\, j \ne i}P_j$ is equal to the $v$-th column (resp. the $v$-th row) of $I_n$.
Therefore, when $J_i^{(a)}$ is multiplied by $\prod_{1 \le j \le r,\, j \ne i}P_j$, the $v$-th columns do not change, while the other columns of $J_i^{(a)}$ are zero vectors, so that
$$J_i^{(a)}\prod_{\substack{1 \le j \le r \\ j \ne i}}P_j = J_i^{(a)}.$$
For the same reason, $D_t\prod_{i=1}^{r}P_i = D_t$.
Noting that
$$\left(I_n + D_t\right)\prod_{i=1}^{r}\left(I_n + Q_i\right) = \left(I_n + D_t\right)\prod_{i=1}^{r}\left(P_i + J_i^{(a)}\right) = \left(I_n + D_t\right)\left(\prod_{i=1}^{r}P_i + \sum_{i=1}^{r}J_i^{(a)}\prod_{\substack{1 \le j \le r \\ j \ne i}}P_j\right)$$
(here we used $J_{i_1}^{(a)}J_{i_2}^{(a)} = J_{i_2}^{(a)}J_{i_1}^{(a)} = 0$ for $i_1 \ne i_2$)
$$= \left(I_n + D_t\right)\left(\prod_{i=1}^{r}P_i + \sum_{i=1}^{r}J_i^{(a)}\right) = \prod_{i=1}^{r}P_i + \sum_{i=1}^{r}J_i^{(a)} + D_t\prod_{i=1}^{r}P_i + D_t\sum_{i=1}^{r}J_i^{(a)} = \prod_{i=1}^{r}P_i + \sum_{i=1}^{r}J_i^{(a)} + D_t + 0 = \prod_{i=1}^{r}P_i + I_n,$$
we have that
$$\left(I_n + D_t\right)\prod_{i=1}^{r}\left(I_n + Q_i\right) = \prod_{i=1}^{r}P_i + I_n.$$
It follows from Equations (8) and (9) that
$$\prod_{i=1}^{r}P_i + I_n = I_n + A,$$
and thus $\prod_{i=1}^{r}P_i = A$.
Now we prove that $P_i$ is a generalized cycle matrix of type I with cycle order $k_i$.
Since $P_i = Q_i + J_i^{(b)}$ and $Q_iJ_i^{(b)} = J_i^{(b)}Q_i = 0$, we have $P_i^m = Q_i^m + \left(J_i^{(b)}\right)^m = Q_i^m + J_i^{(b)}$ ($m \in \mathbb{Z}^+$), and in particular $P_i^{k_i} = Q_i^{k_i} + J_i^{(b)}$.
It follows from $Q_i^{k_i}e_{a_{i,v}} = A^{k_i}e_{a_{i,v}} = e_{a_{i,v}}$ ($v = 1, 2, \dots, k_i$) that, for
$$e_l \in \mathcal{D}\setminus G_{t+i}, \qquad Q_ie_l = 0 \ \Longrightarrow\ Q_i^{k_i}e_l = 0.$$
Here $\mathcal{D}$ is defined in Equation (3).
On the other hand, $J_i^{(b)}e_{a_{i,v}} = 0$ ($v = 1, 2, \dots, k_i$) and $J_i^{(b)}e_l = e_l$ for $e_l \in \mathcal{D}\setminus G_{t+i}$.
So, for any $e_l$ in $\mathcal{B}$: if $e_l \in G_{t+i}$, then $\left(Q_i^{k_i} + J_i^{(b)}\right)e_l = Q_i^{k_i}e_l = e_l$; otherwise $e_l \notin G_{t+i}$, and $\left(Q_i^{k_i} + J_i^{(b)}\right)e_l = J_i^{(b)}e_l = e_l$.
This means that $P_i^{k_i}e_l = \left(Q_i^{k_i} + J_i^{(b)}\right)e_l = e_l$ ($e_l \in \mathcal{B}$), i.e., $P_i^{k_i}(e_1, e_2, \dots, e_n) = (e_1, e_2, \dots, e_n)$, or $P_i^{k_i}I_n = I_n$. (Actually, since $Q_i^{k_i} = J_i^{(a)}$, we have $P_i^{k_i} = Q_i^{k_i} + J_i^{(b)} = J_i^{(a)} + J_i^{(b)} = I_n$.)
When $1 \le m < k_i$, $Q_i^m$ is not diagonal, and neither is $Q_i^m + J_i^{(b)} = P_i^m$; in particular, $P_i^m \ne I_n$. Moreover, $P_i$ is a permutation matrix whose diagonal entries vanish exactly at the $k_i$ positions indexed by $F_{t+i}$: the diagonal of $Q_i$ is zero there, while $J_i^{(b)}$ contributes ones at the remaining $n - k_i$ positions. So $P_i$ is a generalized cycle matrix of type I with cycle order $k_i$. □
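Following the proof, a type-I factorization can be assembled from the type-II decomposition; a hedged Python/NumPy sketch (our own illustration, relying on the type2_decomposition sketch given after Theorem 2):

```python
import numpy as np

def type1_factorization(A):
    """Factor a permutation matrix A into commuting generalized cycle matrices
    of type I, P_i = Q_i + J_i^{(b)}, as in Theorem 3 (A not the identity)."""
    n = A.shape[0]
    Qs, _D_t = type2_decomposition(A)                # sketch given after Theorem 2
    Ps = []
    for Q in Qs:
        on_cycle = (Q.sum(axis=0) > 0).astype(int)   # columns indexed by F_{t+i}
        J_a = np.diag(on_cycle)                      # J_i^{(a)}
        J_b = np.eye(n, dtype=int) - J_a             # J_i^{(b)} = I_n - J_i^{(a)}
        Ps.append(Q + J_b)                           # P_i = Q_i + J_i^{(b)}
    prod = np.eye(n, dtype=int)
    for P in Ps:
        prod = prod @ P
    assert np.array_equal(prod, A)                   # P_1 P_2 ... P_r = A
    return Ps
```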

4. On the Number of Permutation Similarity Classes

The number of permutation similarity classes of permutation matrices of order $n$ is the partition number $p(n)$. There is a recursion formula for $p(n)$,
$$p(n) = p(n-1) + p(n-2) - p(n-5) - p(n-7) + \cdots + (-1)^{k-1}p\!\left(n - \frac{3k^2 \pm k}{2}\right) + \cdots = \sum_{k=1}^{k_1}(-1)^{k-1}p\!\left(n - \frac{3k^2 + k}{2}\right) + \sum_{k=1}^{k_2}(-1)^{k-1}p\!\left(n - \frac{3k^2 - k}{2}\right)$$
(see [7], p. 55), where
$$k_1 = \left\lfloor\frac{\sqrt{24n+1}-1}{6}\right\rfloor, \qquad k_2 = \left\lfloor\frac{\sqrt{24n+1}+1}{6}\right\rfloor,$$
and $p(0) = 1$. In the above formula, $\lfloor x\rfloor$ denotes the floor function, i.e., the maximum integer that is less than or equal to the real number $x$.
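The recurrence is straightforward to implement; a short Python sketch (memoized, with the upper limits of the sums handled implicitly by the loop condition):

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def partition(n):
    """Partition number p(n) via the pentagonal number recurrence above."""
    if n < 0:
        return 0
    if n == 0:
        return 1
    total, k = 0, 1
    while k * (3 * k - 1) // 2 <= n:
        sign = 1 if k % 2 == 1 else -1
        total += sign * partition(n - k * (3 * k - 1) // 2)
        if k * (3 * k + 1) // 2 <= n:
            total += sign * partition(n - k * (3 * k + 1) // 2)
        k += 1
    return total

# p(1), ..., p(10) = 1, 2, 3, 5, 7, 11, 15, 22, 30, 42
print([partition(n) for n in range(1, 11)])
```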
Asymptotically, we have (see, e.g., [8,9])
$$p(n) \sim \frac{1}{4n\sqrt{3}}\exp\!\left(\sqrt{\frac{2}{3}}\,\pi\,n^{1/2}\right).$$
This formula was obtained by Godfrey H. Hardy and Srinivasa Ramanujan in 1918 [10]. (In [11,12], one may find two different proofs; the evaluation of the constant can be found in [13].)
Formula (12) is relevant for theoretical analysis and very convenient for estimating the value of $p(n)$ by simple means. However, the accuracy of the asymptotic Formula (12) is limited when $n$ is small. Another celebrated formula, given in terms of a convergent series, was found by Rademacher in 1937, based on the work of Hardy and Ramanujan; see [7,14].
In [15], several other formulae modified from Formula (12) have been obtained, showing high accuracy and yet expressed in terms of elementary functions, e.g.,
$$p(n) \approx \left\lfloor\frac{\exp\!\left(\sqrt{\frac{2}{3}}\,\pi\sqrt{n}\right)}{4\sqrt{3}\,\bigl(n + C_2(n)\bigr)} + \frac{1}{2}\right\rfloor, \qquad 1 \le n \le 80,$$
with a relative error less than 0.004%, where
$$C_2(n) = \begin{cases}0.4527092482\,\sqrt{n + 4.35278} - 0.05498719946, & n = 3, 5, 7, \dots, 79;\\ 0.4412187317\,\sqrt{n - 2.01699} + 0.2102618735, & n = 4, 6, 8, \dots, 80;\end{cases}$$
and
$$p(n) \approx \left\lfloor\frac{\exp\!\left(\sqrt{\frac{2}{3}}\,\pi\sqrt{n}\right)}{4\sqrt{3}\,\bigl(n + a_2\sqrt{n + c_2} + b_2\bigr)} + \frac{1}{2}\right\rfloor, \qquad n \ge 80,$$
with a relative error less than $5\times10^{-8}$ when $n \ge 180$, where $a_2 = 0.4432884566$, $b_2 = 0.1325096085$ and $c_2 = 0.274078$.
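For a rough numerical feel of Formula (12), the following sketch compares the plain asymptotic estimate with exact values computed by the recurrence above (the refined formulas of [15] are not reproduced here):

```python
import math

def hardy_ramanujan(n):
    """Leading asymptotic estimate: exp(sqrt(2/3)*pi*sqrt(n)) / (4*sqrt(3)*n)."""
    return math.exp(math.pi * math.sqrt(2.0 * n / 3.0)) / (4.0 * math.sqrt(3.0) * n)

for n in (10, 50, 100, 200):
    exact = partition(n)                 # recurrence sketch from above
    approx = hardy_ramanujan(n)
    print(n, exact, round(approx), f"relative error = {abs(approx - exact) / exact:.3%}")
```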

5. Results for Monomial Matrices

Any monomial matrix $M$ can be written as a product of a permutation matrix $P$ and an invertible diagonal matrix $D$. Turning all the non-zero elements of $M$ into 1, we obtain a permutation matrix $P$. Suppose that the unique non-zero element in the $i$-th row of $M$ is $c_i$ and the unique non-zero element in the $i$-th column of $M$ is $d_i$, $i = 1, 2, \dots, n$. Let $D_1 = \mathrm{diag}\{c_1, c_2, \dots, c_n\}$ and $D_2 = \mathrm{diag}\{d_1, d_2, \dots, d_n\}$; then $M = PD_2 = D_1P$.
For the permutation matrix $P$, there is a permutation matrix $T$ such that $T^{-1}PT = Y$ has the canonical form $\mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}$, as proved in Theorem 1. In the expression $T^{-1}PT$, the permutation matrix $T^{-1}$ only permutes the rows, and $T$ only permutes the columns of $P$. Since the non-zero elements of $M$ and $P$ occupy the same positions, so do those of $T^{-1}MT$ and $T^{-1}PT$. Denote the unique non-zero element in the $i$-th row of $T^{-1}MT$ by $a_i$ and the unique non-zero element in the $i$-th column of $T^{-1}MT$ by $b_i$, $i = 1, 2, \dots, n$. Let $D_3 = \mathrm{diag}\{a_1, a_2, \dots, a_n\}$ and $D_4 = \mathrm{diag}\{b_1, b_2, \dots, b_n\}$; then $T^{-1}MT = D_3Y = YD_4$.
Finally, we have that
$$M = D_1TYT^{-1} = TYT^{-1}D_2 = TD_3YT^{-1} = TYD_4T^{-1}, \qquad Y = \mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}.$$
$D_1$, $D_2$, $D_3$ and $D_4$ can easily be obtained from $M$ directly. Their relations can be stated as follows:
$$D_2 = P^{-1}D_1P, \qquad D_3 = T^{-1}D_1T, \qquad D_4 = Y^{-1}D_3Y.$$
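A minimal NumPy sketch of this splitting (illustrative values; the function name split_monomial is ours):

```python
import numpy as np

def split_monomial(M):
    """Split a monomial matrix M into a permutation matrix P and diagonal
    matrices D1, D2 with M = D1 @ P = P @ D2."""
    P = (M != 0).astype(int)       # turn every non-zero entry into 1
    c = M.sum(axis=1)              # the unique non-zero element of each row
    d = M.sum(axis=0)              # the unique non-zero element of each column
    D1, D2 = np.diag(c), np.diag(d)
    assert np.allclose(M, D1 @ P) and np.allclose(M, P @ D2)
    return P, D1, D2

# Example with arbitrary illustrative values.
M = np.array([[0.0, 2.0, 0.0],
              [0.0, 0.0, -5.0],
              [7.0, 0.0, 0.0]])
P, D1, D2 = split_monomial(M)
```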

6. Conclusions

For any permutation matrix $A$ of order $n$, we can obtain its canonical form $B = \mathrm{diag}\{I_t, N_{k_1}, \dots, N_{k_r}\}$ and a permutation matrix $T$ by the algorithm described in the proof of Theorem 1, such that $B = T^{-1}AT$, where $t$, $r$, $k_1, \dots, k_r$ and $T$ are uniquely determined from $A$. Any matrix permutationally similar to $A$ has the same canonical form.
The permutation matrix $A$ can be written as the sum of some generalized cycle matrices $Q_1, Q_2, \dots, Q_r$ of type II and a diagonal matrix $D_t$ of rank $t$, where $t$ and $r$ are the same as those mentioned above; $Q_1, Q_2, \dots, Q_r$ and $D_t$ are determined from $A$ by Equations (4) and (5) in the proof of Theorem 2.
We can also express $A$ as the product of some generalized cycle matrices $P_1, P_2, \dots, P_r$ of type I, where $r$ is the same as that mentioned above; $P_1, P_2, \dots, P_r$ can be constructed from Equation (7) in the proof of Theorem 3.

7. Concluding Remark

We can also prove Theorem 1 by a combinatorial method, which may seem easier; however, the other two theorems cannot easily be proved in the same way. Theorem 1 can be restated in terms of permutation transformations (which are the members of the symmetric group $S_n$). If $L$ is a Latin square, every row (or column) of $L$ can be considered as a permutation transformation. When searching for the invariant isotopism group of $L$, we encounter the canonical form of the permutational similarity relation (of permutation matrices or of permutation transformations in $S_n$). So the conclusions obtained here can be applied to Latin squares or projective planes.

Author Contributions

Conceptualization, methodology, validation, writing—original draft—W.-W.L.; funding acquisition—Q.-W.W., W.-W.L. and X.H.; discussion, writing-final draft—X.H., Q.-W.W. and W.-W.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China (No. 11971294), the College Natural Scientific Research Projects organized by Anhui Provincial Department of Education (No. KJ2021A1198, KJ2021ZD0143) and Beijing Natural Science Foundation (No. 1224036).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Lam, C.W.H.; Kolesova, G.; Thiel, L. A computer search for finite projective planes of order 9. Discret. Math. 1991, 92, 187–195.
  2. Djordjević, B.D. Doubly stochastic and permutation solutions to AXA = XAX when A is a permutation matrix. Linear Algebra Its Appl. 2023, 661, 79–105.
  3. Chen, J.X.; Zhu, Z.L.; Fu, C.; Yu, H.; Zhang, Y. Reusing the permutation matrix dynamically for efficient image cryptographic algorithm. Signal Process. 2015, 111, 294–307.
  4. Jaballi, A.; Sakly, A.; Hajjaji, A.E. Permutation matrix based robust stability and stabilization for uncertain discrete-time switched TS fuzzy systems with time-varying delays. Neurocomputing 2016, 214, 527–534.
  5. Diab, H.; El-semary, A.M. Cryptanalysis and improvement of the image cryptosystem reusing permutation matrix dynamically. Signal Process. 2018, 148, 172–192.
  6. Nie, X.R.; Wang, Q.W.; Zhang, Y. A System of Matrix Equations over the Quaternion Algebra with Applications. Algebra Colloq. 2017, 24, 233–253.
  7. Hall, M., Jr. A survey of combinatorial analysis. In Some Aspects of Analysis and Probability; Surveys in Applied Mathematics; Kaplansky, I., Hall, M., Jr., Eds.; John Wiley and Sons, Inc.: New York, NY, USA; Chapman and Hall, Limited: London, UK, 1958; Volume IV, pp. 35–104.
  8. Weisstein, E.W. "Partition Function P." From MathWorld—A Wolfram Web Resource. 1999–2015. Available online: http://mathworld.wolfram.com/PartitionFunctionP.html (accessed on 20 November 2022).
  9. Apostol, T.M. Functions of Number Theory, Additive Number Theory: Unrestricted Partitions. In NIST Digital Library of Mathematical Functions (DLMF); Olver, F.W.J., Lozier, D.W., Boisvert, R.F., Eds.; National Institute of Standards and Technology (NIST): Gaithersburg, MD, USA, 2022. Available online: http://dlmf.nist.gov/27.14 (accessed on 20 December 2022).
  10. Hardy, G.H.; Ramanujan, S. Asymptotic Formulae in Combinatory Analysis. Proc. Lond. Math. Soc. 1918, 2, 75–115.
  11. Erdős, P. The Evaluation of the Constant in the Formula for the Number of Partitions of n. Ann. Math. Second Ser. 1942, 43, 437–450.
  12. Newman, D.J. A simplified proof of the partition formula. Mich. Math. J. 1962, 9, 283–287.
  13. Newman, D.J. The Evaluation of the Constant in the Formula for the Number of Partitions of n. Am. J. Math. 1951, 73, 599–601.
  14. Rademacher, H. A Convergent Series for the Partition Function p(n). Proc. Natl. Acad. Sci. USA 1937, 23, 78–84.
  15. Li, W.W. Estimation of the Partition Number: After Hardy and Ramanujan. arXiv 2016, arXiv:1612.05526.
