Pairing Optimization via Statistics: Algebraic Structure in Pairing Problems and Its Application to Performance Enhancement

Fujita, Naoki; Röhm, André; Mihana, Takatomo; Horisaki, Ryoichi; Li, Aohan; Hasegawa, Mikio; Naruse, Makoto

doi:10.3390/e25010146

Open AccessArticle

Pairing Optimization via Statistics: Algebraic Structure in Pairing Problems and Its Application to Performance Enhancement

by

Naoki Fujita

^1,*

,

André Röhm

¹

,

Takatomo Mihana

¹

,

Ryoichi Horisaki

¹

,

Aohan Li

²

,

Mikio Hasegawa

³

and

Makoto Naruse

¹

Department of Information Physics and Computing, Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan

²

Graduate School of Informatics and Engineering, The University of Electro-Communications, 1-5-1 Chofugaoka, Chofu-shi, Tokyo 182-8585, Japan

³

Department of Electrical Engineering, Faculty of Engineering, Tokyo University of Science, 6-3-1 Niijuku, Katsushika-ku, Tokyo 125-8585, Japan

^*

Author to whom correspondence should be addressed.

Entropy 2023, 25(1), 146; https://doi.org/10.3390/e25010146

Submission received: 2 November 2022 / Revised: 14 December 2022 / Accepted: 10 January 2023 / Published: 11 January 2023

(This article belongs to the Topic Complex Systems and Network Science)

Download

Browse Figures

Versions Notes

Abstract

:

Fully pairing all elements of a set while attempting to maximize the total benefit is a combinatorically difficult problem. Such pairing problems naturally appear in various situations in science, technology, economics, and other fields. In our previous study, we proposed an efficient method to infer the underlying compatibilities among the entities, under the constraint that only the total compatibility is observable. Furthermore, by transforming the pairing problem into a traveling salesman problem with a multi-layer architecture, a pairing optimization algorithm was successfully demonstrated to derive a high-total-compatibility pairing. However, there is substantial room for further performance enhancement by further exploiting the underlying mathematical properties. In this study, we prove the existence of algebraic structures in the pairing problem. We transform the initially estimated compatibility information into an equivalent form where the variance of the individual compatibilities is minimized. We then demonstrate that the total compatibility obtained when using the heuristic pairing algorithm on the transformed problem is significantly higher compared to the previous method. With this improved perspective on the pairing problem using fundamental mathematical properties, we can contribute to practical applications such as wireless communications beyond 5G, where efficient pairing is of critical importance. As the pairing problem is a special case of the maximum weighted matching problem, our findings may also have implications for other algorithms on fully connected graphs.

Keywords:

pairing; optimization; matching; maximum weighted matching; heuristic algorithm

1. Introduction

The procedure of generating pairs of elements among all entries of a given system often arises in various situations in science, technology, and economy [1,2,3,4,5,6,7]. Here, we call such a process pairing, and the number of elements is considered to be an even number for simplicity. One immediately obvious problem is that the number of pairing configurations grows rapidly with the number of elements. The number of possible pairings is given by

(n - 1)!!

, where n indicates the number of elements in the system and

!!

is the double factorial operator. For example, when n is 100, the total number of possible pairings is on the order of

10^{78}

. Hence, finding the pairing that maximizes the benefit of the total system is difficult.

Notably, the pairing problem corresponds to the maximum weighted matching (MWM) problem on the complete graph. Multiple algorithms exist for solving the MWM problem [8,9,10,11,12,13,14,15]. In contrast to these conventional methods, we propose a heuristic and fast algorithm at the cost of some performance. The advantage of a fast heuristic algorithm is that it can be useful in environments where weights change dynamically or a quick pairing is required, such as in communications technology. A heuristic algorithm for the MWM problem using deep reinforcement learning was recently proposed by [16] with a similar goal. Furthermore, our research proposes algorithms that work under the limited observation constraint, which is explained later. In our previous study, we proposed an algorithm with a computational complexity of

O (n^{2})

[17].

To the best of our knowledge, there is no exact algorithm that works on the order of

O (n^{2})

for arbitrary weights. For example, Gabow [9] proposed a MWM algorithm with a computation time of

| E | | V | + {| V |}^{2} log | V |

, where V is a set of vertices and E is a set of edges. However, randomized or approximate algorithms can reduce computational time for some cases. For example, Cygan et al. [12] developed a randomized algorithm with a computation time of

{L | V |}^{ω}

for graphs with integer weights (

ω < 2.373

is the exponent of

n \times n

matrix multiplication [18] and L is the maximum integer edge weight). Duan et al. [15] proposed an approximate algorithm achieving an approximation ratio of

(1 - ϵ) M

with a computation time of

| E | ϵ^{- 1} log ϵ^{- 1}

for arbitrary weights and

| E | ϵ^{- 1} log N

for integer weights (

ϵ

is a positive arbitrary value and M is the maximum possible weight matching value). Here,

| V | = n, | E | = n (n - 1) / 2

. Here, we aim to improve our previous pairing problem result, i.e., to determine a higher-accuracy heuristic algorithm that works with

O (n^{2})

computational complexity.

Note that the pairing problem should not be confused with the assignment problem, which is another special case of the MWM setting. The assignment problem requires the graph to be a weighted bipartite graph. Furthermore, in the assignment problem there are two classes of objects, where it is the goal to always match an object from the first class with an object from the second. However, in the pairing problem, there is only a single class of objects, and we allow any of them to be potentially paired with any other. The assignment problem is also related to the single-source shortest paths problem. Several well-known assignment algorithms [19,20,21] or single-source shortest paths algorithms [22] are known. For example, the Hungarian algorithm [19] solves the assignment problem

O (n^{3})

, the auction algorithm [20] works with parallelism and the Bellman–Ford algorithm runs with

O (| V | | E |)

[22]. However, in this study, we consider a fully connected graph with an even number of elements, where the MWM problem cannot be solved by assignment problem algorithms.

An example of a pairing problem is found in a recent communication technology called non-orthogonal multiple access (NOMA) [23,24,25,26,27,28,29]. In NOMA, multiple terminals simultaneously share a common frequency band to improve the efficiency of frequency usage. The simultaneous use of the same frequency band causes interference in the signals from the base station to each terminal. To overcome this problem, NOMA uses a signal processing method called successive interference cancellation (SIC) [30] to distinguish individual channel information in the power domain, allowing multiple terminals to rely on the same frequency band. For simplicity, here we consider that the number of terminals that can share a frequency is given by two. Herein, the usefulness of the whole system can be measured by the total communication quality, such as high data throughput and low error rate, which depends crucially on the method of pairing.

The most fundamental parameter of the pairing problem is the merit between any two given elements, which we call individual compatibility, while the summation of compatibilities for a given pairing is called its total compatibility. The detailed definition is introduced below. Our goal is to derive pairings yielding high total compatibility.

In general, we do not need to assume that the individual compatibility of a pair is observable, i.e., only the total compatibility of a given pairing may be observed. Our previous study [17] divided the pairing problem into two phases. The first is the observation phase, where we observe total compatibilities for several pairings and estimate the individual compatibilities. The second is the combining phase, in which a search is performed for a pairing that provides high total compatibility. This procedure is referred to as pairing optimization. The search is based on the compatibility information obtained in the first phase. In [17], we show that the pairing optimization problem can be transformed into a travelling salesman problem (TSP) [31] with a three-layer structure, allowing us to benefit from a variety of known heuristics.

However, we consider that there is substantial room for further performance optimization. This study sheds new light on the pairing problem from two perspectives. The first is to clarify the algebraic structure of the pairing optimization problem. Because we care only about the total compatibility when all elements are paired, there are many compatibility matrices (defined in Section 2) that share the same total compatibilities. In other words, we can consider an equivalence class of compatibility matrices that yield the same total compatibilities and that cannot be distinguished if individual compatibilities are not measurable. We show that the compatibility matrices in each equivalence class have an invariant value.

Second, although any compatibility matrices in the same equivalence class theoretically provide the same total compatibility, the heuristic pairing optimization process can result in different total compatibility values. These differences are not caused by incomplete or noisy observations, but are due to the convergence properties of the heuristic pairing algorithms, which yield better results on some distributions than others. We examine how the statistics of the compatibility matrix affect the pairing optimization problem and propose a compatibility matrix that yields higher total compatibility after optimization. More specifically, we propose a transformation to the compatibility matrix that minimizes the variance of the elements therein, which we call the variance optimization. We confirmed numerically that enhanced total compatibility is achieved via the compatibility matrix after variance optimization. Furthermore, the proposed variance optimization algorithm may also be applicable when no observation phase is required, i.e., when the individual compatibilities are directly observable. In other words, there are cases where a compatibility matrix unsuitable for a heuristic combining algorithm can be converted to one that is easily combinable.

The remainder of this paper is organized as follows. In Section 2, we define the pairing optimization problem mathematically. Section 3 describes the mathematical properties of the equivalence class. Section 4 explains the concept of variance optimization and presents a solution by which it can be achieved. Section 5 presents results of numerical simulations of the proposed variance optimization. Furthermore, there are two optimization problems in this paper. The first is the pairing problem we aim to solve in Section 2.1. Second is the variance optimization which enables us to enhance the performance of the PNN+p2-opt algorithm in Section 4.2. Finally, Section 6 concludes the paper.

2. Problem Setting

In this section, we provide a mathematical definition of the pairing optimization problem that we address in this study, and define some of the mathematical symbols used in the following discussion. In addition, we explain the constraints applied to the pairing optimization problem.

2.1. Pairing Optimization Problem

Here, we assume that the number of elements is an even natural integer n, while the index of each element is a natural number between 1 and n. Parts of the pairing problem can be described elegantly in set theory, while others benefit from using matrix representations. We will use either, where appropriate. Here we use

U (n)

to denote the set of n elements:

\begin{matrix} U (n) \equiv {i ∣ i \in Z, 1 \leq i \leq n} . \end{matrix}

(1)

Then, we define the set of all possible pairs for

U (n)

as

P (n)

, which contains

N (N - 1) / 2

pairs:

\begin{matrix} P (n) \equiv {{i, j} ∣ i, j \in U (n), i < j} . \end{matrix}

(2)

To describe the compatibilities of these pairs, we now define a “compatibility matrix’’ C as follows:

\begin{matrix} C \in R^{n \times n}, \\ \forall {i, j} \in P (n), C_{i, j} = C_{j, i}, \\ 1 \leq i \leq n, C_{i, i} = 0 . \end{matrix}

The compatibility between elements i and j is denoted by

C_{i, j} \in R

. The matrix C is always symmetric and the major diagonal is zero, because pairing i and j does not depend on the order of elements and an element cannot be paired with itself. The set of all possible compatibility matrices is denoted as

Ω_{n}

when the number of elements is n. In other words,

Ω_{n}

is the set of all

n \times n

symmetric distance matrices, or symmetric hollow matrices. To describe a pairing, i.e., which elements are paired together, we now define a pairing matrix

S \in R^{n \times n}

:

\begin{matrix} \forall {i, j} \in P (n), S_{i, j} = S_{j, i} and S_{i, j} \in {0, 1}, \\ 1 \leq i \leq n, S_{i, i} = 0, \\ \forall i, \sum_{j = 1}^{n} S_{i, j} = 1 . \end{matrix}

S is symmetric, because pairing element i with j is equivalent to pairing j with i. The pairing matrix S is also hollow, because pairing i with itself is not allowed. Each row and column contains only a single non-zero element, as each element i can only be paired once. Therefore, a pairing matrix S is an

n \times n

symmetric and hollow permutation matrix. We define the set of all pairing matrices

S (n) \equiv {S}

when the number of elements is n:

\begin{matrix} S \in S (n) . \end{matrix}

(3)

To derive the set representation of a pairing, we introduce the map

f_{set}

as follows:

\begin{matrix} f_{set} (S) \equiv {{i, j} ∣ i < j and S_{i, j} = 1} . \end{matrix}

(4)

A function denoted by

〈 X, C 〉

is then defined as follows, using the Frobenius inner product

{〈 \cdot 〉}_{F}

:

\begin{matrix} C \in Ω_{n}, \\ X \in R^{n \times n}, \\ 〈 X, C 〉 = \frac{1}{2} {〈 X, C 〉}_{F} . \end{matrix}

For a given compatibility matrix C, we call

〈 S, C 〉

for

S \in S (n)

the “total compatibility’’ for pairing S. This formulation is equivalent to the one used in our previous work [17], and corresponds to summing the individual compatibilities

C_{i, j}

of the pairs defined by S:

\begin{matrix} 〈 S, C 〉 = \sum_{{i, j} \in f_{set} (S)} C_{i, j} . \end{matrix}

For any given compatibility matrix C, the pairing optimization problem can then be formulated as follows:

\begin{matrix} \max : 〈 S, C 〉, \\ subject to : S \in S (n) . \end{matrix}

2.2. Limited Observation Constraint

As briefly mentioned in Section 1, in practice there may often exist one more constraint on the pairing optimization problem. We will assume that initially we do not know each compatibility value. Moreover, we assume that only the value of total compatibility

〈 S, C 〉

for any pairing

S \in S (n)

is observable. We call this condition the “limited observation constraint’’.

Under this constraint, we must execute two phases, the “observation phase’’ and the “combining phase’’, as introduced in our previous study [17]. First, we estimate the ground-truth compatibility matrix

C^{g}

through observations of the total compatibilities of several pairings in the observation phase. We denote the estimated compatibility matrix by

C^{e}

. Our previous work [17] calculated the minimum number of observations that are necessary for deducing

C^{e}

and presents a simple algorithm for doing so efficiently.

3. Mathematical Properties of the Pairing Problem

In this section, we consider algebraic structures in the pairing problem. An equivalence relation is defined among compatibility matrices to construct equivalence classes. Then we show a conserved quantity within the equivalence class and that all members of the class yield the same total compatibility for any given pairing. Furthermore, the statistical properties of compatibility matrices are examined, forming the mathematical foundation of the variance optimization to be discussed in Section 4.

3.1. Adjacent Set

We define the adjacent set matrix

R_{i} (1 \leq i \leq n)

as follows:

\begin{matrix} R_{i} \in R^{n \times n}, \\ {(R_{i})}_{k, l} = \{\begin{matrix} 1 if i \in {k, l} and k \neq l \\ 0 otherwise . \end{matrix} \end{matrix}

(5)

We can also describe

f_{set} (R_{i})

as follows:

\begin{matrix} f_{set} (R_{i}) = \{{i, j} ∣ 1 \leq j \leq n, j \neq i\} . \end{matrix}

(6)

With these adjacent sets, the following theorem holds.

Theorem 1.

C \in Ω_{n}

is fully determined by

{〈 S, C 〉 ∣ S \in S (n)}

and

{〈 R_{i}, C 〉 ∣ 1 \leq i \leq n - 1}

.

Note that

〈 R_{n}, C 〉

is not included, i.e., only

n - 1

terms involving

R_{i}

are needed. Here, we have chosen to exclude index n without loss of generality.

Proof of Theorem 1.

Our strategy to prove this involves calculating the dimension of the involved subspaces. First, we prove the equation

\begin{matrix} span {S}_{S \in S (n)} \cap span {R_{i}}_{1 \leq i \leq n - 1} = {O_{n}} \end{matrix}

(7)

where

O_{n}

denotes the

n \times n

zero matrix. Then, we focus on the following equation to check linear independence. Here, we number all pairings such as

S_{1}, S_{2}, \dots S_{u} \dots S_{(N - 1)!!}

. We introduce the coefficients

a_{u}

and

b_{v}

and calculate the overlap of the spans:

\begin{matrix} 1 \leq u \leq (n - 1)!!, a_{u} \in R, \\ 1 \leq v \leq n - 1, b_{v} \in R, \\ \sum_{u = 1}^{(n - 1)!!} a_{u} S_{u} = \sum_{v = 1}^{n - 1} b_{v} R_{v} . \end{matrix}

(8)

We focus on the summation of the kth-column on both sides. Note that for every

S_{u}

there is exactly one non-zero element in column k, while for

R_{v}

there may be more than one if

v = k

and

1 \leq k \leq n - 1

, or exactly one non-zero element otherwise. Then, the following equations hold:

When

1 \leq k \leq n - 1

\begin{matrix} (n - 2) b_{k} + \sum_{l = 1}^{n - 1} b_{l} - \sum_{l = 1}^{(n - 1)!!} a_{l} = 0 . \end{matrix}

(9)

When

k = n

(because of our choice in formulating Theorem 1)

\begin{matrix} \sum_{l = 1}^{n - 1} b_{l} - \sum_{l = 1}^{(n - 1)!!} a_{l} = 0 . \end{matrix}

(10)

With Equations (9) and (10),

b_{k} = 0 (1 \leq k \leq n - 1)

holds. This means that

\begin{matrix} span {S}_{S \in S (n)} \cap span {R_{i}}_{1 \leq i \leq n - 1} = {O_{n}}, \end{matrix}

(11)

\begin{matrix} \dim span {R_{i}}_{1 \leq i \leq n - 1} = n - 1 . \end{matrix}

(12)

By our previous study [17],

\begin{matrix} \dim span {S}_{S \in S (n)} = L_{\min} (n) . \end{matrix}

(13)

Here, we denote

L_{\min} (n) \equiv (n - 1) (n - 2) / 2

. By Equations (12) and (13), the following equation holds:

\begin{matrix} \dim span {S}_{S \in S (n)} + \dim span {R_{i}}_{1 \leq i \leq n - 1} = \dim Ω_{n} . \end{matrix}

(14)

Therefore, by Equations (11) and (14),

\begin{matrix} \dim span {S}_{S \in S (n)} \cup span {R_{i}}_{1 \leq i \leq n - 1} = \dim Ω_{n} . \end{matrix}

(15)

The pairing matrices S are a subset of

Ω_{n}

. In addition, the adjacent set matrices

R_{i}

are also a subset of

Ω_{n}

. Therefore, the following equation holds:

\begin{matrix} span {S}_{S \in S (n)} \cup span {R_{i}}_{1 \leq i \leq n - 1} \subseteq Ω_{n} . \end{matrix}

(16)

With Equations (15) and (16),

\begin{matrix} span {S}_{S \in S (n)} \cup span {R_{i}}_{1 \leq i \leq n - 1} = Ω_{n} . \end{matrix}

(17)

That is,

{S}_{S \in S (n)}

plus

{R_{i}}_{1 \leq i \leq n - 1}

can construct

Ω_{n}

. Finally,

〈 S, C 〉

is a linear transformation of S which comes from the property of the Frobenius inner product. Therefore,

C \in Ω_{n}

can be constructed as a linear combination of

{〈 S, C 〉 ∣ S \in S (n)}

and

{〈 R_{i}, C 〉 ∣ 1 \leq i \leq n - 1}

. Therefore, the theorem holds. □

Corollary 1.

\begin{matrix} A, B \in Ω_{n}, \\ A = B i f a n d o n l y i f \\ \forall S \in S (n), \\ 〈 S, A 〉 = 〈 S, B 〉 a n d 1 \leq i \leq n, 〈 R_{i}, A 〉 = 〈 R_{i}, B 〉 . \end{matrix}

(18)

This corollary is a special case of Theorem 1 because Equation (18) means that A and B have the same total compatibilities for all pairings and all adjacent sets.

Here, we present an example for Theorem 1 for the

n = 4

case to illustrate the relationship of the involved subspaces. We define the following

H_{i}

:

\begin{matrix} H_{i} = \{\begin{matrix} span {S}_{S \in S (n)} if i = 0, \\ span {R_{i}} if 1 \leq i \leq n - 1 . \end{matrix} \end{matrix}

(19)

We represent

H_{i}

as follows, where

D_{i, j} \in Ω_{n}

is defined as the

n \times n

matrix whose

(i, j)

th element is 1 and all other elements are 0:

\begin{matrix} H_{i} = \{\begin{matrix} if i = 0, \\ {k_{1} (D_{1, 2} + D_{3, 4}) + k_{2} (D_{1, 3} + D_{2, 4}) + k_{3} (D_{1, 4} + D_{2, 3}) ∣ k_{1}, k_{2}, k_{3} \in R} \\ if i = 1, \\ {k_{4} (D_{1, 2} + D_{1, 3} + D_{1, 4}) ∣ k_{4} \in R} \\ if i = 2, \\ {k_{5} (D_{2, 1} + D_{2, 3} + D_{2, 4}) ∣ k_{5} \in R} \\ if i = 3, \\ {k_{6} (D_{3, 1} + D_{3, 2} + D_{3, 4}) ∣ k_{6} \in R}, \end{matrix} \end{matrix}

(20)

\begin{matrix} \bar{H} = {l_{i, j} D_{i, j} ∣ 1 \leq i < j \leq n, l_{i, j} \in R} . \end{matrix}

(21)

The image of these spaces is represented in Figure 1. That is,

\begin{matrix} 0 \leq i < j \leq n - 1, i \neq j, H_{i} \cap H_{j} = {O_{n}}, \end{matrix}

(22)

\begin{matrix} \bar{H} = H_{0} \cup H_{1} \cup H_{2} \cup H_{3} . \end{matrix}

(23)

3.2. Equivalence Class

We define the relation ∼ as follows:

\begin{matrix} A, B \in Ω_{n}, \\ A \sim B if and only if \forall S \in S (n), 〈 S, A 〉 = 〈 S, B 〉 . \end{matrix}

(24)

This represents an equivalence relationship between A and B, leading to the construction of an equivalence class.

Regarding this equivalence class, the following theorem holds:

Theorem 2.

\begin{matrix} A, B \in Ω_{n}, \\ A \sim B i f a n d o n l y i f \\ \forall {i, j} \in P (n), \\ A_{i, j} - \frac{1}{n - 2} (〈 R_{i}, A 〉 + 〈 R_{j}, A 〉) = B_{i, j} - \frac{1}{n - 2} (〈 R_{i}, B 〉 + 〈 R_{j}, B 〉) . \end{matrix}

(25)

That is, for any matrix C in the equivalence class, the values given by the following are conserved.

\begin{matrix} \forall {i, j} \in P (n), C_{i, j} - \frac{1}{n - 2} (〈 R_{i}, C 〉 + 〈 R_{j}, C 〉) . \end{matrix}

(26)

The matrix form of the conserved values is described in Appendix A.

Proof of Theorem 2.

First, we prove sufficiency. We assume that the following equation holds:

\begin{matrix} \forall {i, j} \in P (n), A_{i, j} - \frac{1}{n - 2} (〈 R_{i}, A 〉 + 〈 R_{j}, A 〉) = B_{i, j} - \frac{1}{n - 2} (〈 R_{i}, B 〉 + 〈 R_{j}, B 〉) . \end{matrix}

(27)

With Equation (27), the following equation holds:

\begin{matrix} \sum_{{i, j} \in P (n)} \{A_{i, j} - \frac{1}{n - 2} (〈 R_{i}, A 〉 + 〈 R_{j}, A 〉)\} \\ = \sum_{{i, j} \in P (n)} \{B_{i, j} - \frac{1}{n - 2} (〈 R_{i}, B 〉 + 〈 R_{j}, B 〉)\} . \end{matrix}

(28)

Here, the left side can be calculated as follows because the number of pairs including element k in

P (n)

is

n - 1

:

\begin{matrix} \sum_{{i, j} \in P (n)} \{A_{i, j} - \frac{1}{n - 2} (〈 R_{i}, A 〉 + 〈 R_{j}, A 〉)\} \\ = & \sum_{{i, j} \in P (n)} A_{i, j} - \frac{n - 1}{n - 2} \sum_{k = 1}^{n} 〈 R_{k}, A 〉 \\ = & \sum_{{i, j} \in P (n)} A_{i, j} - \frac{n - 1}{n - 2} \sum_{k = 1}^{n} \sum_{l \neq k} A_{k, l} \\ = & \sum_{{i, j} \in P (n)} A_{i, j} - \frac{2 (n - 1)}{n - 2} \sum_{{k, l} \in P (n)} A_{k, l} \\ = & - \frac{n}{n - 2} \sum_{{i, j} \in P (n)} A_{i, j} . \end{matrix}

(29)

Using Equation (29), Equation (28) is transformed into the following:

\begin{matrix} - \frac{n}{n - 2} \sum_{{i, j} \in P (n)} A_{i, j} = - \frac{n}{n - 2} \sum_{{i, j} \in P (n)} B_{i, j} . \end{matrix}

(30)

Therefore,

\begin{matrix} \sum_{{i, j} \in P (n)} A_{i, j} = \sum_{{i, j} \in P (n)} B_{i, j} . \end{matrix}

(31)

The following equation holds for any pairing S by Equation (27):

\begin{matrix} \sum_{{i, j} \in f_{set} (S)} \{A_{i, j} - \frac{1}{n - 2} (〈 R_{i}, A 〉 + 〈 R_{j}, A 〉)\} \\ = \sum_{{i, j} \in f_{set} (S)} \{B_{i, j} - \frac{1}{n - 2} (〈 R_{i}, B 〉 + 〈 R_{j}, B 〉)\} . \end{matrix}

(32)

Here, the following equation holds. Note that

{i, j}

belongs to

f_{set} (S)

; hence,

〈 R_{k}, A 〉

appears only once and all indexes k ranging from 1 to n appear over the summation:

\begin{matrix} \sum_{{i, j} \in f_{set} (S)} (〈 R_{i}, A 〉 + 〈 R_{j}, A 〉) & = & \sum_{k = 1}^{n} 〈 R_{k}, A 〉 \\ = & \sum_{k = 1}^{n} \sum_{l, l \neq k} A_{k, l} \end{matrix}

(33)

\begin{matrix} = & 2 \sum_{{k, l} \in P (n)} A_{k, l} . \end{matrix}

(34)

For B, the following equation also holds:

\begin{matrix} \sum_{{i, j} \in f_{set} (S)} (〈 R_{i}, B 〉 + 〈 R_{j}, B 〉) & = & \sum_{k = 1}^{n} 〈 R_{k}, B 〉 \end{matrix}

(35)

\begin{matrix} = & 2 \sum_{{k, l} \in P (n)} B_{k, l} . \end{matrix}

(36)

Using these transformations, Equation (32) is transformed as follows:

\begin{matrix} 〈 S, A 〉 - \frac{2}{n - 2} \sum_{{k, l} \in P (n)} A_{k, l} = 〈 S, B 〉 - \frac{2}{n - 2} \sum_{{k, l} \in P (n)} B_{k, l} . \end{matrix}

(37)

With Equation (31),

\begin{matrix} 〈 S, A 〉 = 〈 S, B 〉 . \end{matrix}

(38)

Then,

A \sim B

holds.

Second, we prove the necessity. We assume that

A \sim B

holds. We define

A^{*} \in Ω_{n}

as follows:

\begin{matrix} A_{i, j}^{*} \equiv \frac{1}{n - 2} (〈 R_{i}, A 〉 + 〈 R_{j}, A 〉) + B_{i, j} - \frac{1}{n - 2} (〈 R_{i}, B 〉 + 〈 R_{j}, B 〉) . \end{matrix}

(39)

By Equations (33), (35) and (39),

\begin{matrix} \forall S \in S (n), 〈 S, A^{*} 〉 & = & \sum_{{i, j} \in f_{set} (S)} A_{i, j}^{*} \\ = & 〈 S, B 〉 + \frac{1}{n - 2} \sum_{i = 1}^{n} 〈 R_{i}, A 〉 - \frac{1}{n - 2} \sum_{i = 1}^{n} 〈 R_{i}, B 〉 . \end{matrix}

(40)

We derive the relationship between

\sum_{i = 1}^{n} 〈 R_{i}, A 〉

and

\sum_{S \in S (n)} 〈 S, A 〉

here in order to transform Equation (40). By Equation (34),

\begin{matrix} \sum_{i = 1}^{n} 〈 R_{i}, A 〉 = 2 \sum_{{i, j} \in P (n)} A_{i, j} . \end{matrix}

(41)

For

\sum_{S \in S (n)} 〈 S, A 〉

, we focus on the fact that the number of appearances of

A_{i, j}

is

(n - 3)!!

,

\begin{matrix} \sum_{S \in S (n)} 〈 S, A 〉 = (n - 3)!! \sum_{{i, j} \in P (n)} A_{i, j} . \end{matrix}

(42)

With Equations (41) and (42), the following relationship holds:

\begin{matrix} \sum_{i = 1}^{n} 〈 R_{i}, A 〉 = \frac{2}{(n - 3)!!} \sum_{S \in S (n)} 〈 S, A 〉 . \end{matrix}

(43)

Therefore, the following holds by

A \sim B

and Equation (43):

\begin{matrix} \sum_{i = 1}^{n} 〈 R_{i}, A 〉 & = & \frac{2}{(n - 3)!!} \sum_{S \in S (n)} 〈 S, A 〉 \\ = & \frac{2}{(n - 3)!!} \sum_{S \in S (n)} 〈 S, B 〉 \\ = & \sum_{i = 1}^{n} 〈 R_{i}, B 〉 . \end{matrix}

(44)

By Equation (44), we can cancel the second and third terms of (40),

\begin{matrix} 〈 S, A^{*} 〉 = 〈 S, B 〉 . \end{matrix}

(45)

In addition,

A \sim B

holds. Therefore,

\begin{matrix} \forall S \in S (n), 〈 S, A^{*} 〉 = 〈 S, B 〉 = 〈 S, A 〉 . \end{matrix}

(46)

Additionally, the following also holds by

A \sim B

and Equation (44):

\begin{matrix} \sum_{j, j \neq i} A_{i, j}^{*} & = & \frac{n - 1}{n - 2} 〈 R_{i}, A 〉 + \frac{1}{n - 2} \sum_{j, j \neq i} 〈 R_{j}, A 〉 + \sum_{j, j \neq i} B_{i, j} \\ - \frac{n - 1}{n - 2} 〈 R_{i}, B 〉 - \frac{1}{n - 2} \sum_{j, j \neq i} 〈 R_{j}, B 〉 \\ = & \frac{1}{n - 2} (\sum_{j = 1}^{n} 〈 R_{j}, A 〉 - \sum_{j = 1}^{n} 〈 R_{j}, B 〉) + 〈 R_{i}, A 〉 \\ = & 〈 R_{i}, A 〉 . \end{matrix}

(47)

By Equation (47),

\begin{matrix} 1 \leq i \leq n, 〈 R_{i}, A^{*} 〉 = 〈 R_{i}, A 〉 . \end{matrix}

(48)

Therefore, by Equations (46) and (48) and Corollary 1,

\begin{matrix} A = A^{*} \end{matrix}

(49)

is valid. That is to say, the following equation holds:

\begin{matrix} {i, j} \in P (n), A_{i, j} - \frac{1}{n - 2} (〈 R_{i}, A 〉 + 〈 R_{j}, A 〉) = B_{i, j} - \frac{1}{n - 2} (〈 R_{i}, B 〉 + 〈 R_{j}, B 〉) . \end{matrix}

(50)

□

3.3. Mean and Covariance

Here, we analyze statistical properties associated with the compatibility matrix and the total compatibility.

We define the mean values of compatibilities and total compatibilities as

\begin{matrix} C \in Ω_{n}, \\ μ_{element} (C) & \equiv & \frac{2}{n (n - 1)} \sum_{1 \leq i < j \leq n} C_{i, j}, \\ μ_{sum} (C) & \equiv & \frac{1}{(n - 1)!!} \sum_{S \in S (n)} 〈 S, C 〉 . \end{matrix}

By Equation (42),

μ_{sum} (C)

is transformed into

\begin{matrix} μ_{sum} (C) & \equiv & \frac{1}{(n - 1)!!} \sum_{S \in S (n)} 〈 S, C 〉 \\ = & \frac{1}{n - 1} \sum_{1 \leq i < j \leq n} C_{i, j} \\ = & \frac{n}{2} μ_{element} (C) \end{matrix}

(51)

where

μ_{element} (C)

indicates the mean value of the elements of the compatibility matrix C and

μ_{sum} (C)

is the mean of the total compatibility across all possible pairing with respect to the compatibility matrix C.

We define the square root of the covariance values for compatibilities and total compatibilities as follows:

\begin{matrix} σ_{element} (A, B) \equiv \sqrt{\sum_{1 \leq i < j \leq n} \frac{2}{n (n - 1)} (A_{i, j} - μ_{element} (A)) (B_{i, j} - μ_{element} (B))}, \\ σ_{sum} (A, B) \equiv \sqrt{\frac{1}{(n - 1)!!} \sum_{S \in S (n)} (〈 S, A 〉 - μ_{sum} (A)) (〈 S, B 〉 - μ_{sum} (B))} . \end{matrix}

(52)

Clearly,

σ_{element}^{2} (C, C)

and

σ_{sum}^{2} (C, C)

are variance values for compatibilities and total compatibilities when the compatibility matrix is C.

Regarding

σ_{sum}^{2} (C, C)

, the following theorem holds.

Theorem 3.

Let

I_{n}

be the

n \times n

identity matrix,

J_{n}

the

n \times n

matrix where all elements are 1, and

C \in Ω_{n}, \hat{C} \equiv C - μ_{element} (C) (J_{n} - I_{n})

. Then, the following equation holds:

\begin{matrix} σ_{sum}^{2} (C, C) = \frac{n (n - 2)}{2 (n - 3)} σ_{element}^{2} (C, C) - \frac{1}{(n - 1) (n - 3)} \sum_{k = 1}^{n} {〈 R_{k}, \hat{C} 〉}^{2} . \end{matrix}

(53)

Proof of Theorem 3.

By definition,

\begin{matrix} σ_{sum}^{2} (C, C) & = & \frac{1}{(n - 1)!!} \sum_{S \in S (n)} {\{〈 S, C 〉 - μ_{sum} (C)\}}^{2} \end{matrix}

Using Equation (51),

\begin{matrix} σ_{sum}^{2} (C, C) & = & \frac{1}{(n - 1)!!} \sum_{S \in S (n)} {\{〈 S, C 〉 - \frac{n}{2} μ_{element} (C)\}}^{2} \end{matrix}

(54)

Here, the following equation holds:

\begin{matrix} 〈 S, \hat{C} 〉 & = & \frac{1}{2} {〈 S, \hat{C} 〉}_{F} \\ = & \frac{1}{2} {〈 S, C 〉}_{F} - \frac{1}{2} μ_{element} (C) 〈 S, J_{n} - I_{n} 〉 \\ = & \frac{1}{2} {〈 S, C 〉}_{F} - \frac{n}{2} μ_{element} (C) \\ = & 〈 S, C 〉 - \frac{n}{2} μ_{element} (C) \end{matrix}

(55)

Therefore, by Equations (54) and (55),

\begin{matrix} σ_{sum}^{2} (C, C) & = & \frac{1}{(n - 1)!!} \sum_{S \in S (n)} {\{〈 S, C 〉 - \frac{n}{2} μ_{element} (C)\}}^{2} \\ = & \frac{1}{(n - 1)!!} \sum_{S \in S (n)} {〈 S, \hat{C} 〉}^{2} \\ = & \frac{1}{(n - 1)!!} \cdot (n - 3)!! \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j}^{2} \\ + \frac{1}{(n - 1)!!} \cdot (n - 5)!! \sum_{{i, j} \in P (n)} \sum_{\begin{matrix} {k, l} \in P (n) \\ {k, l} \cap {i, j} = \emptyset \end{matrix}} {\hat{C}}_{i, j} {\hat{C}}_{k, l} \\ = & \frac{1}{n - 1} \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j}^{2} + \frac{1}{(n - 1) (n - 3)} \sum_{{i, j} \in P (n)} \sum_{\begin{matrix} {k, l} \in P (n) \\ {k, l} \cap {i, j} = \emptyset \end{matrix}} {\hat{C}}_{i, j} {\hat{C}}_{k, l} \\ = & \frac{1}{n - 1} \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j}^{2} + \frac{1}{(n - 1) (n - 3)} \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j} \sum_{\begin{matrix} {k, l} \in P (n) \\ {k, l} \cap {i, j} = \emptyset \end{matrix}} {\hat{C}}_{k, l} . \end{matrix}

(56)

Here, we focus on

\sum_{\begin{matrix} {k, l} \in P (n) \\ {k, l} \cap {i, j} = \emptyset \end{matrix}} {\hat{C}}_{k, l}

. This term is transformed as follows:

\begin{matrix} \sum_{\begin{matrix} {k, l} \in P (n) \\ {k, l} \cap {i, j} = \emptyset \end{matrix}} {\hat{C}}_{k, l} & = & {\hat{C}}_{i, j} + \sum_{{k, l} \in P (n)} {\hat{C}}_{k, l} - \sum_{k, k \neq i} {\hat{C}}_{i, k} - \sum_{k, k \neq j} {\hat{C}}_{j, k} \\ = & {\hat{C}}_{i, j} - 〈 R_{i}, \hat{C} 〉 - 〈 R_{j}, \hat{C} 〉 + \sum_{{k, l} \in P (n)} {\hat{C}}_{k, l} \\ = & {\hat{C}}_{i, j} - 〈 R_{i}, \hat{C} 〉 - 〈 R_{j}, \hat{C} 〉 + \sum_{{k, l} \in P (n)} (C_{k, l} - μ_{element} (C)) \\ = & {\hat{C}}_{i, j} - 〈 R_{i}, \hat{C} 〉 - 〈 R_{j}, \hat{C} 〉 + (\sum_{{k, l} \in P (n)} C_{k, l}) - \frac{n (n - 1)}{2} μ_{element} (C) \\ = & {\hat{C}}_{i, j} - 〈 R_{i}, \hat{C} 〉 - 〈 R_{j}, \hat{C} 〉 . \end{matrix}

(57)

Then, using this formula,

\begin{matrix} \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j} \sum_{\begin{matrix} {k, l} \in P (n) \\ {k, l} \neq {i, j} \end{matrix}} {\hat{C}}_{k, l} \\ = \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j} ({\hat{C}}_{i, j} - 〈 R_{i}, \hat{C} 〉 - 〈 R_{j}, \hat{C} 〉) \\ = \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j}^{2} - \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j} (〈 R_{i}, \hat{C} 〉 + 〈 R_{j}, \hat{C} 〉) \\ = \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j}^{2} - \sum_{i = 1}^{n} \sum_{j \neq i} {\hat{C}}_{i, j} 〈 R_{i}, \hat{C} 〉 \\ = \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j}^{2} - \sum_{i = 1}^{n} {〈 R_{i}, \hat{C} 〉}^{2} . \end{matrix}

(58)

By Equations (56) and (58), the following equation holds:

\begin{matrix} σ_{sum}^{2} (C, C) & = & \frac{n - 2}{(n - 1) (n - 3)} \sum_{{i, j} \in P (n)} {\hat{C}}_{i, j}^{2} - \frac{1}{(n - 1) (n - 3)} \sum_{k = 1}^{n} {〈 R_{k}, \hat{C} 〉}^{2} \\ = & \frac{n (n - 2)}{2 (n - 3)} σ_{element}^{2} (C, C) - \frac{1}{(n - 1) (n - 3)} \sum_{k = 1}^{n} {〈 R_{k}, \hat{C} 〉}^{2} . \end{matrix}

(59)

Therefore, the theorem holds. □

4. Variance Optimization

This section examines the performance enhancement from deriving a pairing that yields higher total compatibility by exploiting the algebraic structures identified in the previous section. We first show that the variance of the elements in a compatibility matrix affects the performance of the heuristic algorithm proposed in our previous study. Then we propose the transformation of a compatibility matrix to another one that minimizes the variance while ensuring that the total compatibility is maintained.

4.1. Performance Degradation through the Observation Phase

In our previous study [17], we proposed an algorithm for recognizing the compatibilities among elements through multiple measurements of total compatibility. To summarize, we estimate the compatibility matrix denoted by

\tilde{C} \in Ω_{n}

, which is given by

\begin{matrix} C \in Ω_{n}, \\ {\tilde{C}}_{i, j} = \{\begin{matrix} 0 if 1 \in {i, j} \\ C_{i, j} - C_{1, i} - C_{1, j} + \frac{2}{n - 2} \sum_{k = 2}^{n} C_{1, k} otherwise . \end{matrix} \end{matrix}

(60)

This

\tilde{C} \in Ω_{n}

is one of the elements in the equivalence class. That is,

C \sim \tilde{C}

holds. By this property and Equation (60), the dimension of

{S}_{S \in S (n)}

is given by

(n - 1) (n - 2) / 2

, which we refer to as

L_{\min} (n)

. This means that the number of observations required to grasp the compatibilities through an observation phase is

L_{\min} (n)

.

Indeed, our previous study proposed an observation algorithm which needs

O (n^{2})

measurements. We have also confirmed numerically that the observation strategy provides a compatibility matrix, which is in the equivalence class of the ground-truth compatibility matrix

C^{g}

. In the numerical studies, the elements of the ground-truth compatibility matrix,

C_{i, j}^{g}

, were specified by uniformly distributed random numbers in the range of

[0, 1]

.

However, finding a pairing yielding a greater total compatibility becomes difficult based on

C^{e}

, including the above-mentioned

\tilde{C}

, even though

C^{e}

is in the equivalence class where the ground-truth compatibility

C^{g}

is included. In searching for a better pairing, we use a heuristic algorithm, which is named Pairing-2-opt [17]. We consider the difficulty comes from the fact that the variance of the elements of the compatibility matrix

σ_{element}^{2} (C^{e}, C^{e})

would be larger than those of

σ_{element}^{2} (C^{g}, C^{g})

, which is highly likely to cause the combining algorithm to become stuck in a local minimum.

Hence, our idea is to find a compatibility matrix X which is in the same equivalence class of matrix C

\begin{matrix} \forall S \in S (n), 〈 S, X 〉 = 〈 S, C 〉 \end{matrix}

(61)

while simultaneously minimizing the variance of the elements of

σ_{element}^{2} (X, X)

.

4.2. Transforming the Compatibility Matrix with Minimized Variance

We solve the following optimization problem:

\begin{matrix} \min : & σ_{element}^{2} (X, X), \\ subject to : & X, C \in Ω_{n}, C is fixed, \\ X \sim C . \end{matrix}

(62)

By Theorem 3 and

σ_{sum}^{2} (X, X) = σ_{sum}^{2} (C, C)

, we transform this problem into the following form:

\begin{matrix} \min : & \sum_{k = 1}^{n} {〈 R_{k}, \hat{X} 〉}^{2}, \\ subject to : & X, C \in Ω_{n}, C is fixed, \\ X \sim C, \\ \hat{X} \equiv X - μ_{element} (X) (J_{n} - I_{n}) . \end{matrix}

(63)

The optimal solution for this problem holds because the sum of squares is minimized when all values are 0:

\begin{matrix} 1 \leq k \leq n, 〈 R_{k}, \hat{X} 〉 = 0 . \end{matrix}

(64)

Hence, the following equation is derived:

\begin{matrix} 1 \leq k \leq n, 〈 R_{k}, X 〉 = (n - 1) μ_{element} (C) . \end{matrix}

(65)

By Equation (65) and Theorem 2, the optimal solution is represented as follows:

\begin{matrix} X_{i, j} = \frac{2 (n - 1)}{n - 2} μ_{element} (C) + C_{i, j} - \frac{1}{n - 2} (〈 R_{i}, C 〉 + 〈 R_{j}, C 〉) . \end{matrix}

(66)

Thus, the compatibility matrix with minimal variance is derived. In addition, this discussion and solution mean that the optimal-variance solution is unique with respect to the equivalence class.

5. Simulation

In this section, we evaluate the performance of the proposed method on the pairing optimization problem. There are two important points that should be clarified through the simulations. One is to quantitatively evaluate the performance reduction of the combining algorithm proposed in the previous study, based on the observation phase. The other is to demonstrate the performance enhancement due to the variance optimization discussed in Section 4.

5.1. Setting

We configure the ground-truth compatibility matrix

C^{g} \in Ω_{n}

with two different distributions. The first is the uniform distribution:

\begin{matrix} \forall {i, j} \in P, C_{i, j}^{g} \sim U (0, 1) . \end{matrix}

(67)

Here, we denote the uniform distribution between 0 and 1 as

U (0, 1)

. The second distribution is the Poisson distribution:

\begin{matrix} \forall {i, j} \in P, C_{i, j}^{g} \sim P o i s s o n (1) . \end{matrix}

(68)

Here, we denote the Poisson distribution whose mean is

λ

as

P o i s s o n (λ)

. In the numerical simulation, the number of elements in the system n varied from 100 to 1000 in intervals of 100. For each n, we conducted 100 trials with different randomly generated ground-truth compatibility matrices

C^{g}

based on Equation (67) or Equation (68). We quantified the performance for each derived pairing

S \in S (n)

by

2 〈 S, C^{g} 〉 / n

and evaluated its average over 100 trials for each value of n.

5.2. Simulation Flow

The ground-truth compatibility matrix

C^{g}

is transformed into

C^{e_{1}}

by the observation algorithm based on Equation (60). The variance optimization transforms

C^{e_{1}}

into

C^{e_{2}}

. The combining algorithm, which is called PNN+p2-opt [17], yields a pairing with the intention of achieving higher total compatibility. The exchange limit l is an internal parameter in PNN+p2-opt. This determines the number of maximum trials, and is set to 600 in the present study.

We evaluated the performance on the basis of

C^{g}

,

C^{e_{1}}

, and

C^{e_{2}}

, as shown in flows (i), (ii), and (iii), respectively, in Figure 2.

5.3. Performance

The blue, red, and yellow curves in Figure 3 demonstrate the performance of cases (i), (ii), and (iii), respectively, as a function of the number of elements for the uniform distribution (Figure 3a) and the Poisson distribution (Figure 3b). For the uniform distributed ground-truth we observe that the performance of case (ii) is inferior to that of case (i), demonstrating the performance degradation by the transformation from

C^{g}

to

C^{e_{1}}

through observation. Furthermore, the performance of case (iii) is enhanced compared with that of case (ii), which confirms the performance gain from variance optimization. The results differ for the Poisson distribution. Here, the performance of case (iii) is higher than case (i). That is, for the Poisson case the variance optimization (Flow (iii)) not only counteracted the performance loss of the observation algorithm (Flow (ii)), but actually enhanced the performance compared to the ground truth matrix

C^{g}

(Flow (i)). Further numerical tests revealed that the relationship of performances for a Gaussian distribution are similar to those for the uniform distribution. Conversely, the performance for a binary distribution hardly differed between any of the algorithms.

The variance of

C^{g}

,

C^{e_{1}}

, and

C^{e_{2}}

are evaluated as shown in Figure 4 as a function of the number of elements. We clearly observe that the variance of

C^{e_{1}}

is higher than

C^{g}

while the variance of

C^{e_{2}}

becomes comparable to the ground-truth case

C^{g}

for both the uniform and Poisson distributions.

From these numerical results, we can conclude that the variance optimization minimizes the variance and enhances the performance of the achieved total compatibility. It is worth noting that the performance with the uniform distribution after variance optimization is still lower than the case based on the ground-truth matrix

C^{g}

, as observed in Figure 3a. This occurs because the variance optimization algorithm does not transform

C^{e_{1}}

to the original compatibility matrix

C^{g}

. In other words, there exist additional factors that influence the performance of the combining algorithm that are related to the compatibility distribution. The distribution of the original compatibility

C^{g}

(uniform distribution) is seemingly beneficial for the performance of the heuristic combining algorithm, even when compared to the compatibility matrix with minimum variance

C^{e_{2}}

.

6. Conclusions

One of the most challenging issues in the pairing problem is how to understand the underlying compatibilities among the elements under study. An accurate and efficient approach is essential for practical applications such as wireless communications and online social networks. This study reveals several algebraic structures in the pairing optimization problem.

We introduce an equivalence class in the compatibility matrices, containing matrices that yield the same total compatibility although the matrices themselves differ. This can also be expressed through a conserved value or invariance in the equivalence class. Based on such insights, we propose a transformation of the initially estimated compatibility matrix to another form that minimizes the variance of the elements. We demonstrate that the highest total compatibility found heuristically is improved significantly with the proposed transformation relative to the direct approach.

In the future, the proposed algorithm may be applied to bipartite matching and assignment problems, for example. Therefore, if the compatibility between elements that should not be paired is set to a negative value with a relatively large absolute value, we may solve the problem heuristically. Hence, the variance optimization proposed in this study may aid in performance enhancement.

Author Contributions

Conceptualization, N.F., M.H. and M.N.; methodology, N.F.; software, N.F.; validation, N.F., A.R. and T.M.; formal analysis, N.F. and A.R.; investigation, N.F., A.R., T.M., R.H., A.L., M.H. and M.N.; resources, T.M., R.H. and M.N.; data curation, N.F.; writing—original draft preparation, N.F., A.R. and M.N.; writing—review and editing, N.F., A.R., T.M., R.H., A.L., M.H. and M.N.; visualization, N.F.; supervision, M.N.; project administration, M.H. and M.N.; funding acquisition, M.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by the Japan Science and Technology Agency through the Core Research for Evolutionary Science and Technology (CREST) Project (JPMJCR17N2), and in part by the Japan Society for the Promotion of Science through the Grants-in-Aid for Scientific Research (A) (JP20H00233) and Transformative Research Areas (A) (JP22H05197). AR is a JSPS International Research Fellow.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

We would like to thank the editors of this study.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Matrix Form of Conserved Quantities

In Theorem 2, the following values are conserved in the same equivalence class.

\begin{matrix} \forall {i, j} \in P (n), C_{i, j} - \frac{1}{n - 2} (〈 R_{i}, C 〉 + 〈 R_{j}, C 〉) . \end{matrix}

(A1)

We can transform Equation (A1) into the following form using the Hadamard product ∘.

\begin{matrix} C - \frac{1}{n - 2} (J_{n} - I_{n}) \circ (J_{n} C + C J_{n}) . \end{matrix}

(A2)

Therefore, the following equation holds:

\begin{matrix} A \sim B if and only if \\ A - \frac{1}{n - 2} (J_{n} - I_{n}) \circ (J_{n} A + A J_{n}) = B - \frac{1}{n - 2} (J_{n} - I_{n}) \circ (J_{n} B + B J_{n}) . \end{matrix}

(A3)

Appendix B. Computational Time

We compared the computational time of four different algorithms in the new Figure A1 in the new version of the manuscript. Three of them are for the cases from Figure 2. For cases (ii) and (iii), the computational time also includes the time needed for variance optimization. We compared them to a conventional MWM algorithm whose code (https://jp.mathworks.com/matlabcentral/fileexchange/42827-weighted-maximum-matching-in-general-graphs (accessed on 10 January 2023)) was developed and distributed by Daniel R. Saunders (http://danielrsaunders.com (accessed on 10 January 2023)). This conventional algorithm is based on “Efficient algorithms for finding maximum matching in graphs” by Zvi Galil [32]. The number of elements changes from 100 to 1000. One hundred different compatibility matrices were simulated and averaged to obtain the computational time.

Figure A1. Comparison of computational time between different four algorithms, which are case (i), (ii), (iii) and the conventional MWM algorithm.

Figure A1 shows that, as expected, the PNN+p2-opt algorithm is significantly faster than the conventional algorithm, and Flow (i), Flow (ii), Flow (iii) work faster in this order. These computational times can be explained as follows: First, PNN+p2-opt is heuristic and a

O (n^{2})

algorithm. Therefore, PNN+p2-opt is significantly faster than the conventional MWM algorithm that aims to find the absolute best solution. Second, we count the computational time, including the variance optimization procedure. The variance optimization takes some time, so the computational time of flow (ii) and flow (iii) is longer than flow (i). Third, flow (ii) has a tendency to become stuck in local minima, resulting in less computational time than flow (iii), due to the faster termination of the p2-opt algorithm.

In the future, the comparison to machine-learning-based methods such as the one proposed in Ref. [16] is of great interest. However, at this point, it is unclear how to conduct a fair comparison, as the ML-based algorithm requires extensive training on multiple examples before it is able to solve the problem. Nevertheless, as machine learning is a rapidly evolving field, it is possible that ML-based algorithms specialized for the pairing problem could be developed in the near future.

References

Gale, D.; Shapley, L.S. College admissions and the stability of marriage. Am. Math. Mon. 1962, 69, 9–15. [Google Scholar] [CrossRef]
Roth, A.E. The economics of matching: Stability and incentives. Math. Oper. Res. 1982, 7, 617–628. [Google Scholar] [CrossRef] [Green Version]
Ergin, H.; Sönmez, T.; Ünver, M.U. Dual-Donor Organ Exchange. Econometrica 2017, 85, 1645–1671. [Google Scholar] [CrossRef] [Green Version]
Kohl, N.; Karisch, S.E. Airline crew rostering: Problem types, modeling, and optimization. Ann. Oper. Res. 2004, 127, 223–257. [Google Scholar] [CrossRef]
Gambetta, J.M.; Chow, J.M.; Steffen, M. Building logical qubits in a superconducting quantum computing system. npj Quantum Inf. 2017, 3, 1–7. [Google Scholar] [CrossRef] [Green Version]
Gao, Y.; Dai, Q.; Wang, M.; Zhang, N. 3D model retrieval using weighted bipartite graph matching. Signal Process. Image Commun. 2011, 26, 39–47. [Google Scholar] [CrossRef]
Bellur, U.; Kulkarni, R. Improved matchmaking algorithm for semantic web services based on bipartite graph matching. In Proceedings of the IEEE International Conference on Web Services (ICWS 2007), Salt Lake City, UT, USA, 9–13 July 2007; pp. 86–93. [Google Scholar]
Edmonds, J. Paths, trees, and flowers. Can. J. Math. 1965, 17, 449–467. [Google Scholar] [CrossRef]
Gabow, H.N. Data structures for weighted matching and nearest common ancestors with linking. In Proceedings of the First Annual ACM-SIAM Symposium on Discrete Algorithms, San Francisco, CA, USA, 22–24 January 1990; pp. 434–443. [Google Scholar]
Huang, C.C.; Kavitha, T. Efficient algorithms for maximum weight matchings in general graphs with small edge weights. In Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms, Kyoto, Japan, 17–19 January 2012; pp. 1400–1412. [Google Scholar]
Pettie, S. A simple reduction from maximum weight matching to maximum cardinality matching. Inf. Process. Lett. 2012, 112, 893–898. [Google Scholar] [CrossRef] [Green Version]
Cygan, M.; Gabow, H.N.; Sankowski, P. Algorithmic applications of baur-strassen’s theorem: Shortest cycles, diameter, and matchings. J. ACM (JACM) 2015, 62, 1–30. [Google Scholar] [CrossRef]
Duan, R.; Pettie, S. Approximating maximum weight matching in near-linear time. In Proceedings of the 2010 IEEE 51st Annual Symposium on Foundations of Computer Science, Las Vegas, NV, USA, 23–26 October 2010; pp. 673–682. [Google Scholar]
Hanke, S.; Hougardy, S. New Approximation Algorithms for the Weighted Matching Problem. Citeseer. 2010. Available online: https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=f6bc65fe193c8afd779f2867831a869c59554661 (accessed on 10 January 2023).
Duan, R.; Pettie, S. Linear-time approximation for maximum weight matching. J. Acm (JACM) 2014, 61, 1–23. [Google Scholar] [CrossRef]
Wu, B.; Li, L. Solving maximum weighted matching on large graphs with deep reinforcement learning. Inf. Sci. 2022, 614, 400–415. [Google Scholar] [CrossRef]
Fujita, N.; Chauvet, N.; Röhm, A.; Horisaki, R.; Li, A.; Hasegawa, M.; Naruse, M. Efficient Pairing in Unknown Environments: Minimal Observations and TSP-based Optimization. IEEE Access 2022, 10, 57630–57640. [Google Scholar] [CrossRef]
Williams, V.V. Multiplying matrices faster than Coppersmith-Winograd. In Proceedings of the Forty-Fourth Annual ACM Symposium on Theory of Computing, New York, NY, USA, 20–22 May 2012; pp. 887–898. [Google Scholar]
Kuhn, H.W. The Hungarian method for the assignment problem. Nav. Res. Logist. Q. 1955, 2, 83–97. [Google Scholar] [CrossRef] [Green Version]
Bertsekas, D.P. The auction algorithm: A distributed relaxation method for the assignment problem. Ann. Oper. Res. 1988, 14, 105–123. [Google Scholar] [CrossRef] [Green Version]
Munkres, J. Algorithms for the assignment and transportation problems. J. Soc. Ind. Appl. Math. 1957, 5, 32–38. [Google Scholar] [CrossRef] [Green Version]
Goldberg, A.; Radzik, T. A Heuristic Improvement of the Bellman-Ford Algorithm; Technical Report; STANFORD UNIV CA DEPT OF COMPUTER SCIENCE: Stanford, CA, USA, 1993. [Google Scholar]
Aldababsa, M.; Toka, M.; Gökçeli, S.; Kurt, G.K.; Kucur, O. A tutorial on nonorthogonal multiple access for 5G and beyond. Wirel. Commun. Mob. Comput. 2018, 2018, 9713450. [Google Scholar] [CrossRef] [Green Version]
Ding, Z.; Fan, P.; Poor, H.V. Impact of user pairing on 5G nonorthogonal multiple-access downlink transmissions. IEEE Trans. Veh. Technol. 2015, 65, 6010–6023. [Google Scholar] [CrossRef]
Chen, L.; Ma, L.; Xu, Y. Proportional fairness-based user pairing and power allocation algorithm for non-orthogonal multiple access system. IEEE Access 2019, 7, 19602–19615. [Google Scholar] [CrossRef]
Ali, Z.; Khan, W.U.; Ihsan, A.; Waqar, O.; Sidhu, G.A.S.; Kumar, N. Optimizing resource allocation for 6G NOMA-enabled cooperative vehicular networks. IEEE Open J. Intell. Transp. Syst. 2021, 2, 269–281. [Google Scholar] [CrossRef]
Zhang, H.; Duan, Y.; Long, K.; Leung, V.C. Energy efficient resource allocation in terahertz downlink NOMA systems. IEEE Trans. Commun. 2020, 69, 1375–1384. [Google Scholar] [CrossRef]
Shahab, M.B.; Irfan, M.; Kader, M.F.; Young Shin, S. User pairing schemes for capacity maximization in non-orthogonal multiple access systems. Wirel. Commun. Mob. Comput. 2016, 16, 2884–2894. [Google Scholar] [CrossRef] [Green Version]
Zhu, L.; Zhang, J.; Xiao, Z.; Cao, X.; Wu, D.O. Optimal user pairing for downlink non-orthogonal multiple access (NOMA). IEEE Wirel. Commun. Lett. 2018, 8, 328–331. [Google Scholar] [CrossRef]
Higuchi, K.; Benjebbour, A. Non-orthogonal multiple access (NOMA) with successive interference cancellation for future radio access. IEICE Trans. Commun. 2015, 98, 403–414. [Google Scholar] [CrossRef] [Green Version]
Halim, A.H.; Ismail, I. Combinatorial optimization: Comparison of heuristic algorithms in travelling salesman problem. Arch. Comput. Methods Eng. 2019, 26, 367–380. [Google Scholar] [CrossRef]
Galil, Z. Efficient algorithms for finding maximum matching in graphs. ACM Comput. Surv. (CSUR) 1986, 18, 23–38. [Google Scholar] [CrossRef]

Figure 1. A schematic illustration of the relationship among

H_{0}, H_{1}, H_{2}

and

H_{3}

.

Figure 1. A schematic illustration of the relationship among

H_{0}, H_{1}, H_{2}

and

H_{3}

.

Figure 2. Schematic illustration of the three heuristic pairing optimization algorithms tested in the simulation. Case (i) (blue) applies the combining algorithm directly to the ground-truth compatibility matrix

C^{g}

. Case (ii) (red) first applies the observation algorithm to obtain an estimated compatibility matrix

C^{e_{1}}

, followed by the combining algorithm. Case (iii) (yellow) first estimates the compatibility from observation (

C^{e_{1}}

), followed by the variance optimization (

C^{e_{2}}

), and then executes the combining algorithm.

Figure 2. Schematic illustration of the three heuristic pairing optimization algorithms tested in the simulation. Case (i) (blue) applies the combining algorithm directly to the ground-truth compatibility matrix

C^{g}

. Case (ii) (red) first applies the observation algorithm to obtain an estimated compatibility matrix

C^{e_{1}}

, followed by the combining algorithm. Case (iii) (yellow) first estimates the compatibility from observation (

C^{e_{1}}

), followed by the variance optimization (

C^{e_{2}}

), and then executes the combining algorithm.

Figure 3. Comparison of the achieved total compatibility for Flows (i), (ii), and (iii), as described in the caption for Figure 2. Each graph shows the mean and standard deviation of the performance of 100 different compatibility matrices with each given number of elements, simulated under (a) uniform distributions and (b) Poisson distributions.

Figure 4. Comparison of the variance of the compatibility matrices of

C^{g}

,

C^{e_{1}}

,

C^{e_{2}}

as a function of the number of elements in the system under (a) uniform distributions and (b) Poisson distributions.

Figure 4. Comparison of the variance of the compatibility matrices of

C^{g}

,

C^{e_{1}}

,

C^{e_{2}}

as a function of the number of elements in the system under (a) uniform distributions and (b) Poisson distributions.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Fujita, N.; Röhm, A.; Mihana, T.; Horisaki, R.; Li, A.; Hasegawa, M.; Naruse, M. Pairing Optimization via Statistics: Algebraic Structure in Pairing Problems and Its Application to Performance Enhancement. Entropy 2023, 25, 146. https://doi.org/10.3390/e25010146

AMA Style

Fujita N, Röhm A, Mihana T, Horisaki R, Li A, Hasegawa M, Naruse M. Pairing Optimization via Statistics: Algebraic Structure in Pairing Problems and Its Application to Performance Enhancement. Entropy. 2023; 25(1):146. https://doi.org/10.3390/e25010146

Chicago/Turabian Style

Fujita, Naoki, André Röhm, Takatomo Mihana, Ryoichi Horisaki, Aohan Li, Mikio Hasegawa, and Makoto Naruse. 2023. "Pairing Optimization via Statistics: Algebraic Structure in Pairing Problems and Its Application to Performance Enhancement" Entropy 25, no. 1: 146. https://doi.org/10.3390/e25010146

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Pairing Optimization via Statistics: Algebraic Structure in Pairing Problems and Its Application to Performance Enhancement

Abstract

1. Introduction

2. Problem Setting

2.1. Pairing Optimization Problem

2.2. Limited Observation Constraint

3. Mathematical Properties of the Pairing Problem

3.1. Adjacent Set

3.2. Equivalence Class

3.3. Mean and Covariance

4. Variance Optimization

4.1. Performance Degradation through the Observation Phase

4.2. Transforming the Compatibility Matrix with Minimized Variance

5. Simulation

5.1. Setting

5.2. Simulation Flow

5.3. Performance

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Matrix Form of Conserved Quantities

Appendix B. Computational Time

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI