The Superiority of Quantum Strategy in 3-Player Prisoner’s Dilemma

Dong, Zhiyuan; Wu, Ai-Guo

doi:10.3390/math9121443

Open AccessArticle

The Superiority of Quantum Strategy in 3-Player Prisoner’s Dilemma

by

Zhiyuan Dong

^†

and

Ai-Guo Wu

^*,†

School of Mechanical Engineering and Automation, Harbin Institute of Technology (Shenzhen), Shenzhen 150001, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics 2021, 9(12), 1443; https://doi.org/10.3390/math9121443

Submission received: 27 April 2021 / Revised: 8 June 2021 / Accepted: 18 June 2021 / Published: 21 June 2021

(This article belongs to the Special Issue Advances in Quantum Field Theory and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, we extend the quantum game theory of Prisoner’s Dilemma to the N-player case. The final state of quantum game theory of N-player Prisoner’s Dilemma is derived, which can be used to investigate the payoff of each player. As demonstration, two cases (2-player and 3-player) are studied to illustrate the superiority of quantum strategy in the game theory. Specifically, the non-unique entanglement parameter is found to maximize the total payoff, which oscillates periodically. Finally, the optimal strategic set is proved to depend on the selection of initial states.

Keywords:

quantum game theory; quantum entanglement; quantum strategy; Prisoner’s Dilemma

1. Introduction

Game theory is the study of mathematical models of strategic selection among rational decision-makers [1,2]. It has been widely applied to many fields, such as economics, social science, information technology, systems theory and computer science. Due to the rapid development of precision instrument manufacture, quantum game theory attracts more and more attention and shows its superiority in many research fields [3,4,5,6,7,8].

Quantum game theory is an extension of classical game theory to the quantum category. In contrast to classical game theory, the states of quantum game theory are superposed on many basis states of the corresponding Hilbert space, which can be further entangled by quantum manipulation. This manipulation follows the quantum principles developed in [9,10]. Roughly speaking, the choices of “Cooperate” and “Defect” in the Prisoner’s Dilemma can be regarded as a two-level quantum bit (qubit) with two possible states (e.g.,

| 0 〉

and

| 1 〉

in a 2-dimensional Hilbert space) in quantum game theory. Each player in this quantum Prisoner’s Dilemma has their own qubit and can only manipulate it without communications. These relatively independent qubits are entangled by a quantum gate, which is known to all players. The quantum theory of the Prisoner’s Dilemma, pioneered by Eisert et al. [11], has been extensively studied in [12,13,14]. Particularly, the multiplayer quantum game is considered in [13], which points out the possibility of a quantum game employed in the architecture of quantum computers. A review of theoretical and experimental developments in quantum game theory is given in [14], together with their role in the development of quantum algorithms and communication protocols.

Recently, the superiority of quantum strategy has been shown to solve the difficulties of reaching a Pareto optimal in the classical games. For example, multipartite zero-sum game with quantum settings have been considered in [15]. The quantum single-photon states are employed to prepare the strategy, which realizes tripartite quantum fair zero-sum games with Nash equilibrium. Nash equilibria and correlated equilibria for classical and quantum games have been discussed in [16] with their Pareto efficiency. The advantages of quantum mixed Pauli strategies are shown to make the games close to Pareto optimal. In [5], all possible variants of the PQ penny flip game have been investigated, which constructs a semiautomaton that captures the corresponding intrinsic behaviors. New concepts of winning automaton and complete automaton for each player are also proposed. The classical Prisoner’s Dilemma associated with quantum automata is considered in [6], which presents a quantum version of conditional strategy and its performance analysis. Besides, the quantum Prisoner’s Dilemma game has been proposed to study the food loss and waste in a two-echelon food supply chain [17]. Both the classical game and the separable quantum game are proved to be useless for the Pareto optimal strategy. However, it can be achieved in the context of maximally entangled quantum game. The quantum Prisoner’s Dilemma with 3 players is theoretically investigated in [18] and experimentally realized in [19].

In this paper, we extend the quantum theory of the Prisoner’s Dilemma to the N-player case, which exhibits the following features. The total payoff of the game is proved to oscillate periodically with the entanglement parameter. The minimum period of this oscillation is found and the optimal entanglement parameter of maximizing the total payoff is not unique. Besides, the quantum Prisoner’s Dilemma with different initial states is also extensively investigated, which illustrates that the optimal strategic set depends on the selection of initial states. Finally, an invariant optimal strategic set is derived by changing the form of entanglement gate, which yields a “Pareto optimum” of the quantum Prisoner’s Dilemma. Based on the discussions above, a comprehensive study on the 3-player Prisoner’s Dilemma is presented in this paper.

The rest of this paper is organized as follows. We firstly introduce the Prisoner’s Dilemma in the case of quantum game theory in Section 2, then the general form of the N-player Prisoner’s Dilemma is derived. The 2-player Prisoner’s Dilemma is briefly discussed in Section 3, and it is shown that the quantum strategy has no advantage without entanglement in the game. The total payoffs of the 3-player Prisoner’s Dilemma with respect to several parameters are extensively presented in Section 4, including the initial state, the choices of other players, and the entanglement gate. Section 5 concludes this paper.

Notation.

i = \sqrt{- 1}

.

| ϕ 〉

is the state of the game, which can be mathematically described by a column vector.

{\hat{σ}}_{x}

,

{\hat{σ}}_{y}

, and

{\hat{σ}}_{z}

are Pauli operators.

Z^{+}

denotes the set of positive integers. The adjoint operator or complex conjugate transpose is denoted by †, i.e.,

X^{†} = {(X^{*})}^{T}

. Finally, ⊗ means the tensor product.

2. The General Case

In this section, we discuss the Prisoner’s Dilemma with N players. Assume the N players are arrested and they cannot communicate with each other. The police separately question each of them, and they can choose “Cooperate” or “Defect”. Each player does not know the other players’ choices. We assume that each of them cares more for their own freedom (payoff) than the total welfare of their accomplices. Normally, choosing “Defect” can give each player more payoff than choosing “Cooperate”. Two explicit payoff tables are given as examples in the following sections. In the quantum game theory,

| 0 〉

and

| 1 〉

denote the choices of “Cooperate” and “Defect”, which are mathematically represented by the two bases of a 2-dimensional Hilbert space. The strategy is under player’s control. Each of the players can choose “Cooperate” (

| 0 〉

) or “Defect” (

| 1 〉

), which corresponds to the manipulation on the initial state. For example, we have

i {\hat{σ}}_{x} | 0 〉 = i | 1 〉

,

i {\hat{σ}}_{x} | 1 〉 = i | 0 〉

. Similarly,

i {\hat{σ}}_{y} | 0 〉 = - | 1 〉

,

i {\hat{σ}}_{y} | 1 〉 = | 0 〉

. That is, the Pauli operators,

{\hat{σ}}_{x}

and

{\hat{σ}}_{y}

, can be used to swap the choices from “Cooperate” to “Defect”, and vice versa.

We denote the N players by

z_{1}, z_{2}, \dots, z_{N}

. The initial state is chosen to be

| ψ_{ini} 〉 = {\hat{J}}_{x} | z_{1} z_{2} \dots z_{N} 〉,

(1)

where

| z_{j} 〉

can be

| 0 〉

(Cooperate) or

| 1 〉

(Defect),

1 \leq j \leq N

, and the entanglement gate of the game is

{\hat{J}}_{x} = exp \{i \frac{γ}{2} \underset{N copies}{\underset{︸}{{\hat{σ}}_{x} \otimes {\hat{σ}}_{x} \otimes \dots \otimes {\hat{σ}}_{x}}}\}, γ \geq 0 .

(2)

Here, the entanglement gate

{\hat{J}}_{x}

is known to all of the N players. Particularly,

γ

denotes the entanglement parameter and

γ = 0

means the separated quantum game.

The strategic move of each player

z_{i}

,

i = 1, 2, \dots, N

, is denoted by

\hat{U} (θ, ϕ) = [\begin{matrix} cos \frac{θ}{2} & e^{i ϕ} sin \frac{θ}{2} \\ - e^{- i ϕ} sin \frac{θ}{2} & cos \frac{θ}{2} \end{matrix}],

(3)

where

0 \leq θ \leq π

and

0 \leq ϕ \leq \frac{π}{2}

. A more general form of the unitary operator

\hat{U}

can be found in [20], Equation (7). Clearly, we have

\hat{U} (0, 0) = \hat{I}

, which means that the player

z_{j}

keeps to choose the original choice

| z_{j} 〉

,

1 \leq j \leq N

; while

\hat{U} (π, \frac{π}{2}) = i {\hat{σ}}_{x}

,

\hat{U} (π, 0) = i {\hat{σ}}_{y}

, which denote that the player swaps the original choice.

After measurement, the final state is

| ψ_{fin} 〉 = {\hat{J}}_{x}^{†} ({\hat{U}}_{1} \otimes {\hat{U}}_{2} \otimes \dots \otimes {\hat{U}}_{N}) {\hat{J}}_{x} | z_{1} z_{2} \dots z_{N} 〉 .

(4)

The succeeding measurement yields a particular result with a certain probability. Therefore the payoff of player

z_{j}

,

1 \leq j \leq N

, should be the expected payoff

P_{j} = \sum_{k = 1}^{2^{N}} s_{j, k} P_{| z_{1} z_{2} \dots z_{N} 〉}, z_{1}, z_{2}, \dots, z_{N} \in {0, 1},

(5)

where

P_{| z_{1} z_{2} \dots z_{N} 〉} = {|〈 z_{1} z_{2} \dots z_{N} | ψ_{fin} 〉|}^{2}

, which means the probability of collapsing the final state

| ψ_{fin} 〉

to

| z_{1} z_{2} \dots z_{N} 〉

. Clearly, we have

\sum_{z_{1} z_{2}, \dots z_{N}}^{2^{N}} P_{| z_{1} z_{2} \dots z_{N} 〉} = 1

.

s_{j, k}

is the payoff of player

z_{j}

with all the

2^{N}

possible states

| z_{1} z_{2} \dots z_{N} 〉

, and

z_{1}, z_{2}, \dots, z_{N} \in {0, 1}

. Inserting the final state (4) into the expected payoff (5), yields the payoff of player

z_{j}

P_{j} = \sum_{k = 1}^{2^{N}} s_{j, k} {|〈 z_{1} z_{2} \dots z_{N} | {\hat{J}}_{x}^{†} ({\hat{U}}_{1} \otimes {\hat{U}}_{2} \otimes \dots \otimes {\hat{U}}_{N}) {\hat{J}}_{x} | z_{1} z_{2} \dots z_{N} 〉|}^{2},

(6)

where

j = 1, \dots, N

, and

z_{1}, z_{2}, \dots, z_{N} \in {0, 1}

.

3. The 2-Player Prisoner’s Dilemma

In this section, we mainly discuss the case of 2-player Prisoner’s Dilemma. The payoff matrix of the two players, Alice and Bob, is given in Figure 1. To be specific, strategy C means that the player remains silent, while strategy D denotes that the player confesses. If both of them choose strategy C (remain silent), each of them will get the payoff 3; if both of them choose strategy D (confess), each of them will get the payoff 1. On the other hand, if Alice chooses strategy C and Bob chooses strategy D, Alice will get the payoff 0 and 5 will be the payoff of Bob, and vice versa.

Since Alice and Bob cannot communicate with each other, strategy D is the dominant choice for each of them no matter which strategy the other one chooses. In terms of classical game theory, the strategic set

(D, D)

is the unique Nash equilibrium of the game and each of the two players will get the payoff 1.

In what follows, we firstly focus on the separated quantum game, i.e.,

γ = 0

in (2), which results in

{\hat{J}}_{x} = \hat{I}

. Assume that the initial state is

| ψ_{ini} 〉 = {\hat{J}}_{x} | 00 〉 = | 00 〉 .

(7)

Alice chooses the quantum strategy

{\hat{U}}_{A} (θ, ϕ)

, and Bob chooses the classical strategy D, which can be represented by

{\hat{U}}_{B} (π, \frac{π}{2})

, i.e.,

{\hat{U}}_{B} (π, \frac{π}{2}) | 0 〉 = i | 1 〉

. After time evolution, the final state can be calculated as

| ψ_{fin} 〉 = {\hat{J}}_{x}^{†} ({\hat{U}}_{A} (θ, ϕ) \otimes {\hat{U}}_{B} (π, \frac{π}{2})) {\hat{J}}_{x} | 00 〉 = i cos \frac{θ}{2} | 01 〉 - i e^{- i ϕ} sin \frac{θ}{2} | 11 〉 .

(8)

By the expected payoff given in (5), one can obtain the payoff of Alice

P_{A} = 0 \times {|i cos \frac{θ}{2}|}^{2} + 1 \times {|- i e^{- i ϕ} sin \frac{θ}{2}|}^{2} = {sin}^{2} \frac{θ}{2} .

(9)

Clearly, Alice will choose

θ = π

to maximize the payoff, which corresponds to the quantum strategies

{\hat{U}}_{A} (π, \frac{π}{2}) = i {\hat{σ}}_{x}

or

{\hat{U}}_{A} (π, 0) = i {\hat{σ}}_{y}

. As a result, Alice also swaps the initial state

| 0 〉

and choose the strategy

| 1 〉

, which leads to the classical Nash equilibrium

(D, D)

. Indeed, even if both of the two players choose the quantum strategy (3), all the resulting Nash equilibria are the same as the classical strategic set

(D, D)

.

On the other hand, we turn to the maximum entangled case, i.e.,

γ = \frac{π}{2}

in (2). In this case, both Alice and Bob choose the same strategies as the separate quantum game above. After time evolution, the final state is given by

\begin{matrix} | ψ_{fin} 〉 & = {\hat{J}}_{x}^{†} ({\hat{U}}_{A} (θ, ϕ) \otimes {\hat{U}}_{B} (π, \frac{π}{2})) {\hat{J}}_{x} | 00 〉 \\ = - cos ϕ sin \frac{θ}{2} | 00 〉 + i cos \frac{θ}{2} | 01 〉 - sin ϕ sin \frac{θ}{2} | 11 〉 . \end{matrix}

(10)

By the expected payoff given in (5), the payoff of Alice is given by

\begin{matrix} P_{A} & = 3 \times {|- cos ϕ sin \frac{θ}{2}|}^{2} + 0 \times {|i cos \frac{θ}{2}|}^{2} + 1 \times {|- sin ϕ sin \frac{θ}{2}|}^{2} \\ = (1 + 2 {cos}^{2} ϕ) {sin}^{2} \frac{θ}{2} \leq 3 . \end{matrix}

(11)

Consequently, Alice will choose

{\hat{U}}_{A} (π, 0)

to maximize the payoff, which leads to the quantum strategy

i {\hat{σ}}_{y}

. Considering the symmetry of the game,

i {\hat{σ}}_{y}

is also the optimal strategy for Bob. Thus,

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

is a Nash equilibrium in the maximally entangled case and

P_{A} (i {\hat{σ}}_{y}, i {\hat{σ}}_{y}) = P_{B} (i {\hat{σ}}_{y}, i {\hat{σ}}_{y}) = 3 .

(12)

Notice that no improvement of the payoff can be made by deviating from the strategic set

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

, which yields a “Pareto optimum”.

4. The 3-Player Prisoner’s Dilemma

In this section, we consider the case of a 3-player Prisoner’s Dilemma. Alice, Bob and Colin are separated and cannot communicate with each other. The strategies C, D mean that the player remains silent and confesses, respectively. The payoff matrix [18] of the three players is given with three numbers in triplets. The first number in the parenthesis denotes the payoff of Alice, the second number denotes the payoff of Bob, and the third one denotes the payoff of Colin. To be specific, if they all choose the strategy C, each of them will get the payoff 3, i.e.,

(C, C, C) \mapsto (3, 3, 3)

; on the other hand, if they all choose the strategy D, each of them will get the payoff 1, i.e.,

(D, D, D) \mapsto (1, 1, 1)

. Moreover, if one of them chooses the strategy C and the others choose the strategy D, the former will get the payoff 0 and the latter will get the payoff 4, e.g.,

(C, D, D) \mapsto (0, 4, 4)

(or

(D, C, D) \mapsto (4, 0, 4)

,

(D, D, C) \mapsto (4, 4, 0)

); if one of them chooses the strategy D and the others choose the strategy C, 5 is the payoff of the former and 2 is the payoff of the latter, e.g.,

(D, C, C) \mapsto (5, 2, 2)

(or

(C, D, C) \mapsto (2, 5, 2)

,

(C, C, D) \mapsto (2, 2, 5)

).

Again, the dominant strategy for each of them is still the strategy D, i.e., choosing “defect” is better than “cooperate” to earn more payoff no matter what strategies the other two players choose. Due to the symmetry of the game, the strategic set

(D, D, D)

is a Nash equilibrium. However, it is obviously not a “Pareto optimum”. In what follows, we introduce the quantum strategy and investigate the payoff of each player.

4.1. The Separated Case

In this section, we firstly consider the separated case, i.e.,

γ = 0

. Assume that the initial state is given by

| Ψ_{ini} 〉 = {\hat{J}}_{x} | 000 〉 = | 000 〉,

(13)

and both Bob and Colin choose the strategy

{\hat{U}}_{B} (π, \frac{π}{2}) = {\hat{U}}_{C} (π, \frac{π}{2}) = i {\hat{σ}}_{x}

. For comparison, Alice chooses the quantum strategy

{\hat{U}}_{A} (θ, ϕ)

. After time evolution, the final state can be calculated as

\begin{matrix} | Ψ_{fin} 〉 & = ({\hat{U}}_{A} (θ, ϕ) \otimes {\hat{U}}_{B} (π, \frac{π}{2}) \otimes {\hat{U}}_{C} (π, \frac{π}{2})) | 000 〉 \\ = - cos \frac{θ}{2} | 011 〉 + e^{- i ϕ} sin \frac{θ}{2} | 111 〉 . \end{matrix}

(14)

According to the payoff matrix given in the 3-player case, the payoff of Alice is given by

P_{A} = 0 \times {|- cos \frac{θ}{2}|}^{2} + 1 \times {|e^{- i ϕ} sin \frac{θ}{2}|}^{2} = {sin}^{2} \frac{θ}{2} .

(15)

Thus, it is better to choose

θ = π

for Alice to maximize the payoff, which corresponding to the strategy

{\hat{U}}_{A} (π, \frac{π}{2}) = i {\hat{σ}}_{x}

or

{\hat{U}}_{A} (π, 0) = i {\hat{σ}}_{y}

. As a result, the total game attains a Nash equilibrium

(D, D, D)

, which is not a “Pareto optimum”.

4.2. The Entanglement Parameter

If

γ \neq 0

in (2), then the payoffs of all players are connected by the entanglement gate. In this section, we firstly investigate the maximal entanglement parameter by considering the payoff of Alice. The initial state is fixed to be

| Ψ_{ini} 〉 = {\hat{J}}_{x} | 000 〉,

(16)

where the entanglement gate is given by (2) with

γ \neq 0

. According to prior knowledge in the 2-player Prisoner’s Dilemma discussed above, it has been concluded that both

i {\hat{σ}}_{x}

and

i {\hat{σ}}_{y}

can be used to swap the initial state

| 0 〉

and choose the strategy

| 1 〉

. However, only

i {\hat{σ}}_{y}

can make the game reach a “Pareto optimum” in the maximum entangled case. In this section, the strategic sets are respectively denoted by

(i {\hat{σ}}_{x}, i {\hat{σ}}_{x}, i {\hat{σ}}_{x})

and

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

for comparison. Then the final state of the game can be calculated by

\begin{matrix} | Ψ_{fin} (i {\hat{σ}}_{x}, i {\hat{σ}}_{x}, i {\hat{σ}}_{x}) 〉 = {\hat{J}}_{x}^{†} (i {\hat{σ}}_{x} \otimes i {\hat{σ}}_{x} \otimes i {\hat{σ}}_{x}) {\hat{J}}_{x} | 000 〉 = - i | 111 〉, \\ | Ψ_{fin} (i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y}) 〉 = {\hat{J}}_{x}^{†} (i {\hat{σ}}_{y} \otimes i {\hat{σ}}_{y} \otimes i {\hat{σ}}_{y}) {\hat{J}}_{x} | 000 〉 = i sin γ | 000 〉 - cos γ | 111 〉, \end{matrix}

(17)

which yields the corresponding payoffs of Alice for the two cases

\begin{matrix} P_{A} (i {\hat{σ}}_{x}, i {\hat{σ}}_{x}, i {\hat{σ}}_{x}) = 1 \\ P_{A} (i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y}) = 3 \times {|i sin γ|}^{2} + 1 \times {|- cos γ|}^{2} = 2 {sin}^{2} γ + 1 . \end{matrix}

(18)

In Figure 2, the payoffs of Alice with the two strategic sets are simulated. It can be confirmed that

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

is the optimal strategic set, which enables the game to attain a “Pareto optimum”. Moreover, the maximal entanglement parameter

γ

is not unique. In Figure 2, the payoff oscillates periodically and reaches its maximum at

γ = \frac{(2 k - 1) π}{2}

,

k \in Z^{+}

. In what follows, we mainly discuss the “Pareto optimum” of the game based on two initial states (

{\hat{J}}_{x} | 000 〉

and

{\hat{J}}_{x} | 111 〉

) with the maximally entangled gate, e.g.,

γ = \frac{π}{2}

.

4.2.1. The Initial State ${\hat{J}}_{x} | 000 〉$

Assume that the initial state is given by

| Ψ_{ini} 〉 = {\hat{J}}_{x} | 000 〉,

(19)

and the maximally entangled gate is

{\hat{J}}_{x} = exp \{i \frac{π}{4} {\hat{σ}}_{x} \otimes {\hat{σ}}_{x} \otimes {\hat{σ}}_{x}\} .

(20)

Alice chooses the quantum strategy

{\hat{U}}_{A} (θ, ϕ)

, while Bob and Colin choose the strategy

{\hat{U}}_{B} (π, \frac{π}{2}) = {\hat{U}}_{C} (π, \frac{π}{2}) = i {\hat{σ}}_{x}

. After time evolution, the final state is given by

\begin{matrix} | Ψ_{fin} 〉 & = {\hat{J}}_{x}^{†} ({\hat{U}}_{A} (θ, ϕ) \otimes {\hat{U}}_{B} (π, \frac{π}{2}) \otimes {\hat{U}}_{C} (π, \frac{π}{2})) {\hat{J}}_{x} | 000 〉 \\ = - i cos ϕ sin \frac{θ}{2} | 000 〉 - cos \frac{θ}{2} | 011 〉 - i sin ϕ sin \frac{θ}{2} | 111 〉 . \end{matrix}

(21)

According to the payoff matrix given in the 3-player case, the payoff of Alice is given by

\begin{matrix} P_{A} & = 3 \times {|- i cos ϕ sin \frac{θ}{2}|}^{2} + 0 \times {|cos \frac{θ}{2}|}^{2} + 1 \times {|- i sin ϕ sin \frac{θ}{2}|}^{2} \\ = (1 + 2 {cos}^{2} ϕ) {sin}^{2} \frac{θ}{2} \leq 3 . \end{matrix}

(22)

Thus, it is better to choose

θ = π

,

ϕ = 0

for Alice to maximize the payoff, which corresponding to the strategy

{\hat{U}}_{A} (π, 0) = i {\hat{σ}}_{y}

. Similarly, in the case where other two players choose the strategy

i {\hat{σ}}_{x}

, the maximum payoff of Bob and Colin can be derived as

\begin{matrix} P_{B} (i {\hat{σ}}_{x}, {\hat{U}}_{B} (θ, ϕ), i {\hat{σ}}_{x}) \leq P_{B} (i {\hat{σ}}_{x}, i {\hat{σ}}_{y}, i {\hat{σ}}_{x}), \\ P_{C} (i {\hat{σ}}_{x}, i {\hat{σ}}_{x}, {\hat{U}}_{C} (θ, ϕ)) \leq P_{C} (i {\hat{σ}}_{x}, i {\hat{σ}}_{x}, i {\hat{σ}}_{y}) . \end{matrix}

(23)

Due to the symmetry property of the game, it can be verified that the optimal strategic set is

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

. In this case, the total game reaches a Nash equilibrium

(C, C, C)

, which is also a “Pareto optimum”.

4.2.2. The Initial State ${\hat{J}}_{x} | 111 〉$

For comparison, in this section we assume that the initial state is given by

| ψ_{ini} 〉 = {\hat{J}}_{x} | 111 〉,

(24)

and the maximally entangled gate has the form (20). All of the three players choose the same strategies as the case discussed above, that is, Alice chooses the quantum strategy

{\hat{U}}_{A} (θ, ϕ)

, while Bob and Colin choose the strategy

{\hat{U}}_{B} (π, \frac{π}{2}) = {\hat{U}}_{C} (π, \frac{π}{2}) = i {\hat{σ}}_{x}

. After time evolution, the final state is given by

\begin{matrix} | Ψ_{fin} 〉 & = {\hat{J}}_{x}^{†} ({\hat{U}}_{A} (θ, ϕ) \otimes {\hat{U}}_{B} (π, \frac{π}{2}) \otimes {\hat{U}}_{C} (π, \frac{π}{2})) {\hat{J}}_{x} | 111 〉 \\ = - i sin ϕ sin \frac{θ}{2} | 000 〉 - cos \frac{θ}{2} | 100 〉 + i cos ϕ sin \frac{θ}{2} | 111 〉 . \end{matrix}

(25)

According to the payoff matrix given in the 3-player case, the payoff of Alice is given by

\begin{matrix} P_{A} & = 3 \times {|- i sin ϕ sin \frac{θ}{2}|}^{2} + 5 \times {|- cos \frac{θ}{2}|}^{2} + 1 \times {|i cos ϕ sin \frac{θ}{2}|}^{2} \\ = 5 + 2 ({sin}^{2} ϕ - 2) {sin}^{2} \frac{θ}{2} \leq 5 . \end{matrix}

(26)

Thus, it is better to choose

θ = 0

for Alice to maximize the payoff, which corresponding to the strategy

{\hat{U}}_{A} (0, ϕ) = \hat{I}

. Similarly, when the other two players choose the strategy

i {\hat{σ}}_{x}

, the maximum payoff of Bob and Colin can be derived as

\begin{matrix} P_{B} (i {\hat{σ}}_{x}, {\hat{U}}_{B} (θ, ϕ), i {\hat{σ}}_{x}) \leq P_{B} (i {\hat{σ}}_{x}, \hat{I}, i {\hat{σ}}_{x}), \\ P_{C} (i {\hat{σ}}_{x}, i {\hat{σ}}_{x}, {\hat{U}}_{C} (θ, ϕ)) \leq P_{C} (i {\hat{σ}}_{x}, i {\hat{σ}}_{x}, \hat{I}) . \end{matrix}

(27)

However, if all of the three players choose the quantum strategy

\hat{I}

, the total game attains at a Nash equilibrium

(D, D, D)

, which is not a “Pareto optimum”. Consequently, any player choosing

i {\hat{σ}}_{x}

is not a proper way to obtain a Nash equilibrium for the maximally entangled game. Therefore, in what follows we focus on the case of the other two players choosing the strategy

i {\hat{σ}}_{y}

, instead of

i {\hat{σ}}_{x}

.

4.3. The Case of the Other Two Players Choosing $i {\hat{σ}}_{y}$

Firstly, we assume that the initial state is

| ψ_{ini} 〉 = {\hat{J}}_{x} | 000 〉

, and Alice chooses the quantum strategy

{\hat{U}}_{A} (θ, ϕ)

, while Bob and Colin choose the strategy

{\hat{U}}_{B} (π, 0) = {\hat{U}}_{C} (π, 0) = i {\hat{σ}}_{y}

. After time evolution, the final state is given by

\begin{matrix} | Ψ_{fin} 〉 & = {\hat{J}}_{x}^{†} ({\hat{U}}_{A} (θ, ϕ) \otimes {\hat{U}}_{B} (π, 0) \otimes {\hat{U}}_{C} (π, 0)) {\hat{J}}_{x} | 000 〉 \\ = i cos ϕ sin \frac{θ}{2} | 000 〉 + cos \frac{θ}{2} | 011 〉 + i sin ϕ sin \frac{θ}{2} | 111 〉 . \end{matrix}

(28)

According to the payoff matrix given in the 3-player case, the payoff of Alice is given by

\begin{matrix} P_{A} & = 3 \times {|i cos ϕ sin \frac{θ}{2}|}^{2} + 0 \times {|cos \frac{θ}{2}|}^{2} + 1 \times {|i sin ϕ sin \frac{θ}{2}|}^{2} \\ = (1 + 2 {cos}^{2} ϕ) {sin}^{2} \frac{θ}{2} \leq 3 . \end{matrix}

(29)

Thus, Alice will choose

θ = π

,

ϕ = 0

to maximize the payoff, which corresponding to the strategy

{\hat{U}}_{A} (π, 0) = i {\hat{σ}}_{y}

. Indeed, in the case of the other two players choose the strategy

i {\hat{σ}}_{y}

, the payoff of Bob and Colin can be calculated as

\begin{matrix} P_{B} (i {\hat{σ}}_{y}, {\hat{U}}_{B} (θ, ϕ), i {\hat{σ}}_{y}) \leq P_{B} (i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y}), \\ P_{C} (i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, {\hat{U}}_{C} (θ, ϕ)) \leq P_{C} (i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y}) . \end{matrix}

(30)

As a result, all of the three players will get the payoff

P_{A} = P_{B} = P_{C} = 3

, which is a Nash equilibrium

(C, C, C)

and also a “Pareto optimum”.

Secondly, we assume that the initial state is

| ψ_{ini} 〉 = {\hat{J}}_{x} | 111 〉

, and Alice, Bob and Colin choose the same strategy as the discussions above. After time evolution, the final state is given by

\begin{matrix} | Ψ_{fin} 〉 & = {\hat{J}}_{x}^{†} ({\hat{U}}_{A} (θ, ϕ) \otimes {\hat{U}}_{B} (π, 0) \otimes {\hat{U}}_{C} (π, 0)) {\hat{J}}_{x} | 111 〉 \\ = i sin ϕ sin \frac{θ}{2} | 000 〉 + cos \frac{θ}{2} | 100 〉 - i cos ϕ sin \frac{θ}{2} | 111 〉 . \end{matrix}

(31)

According to the payoff matrix given in the 3-player case, the payoff of Alice is

\begin{matrix} P_{A} & = 3 \times {|i sin ϕ sin \frac{θ}{2}|}^{2} + 5 \times {|cos \frac{θ}{2}|}^{2} + 1 \times {|- i cos ϕ sin \frac{θ}{2}|}^{2} \\ = 5 + 2 ({sin}^{2} ϕ - 2) {sin}^{2} \frac{θ}{2} \leq 5 . \end{matrix}

(32)

Thus, Alice will choose

θ = 0

to maximize the payoff, which yields the strategy

{\hat{U}}_{A} (0, ϕ) = \hat{I}

. Moreover, it can be verified that

\begin{matrix} P_{B} (i {\hat{σ}}_{y}, {\hat{U}}_{B} (θ, ϕ), i {\hat{σ}}_{y}) \leq P_{B} (i {\hat{σ}}_{y}, \hat{I}, i {\hat{σ}}_{y}), \\ P_{C} (i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, {\hat{U}}_{C} (θ, ϕ)) \leq P_{C} (i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, \hat{I}) . \end{matrix}

(33)

Consequently, the total game will attain at the Nash equilibrium

(D, D, D)

. In sum, when the initial state is

| ψ_{ini} 〉 = {\hat{J}}_{x} | 000 〉

, we can obtain the optimal strategic set

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

no matter whether the other two players initially choose

i {\hat{σ}}_{x}

or

i {\hat{σ}}_{y}

, which yields a “Pareto optimum”

(C, C, C)

. However, when the initial state is prepared as

| ψ_{ini} 〉 = {\hat{J}}_{x} | 111 〉

, we can only get a Nash equilibrium

(D, D, D)

. That is, the optimal strategic set depends on the initial state of the game.

4.4. The Entanglement Gate

Based on the discussions above, it can be observed that the strategic set

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

is invalid to get a “Pareto optimum”

(C, C, C)

with the entangled gate

{\hat{J}}_{x} = exp \{i \frac{π}{4} {\hat{σ}}_{x} \otimes {\hat{σ}}_{x} \otimes {\hat{σ}}_{x}\} .

(34)

In this section, we aim to seek another entangled gate, which can keep the optimal strategic set to be

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

and yield a “Pareto optimum”

(C, C, C)

under the initial state

| 111 〉

. In what follows we assume that the maximally entangled state is changed to be

{\hat{J}}_{y} = exp \{i \frac{π}{4} {\hat{σ}}_{y} \otimes {\hat{σ}}_{y} \otimes {\hat{σ}}_{y}\} .

(35)

On the one hand, if Alice chooses the quantum strategy

{\hat{U}}_{A} (θ, ϕ)

, while Bob and Colin choose the strategy

{\hat{U}}_{B} (π, \frac{π}{2}) = {\hat{U}}_{C} (π, \frac{π}{2}) = i {\hat{σ}}_{x}

, then the final state after time evolution is given by

\begin{matrix} | Ψ_{fin} 〉 & = {\hat{J}}_{y}^{†} ({\hat{U}}_{A} (θ, ϕ) \otimes {\hat{U}}_{B} (π, \frac{π}{2}) \otimes {\hat{U}}_{C} (π, \frac{π}{2})) {\hat{J}}_{y} | 111 〉 \\ = - cos ϕ sin \frac{θ}{2} | 000 〉 - cos \frac{θ}{2} | 100 〉 + i sin ϕ sin \frac{θ}{2} | 111 〉 . \end{matrix}

(36)

According to the payoff matrix given in the 3-player case, the payoff of Alice is given by

\begin{matrix} P_{A} & = 3 \times {|- cos ϕ sin \frac{θ}{2}|}^{2} + 0 \times {|- cos \frac{θ}{2}|}^{2} + 1 \times {|i sin ϕ sin \frac{θ}{2}|}^{2} \\ = (1 + 2 {cos}^{2} ϕ) {sin}^{2} \frac{θ}{2} \leq 3 . \end{matrix}

(37)

Thus, Alice will choose

θ = π

,

ϕ = 0

to maximize the payoff, which corresponding to the strategy

{\hat{U}}_{A} (π, 0) = i {\hat{σ}}_{y}

. Due to the symmetry among the three players, the inequalities (23) also hold, which means that

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

is the optimal strategic set.

On the other hand, if Alice chooses the quantum strategy

{\hat{U}}_{A} (θ, ϕ)

, while Bob and Colin choose the strategy

{\hat{U}}_{B} (π, 0) = {\hat{U}}_{C} (π, 0) = i {\hat{σ}}_{y}

, then the payoff of Alice can be calculated in a similar way. Moreover, the inequalities (30) hold in this case.

In fact, no matter Bob and Colin initially choose the strategies

{\hat{U}}_{B} (π, \frac{π}{2}) = {\hat{U}}_{C} (π, \frac{π}{2}) = i {\hat{σ}}_{x}

or the strategy

{\hat{U}}_{B} (π, 0) = {\hat{U}}_{C} (π, 0) = i {\hat{σ}}_{y}

, Alice will persist in choosing the strategy

{\hat{U}}_{A} (π, 0) = i {\hat{σ}}_{y}

to maximize the payoff under the entangled gate

{\hat{J}}_{y}

with the initial state

| 111 〉

, see Figure 3.

Consequently, when the initial state is prepared as

| 111 〉

, we can still choose the strategic set

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

to get the “Pareto optimum”

(C, C, C)

of the game by introducing another different maximally entangled state (35).

In this section, a comprehensive study for the 3-player Prisoner’s Dilemma has been presented, which exhibits some interesting features. It should be noted that once the parameter

s_{j, k}

in (5) is fixed, the payoff of player

z_{j}

,

j = 1, \dots, N

, in the N-player Prisoner’s Dilemma can be solved by (6). As a result, those features can be generalized to the N-player case in a similar way.

5. Conclusions

In this paper, the general form of N-player Prisoner’s Dilemma in the quantum game theory has been derived explicitly, and yields the payoff of each player under the range of strategic choices. In addition, we have illustrated the advantages of quantum strategy in game theory by introducing the 2-player and 3-player cases. The entanglement parameter is proved to be non-unique, which can be used to obtain the “Pareto optimum” of the game. To be specific, the 3-player Prisoner’s Dilemma with different initial states is discussed and it has been found that the optimal strategic set depends on the selection of the initial state.

From the point of view of the players, each of them can choose the optimal quantum strategy to maximize the payoff based on the initial state of the game. Compared with the classical case, the advantages of quantum features in game theory are determined by the entanglement parameter. Moreover, considering quantum games with incomplete information is in the perspective of our future research.

Author Contributions

Writing—original draft, Z.D.; Writing—review & editing, A.-G.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (No. 62003111).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used to support the findings of this study are obtained directly from the simulation by the authors.

Conflicts of Interest

The authors declare no conflict of interest.

References

Myerson, R.B. Game Theory; Harvard University Press: Cambridge, MA, USA, 2013. [Google Scholar]
Tadelis, S. Game Theory: An Introduction; Princeton University Press: Princeton, NJ, USA, 2013. [Google Scholar]
Klarreich, E. Playing by quantum rules. Nature 2001, 414, 244–245. [Google Scholar] [CrossRef]
Benjamin, S.C.; Hayden, P.M. Multiplayer quantum games. Phys. Rev. A 2001, 64, 030301. [Google Scholar] [CrossRef] [Green Version]
Andronikos, T.; Sirokofskich, A.; Kastampolidou, K.; Varvouzou, M.; Giannakis, K.; Singh, A. Finite automata capturing winning sequences for all possible variants of the PQ penny flip game. Mathematics 2018, 6, 20. [Google Scholar] [CrossRef] [Green Version]
Giannakis, K.; Theocharopoulou, G.; Papalitsas, C.; Fanarioti, S.; Andronikos, T. Quantum conditional strategies and automata for Prisoners’ Dilemmata under the EWL scheme. Appl. Sci. 2019, 9, 2635. [Google Scholar] [CrossRef] [Green Version]
Accardi, L.; Boukas, A. Von Neumann’s minimax theorem for continuous quantum games. J. Stoch. Anal. 2020, 1, 5. [Google Scholar] [CrossRef]
Andronikos, T.; Sirokofskich, A. The Connection between the PQ Penny Flip Game and the Dihedral Groups. Mathematics 2021, 9, 1115. [Google Scholar] [CrossRef]
Dowling, J.P.; Milburn, G.J. Quantum technology: The second quantum revolution. Philos. Trans. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 2003, 361, 1655–1674. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar]
Eisert, J.; Wilkens, M.; Lewenstein, M. Quantum games and quantum strategies. Phys. Rev. Lett. 1999, 83, 3077–3080. [Google Scholar] [CrossRef] [Green Version]
Benjamin, S.C.; Hayden, P.M. Comment on “Quantum Games and Quantum Strategies”. Phys. Rev. Lett. 2001, 87, 069801. [Google Scholar] [CrossRef] [PubMed] [Green Version]
de Sousa, P.B.M.; Ramos, R.V. Multiplayer quantum games and its application as access controller in architecture of quantum computers. Quantum Inf. Process. 2008, 7, 125–135. [Google Scholar] [CrossRef] [Green Version]
Khan, F.S.; Solmeyer, N.; Balu, R.; Humble, T.S. Quantum games: A review of the history, current state, and interpretation. Quantum Inf. Process. 2018, 17, 1–42. [Google Scholar] [CrossRef] [Green Version]
Cheng, H.M.; Luo, M.X. Tripartite Dynamic Zero-Sum Quantum Games. Entropy 2021, 23, 154. [Google Scholar] [CrossRef] [PubMed]
Szopa, M. Efficiency of Classical and Quantum Games Equilibria. Entropy 2021, 23, 506. [Google Scholar] [CrossRef] [PubMed]
Li, Y.; Zhao, Y.; Fu, J.; Xu, L. Reducing food loss and waste in a two-echelon food supply chain: A quantum game approach. J. Clean. Prod. 2021, 285, 125261. [Google Scholar] [CrossRef]
Du, J.; Li, H.; Xu, X.; Zhou, X.; Han, R. Entanglement enhanced multiplayer quantum games. Phys. Lett. A 2002, 302, 229–233. [Google Scholar] [CrossRef] [Green Version]
Du, J.; Li, H.; Xu, X.; Shi, M.; Wu, J.; Zhou, X.; Han, R. Experimental realization of quantum games on a quantum computer. Phys. Rev. Lett. 2002, 88, 137902. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Dong, Z.; Zhang, G.; Amini, N.H. Single-photon quantum filtering with multiple measurements. Int. J. Adapt. Control Signal Process. 2018, 32, 528–546. [Google Scholar] [CrossRef] [Green Version]

Figure 1. The payoff matrix of the 2-player Prisoner’s Dilemma. The first number in the parenthesis denotes the payoff of Alice, the second number denotes the payoff of Bob.

Figure 2. The red line denotes the payoff of Alice with the strategic set of

(i {\hat{σ}}_{x}, i {\hat{σ}}_{x}, i {\hat{σ}}_{x})

; while the blue curve is that with the strategic set of

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

.

Figure 2. The red line denotes the payoff of Alice with the strategic set of

(i {\hat{σ}}_{x}, i {\hat{σ}}_{x}, i {\hat{σ}}_{x})

; while the blue curve is that with the strategic set of

(i {\hat{σ}}_{y}, i {\hat{σ}}_{y}, i {\hat{σ}}_{y})

.

Figure 3. The payoff of Alice with respect to the strategy parameters

θ

and

ϕ

,

0 \leq θ \leq π

,

0 \leq ϕ \leq \frac{π}{2}

.

Figure 3. The payoff of Alice with respect to the strategy parameters

θ

and

ϕ

,

0 \leq θ \leq π

,

0 \leq ϕ \leq \frac{π}{2}

.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Dong, Z.; Wu, A.-G. The Superiority of Quantum Strategy in 3-Player Prisoner’s Dilemma. Mathematics 2021, 9, 1443. https://doi.org/10.3390/math9121443

AMA Style

Dong Z, Wu A-G. The Superiority of Quantum Strategy in 3-Player Prisoner’s Dilemma. Mathematics. 2021; 9(12):1443. https://doi.org/10.3390/math9121443

Chicago/Turabian Style

Dong, Zhiyuan, and Ai-Guo Wu. 2021. "The Superiority of Quantum Strategy in 3-Player Prisoner’s Dilemma" Mathematics 9, no. 12: 1443. https://doi.org/10.3390/math9121443

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Superiority of Quantum Strategy in 3-Player Prisoner’s Dilemma

Abstract

1. Introduction

2. The General Case

3. The 2-Player Prisoner’s Dilemma

4. The 3-Player Prisoner’s Dilemma

4.1. The Separated Case

4.2. The Entanglement Parameter

4.2.1. The Initial State ${\hat{J}}_{x} | 000 〉$

4.2.2. The Initial State ${\hat{J}}_{x} | 111 〉$

4.3. The Case of the Other Two Players Choosing $i {\hat{σ}}_{y}$

4.4. The Entanglement Gate

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

The Superiority of Quantum Strategy in 3-Player Prisoner’s Dilemma

Abstract

1. Introduction

2. The General Case

3. The 2-Player Prisoner’s Dilemma

4. The 3-Player Prisoner’s Dilemma

4.1. The Separated Case

4.2. The Entanglement Parameter

4.2.1. The Initial State J ^ x | 000 〉

4.2.2. The Initial State J ^ x | 111 〉

4.3. The Case of the Other Two Players Choosing i σ ^ y

4.4. The Entanglement Gate

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.2.1. The Initial State ${\hat{J}}_{x} | 000 〉$

4.2.2. The Initial State ${\hat{J}}_{x} | 111 〉$

4.3. The Case of the Other Two Players Choosing $i {\hat{σ}}_{y}$